neuraloverflow
's Collections
To read
updated
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper
•
2310.11453
•
Published
•
96
Self-RAG: Learning to Retrieve, Generate, and Critique through
Self-Reflection
Paper
•
2310.11511
•
Published
•
74
In-Context Learning Creates Task Vectors
Paper
•
2310.15916
•
Published
•
42
Matryoshka Diffusion Models
Paper
•
2310.15111
•
Published
•
41
Contrastive Prefence Learning: Learning from Human Feedback without RL
Paper
•
2310.13639
•
Published
•
24
Safe RLHF: Safe Reinforcement Learning from Human Feedback
Paper
•
2310.12773
•
Published
•
28
An Image is Worth Multiple Words: Learning Object Level Concepts using
Multi-Concept Prompt Learning
Paper
•
2310.12274
•
Published
•
11
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Paper
•
2310.11441
•
Published
•
26
In-Context Pretraining: Language Modeling Beyond Document Boundaries
Paper
•
2310.10638
•
Published
•
29
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and
Latent Diffusion
Paper
•
2310.03502
•
Published
•
78
How FaR Are Large Language Models From Agents with Theory-of-Mind?
Paper
•
2310.03051
•
Published
•
34
Large Language Models Cannot Self-Correct Reasoning Yet
Paper
•
2310.01798
•
Published
•
33
Enable Language Models to Implicitly Learn Self-Improvement From Data
Paper
•
2310.00898
•
Published
•
23
PixArt-α: Fast Training of Diffusion Transformer for
Photorealistic Text-to-Image Synthesis
Paper
•
2310.00426
•
Published
•
61
Conditional Diffusion Distillation
Paper
•
2310.01407
•
Published
•
20
Vision Transformers Need Registers
Paper
•
2309.16588
•
Published
•
77
Latent Consistency Models: Synthesizing High-Resolution Images with
Few-Step Inference
Paper
•
2310.04378
•
Published
•
19
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper
•
2310.17680
•
Published
•
70
Personas as a Way to Model Truthfulness in Language Models
Paper
•
2310.18168
•
Published
•
5
A Picture is Worth a Thousand Words: Principled Recaptioning Improves
Image Generation
Paper
•
2310.16656
•
Published
•
40
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo
Labelling
Paper
•
2311.00430
•
Published
•
57
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation,
Generation and Editing
Paper
•
2311.00571
•
Published
•
41
Controllable Music Production with Diffusion Models and Guidance
Gradients
Paper
•
2311.00613
•
Published
•
25
De-Diffusion Makes Text a Strong Cross-Modal Interface
Paper
•
2311.00618
•
Published
•
21
The Generative AI Paradox: "What It Can Create, It May Not Understand"
Paper
•
2311.00059
•
Published
•
18
Grounding Visual Illusions in Language: Do Vision-Language Models
Perceive Illusions Like Humans?
Paper
•
2311.00047
•
Published
•
8
CapsFusion: Rethinking Image-Text Data at Scale
Paper
•
2310.20550
•
Published
•
25
Beyond U: Making Diffusion Models Faster & Lighter
Paper
•
2310.20092
•
Published
•
11
LoRAShear: Efficient Large Language Model Structured Pruning and
Knowledge Recovery
Paper
•
2310.18356
•
Published
•
22
Unleashing the Power of Pre-trained Language Models for Offline
Reinforcement Learning
Paper
•
2310.20587
•
Published
•
16
TinyStories: How Small Can Language Models Be and Still Speak Coherent
English?
Paper
•
2305.07759
•
Published
•
33
Textbooks Are All You Need
Paper
•
2306.11644
•
Published
•
142
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models
Paper
•
2310.16795
•
Published
•
26
FLAP: Fast Language-Audio Pre-training
Paper
•
2311.01615
•
Published
•
16
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Paper
•
2311.05556
•
Published
•
81
Levels of AGI: Operationalizing Progress on the Path to AGI
Paper
•
2311.02462
•
Published
•
34
The Impact of Large Language Models on Scientific Discovery: a
Preliminary Study using GPT-4
Paper
•
2311.07361
•
Published
•
12
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads
to Answers Faster
Paper
•
2311.08263
•
Published
•
15
Technical Report: Large Language Models can Strategically Deceive their
Users when Put Under Pressure
Paper
•
2311.07590
•
Published
•
16
Music ControlNet: Multiple Time-varying Controls for Music Generation
Paper
•
2311.07069
•
Published
•
43
Prompt Engineering a Prompt Engineer
Paper
•
2311.05661
•
Published
•
20
PolyMaX: General Dense Prediction with Mask Transformer
Paper
•
2311.05770
•
Published
•
6
UFOGen: You Forward Once Large Scale Text-to-Image Generation via
Diffusion GANs
Paper
•
2311.09257
•
Published
•
45
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper
•
2311.10093
•
Published
•
56
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with
Modality Collaboration
Paper
•
2311.04257
•
Published
•
20
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as
an Alternative to Attention Layers in Transformers
Paper
•
2311.10642
•
Published
•
23
Orca 2: Teaching Small Language Models How to Reason
Paper
•
2311.11045
•
Published
•
71
Exponentially Faster Language Modelling
Paper
•
2311.10770
•
Published
•
117
MultiLoRA: Democratizing LoRA for Better Multi-Task Learning
Paper
•
2311.11501
•
Published
•
33
System 2 Attention (is something you might need too)
Paper
•
2311.11829
•
Published
•
39
GAIA: a benchmark for General AI Assistants
Paper
•
2311.12983
•
Published
•
185
Using Human Feedback to Fine-tune Diffusion Models without Any Reward
Model
Paper
•
2311.13231
•
Published
•
26
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Paper
•
2312.03818
•
Published
•
32
Magicoder: Source Code Is All You Need
Paper
•
2312.02120
•
Published
•
80
FaceStudio: Put Your Face Everywhere in Seconds
Paper
•
2312.02663
•
Published
•
30
Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis
Paper
•
2312.03491
•
Published
•
33
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Paper
•
2312.04474
•
Published
•
30
DeepCache: Accelerating Diffusion Models for Free
Paper
•
2312.00858
•
Published
•
21