Snorlax
's Collections
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
Paper
•
2309.04663
•
Published
•
5
Textbooks Are All You Need II: phi-1.5 technical report
Paper
•
2309.05463
•
Published
•
87
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic
Image Design and Generation
Paper
•
2310.08541
•
Published
•
17
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large
Language Models by Extrapolating Errors from Small Models
Paper
•
2310.13671
•
Published
•
18
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper
•
2310.17680
•
Published
•
69
Detecting Pretraining Data from Large Language Models
Paper
•
2310.16789
•
Published
•
10
The Generative AI Paradox: "What It Can Create, It May Not Understand"
Paper
•
2311.00059
•
Published
•
18
Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
Paper
•
2311.06243
•
Published
•
17
Prompt Engineering a Prompt Engineer
Paper
•
2311.05661
•
Published
•
20
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper
•
2311.05437
•
Published
•
47
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with
Modality Collaboration
Paper
•
2311.04257
•
Published
•
20
NExT-Chat: An LMM for Chat, Detection and Segmentation
Paper
•
2311.04498
•
Published
•
11
FlashDecoding++: Faster Large Language Model Inference on GPUs
Paper
•
2311.01282
•
Published
•
35
SelfEval: Leveraging the discriminative nature of generative models for
evaluation
Paper
•
2311.10708
•
Published
•
14
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation
•
Updated
•
355k
•
•
4.21k
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak
Supervision
Paper
•
2312.09390
•
Published
•
32
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Paper
•
2312.10003
•
Published
•
36
Weight subcloning: direct initialization of transformers using larger
pretrained ones
Paper
•
2312.09299
•
Published
•
17
Improving Text Embeddings with Large Language Models
Paper
•
2401.00368
•
Published
•
79
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model
Scaling Laws
Paper
•
2401.00448
•
Published
•
28
Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual
Concept Understanding
Paper
•
2401.04575
•
Published
•
14
MoE-Mamba: Efficient Selective State Space Models with Mixture of
Experts
Paper
•
2401.04081
•
Published
•
71
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and
DeepSpeed-Inference
Paper
•
2401.08671
•
Published
•
14
Can Large Language Models Understand Context?
Paper
•
2402.00858
•
Published
•
21
Specialized Language Models with Cheap Inference from Limited Domain
Data
Paper
•
2402.01093
•
Published
•
45
SPAR: Personalized Content-Based Recommendation via Long Engagement
Attention
Paper
•
2402.10555
•
Published
•
33
Priority Sampling of Large Language Models for Compilers
Paper
•
2402.18734
•
Published
•
16