Cartinoe5930
's Collections
NLP Paper Reading
updated
Large Language Models as Optimizers
Paper
•
2309.03409
•
Published
•
75
From Sparse to Dense: GPT-4 Summarization with Chain of Density
Prompting
Paper
•
2309.04269
•
Published
•
32
Textbooks Are All You Need II: phi-1.5 technical report
Paper
•
2309.05463
•
Published
•
87
Efficient Memory Management for Large Language Model Serving with
PagedAttention
Paper
•
2309.06180
•
Published
•
25
Agents: An Open-source Framework for Autonomous Language Agents
Paper
•
2309.07870
•
Published
•
42
Connecting Large Language Models with Evolutionary Algorithms Yields
Powerful Prompt Optimizers
Paper
•
2309.08532
•
Published
•
52
Contrastive Decoding Improves Reasoning in Large Language Models
Paper
•
2309.09117
•
Published
•
37
Language Modeling Is Compression
Paper
•
2309.10668
•
Published
•
82
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper
•
2309.11495
•
Published
•
38
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper
•
2309.12307
•
Published
•
87
Enable Language Models to Implicitly Learn Self-Improvement From Data
Paper
•
2310.00898
•
Published
•
23
Language Models can be Logical Solvers
Paper
•
2311.06158
•
Published
•
18
MART: Improving LLM Safety with Multi-round Automatic Red-Teaming
Paper
•
2311.07689
•
Published
•
7
Contrastive Chain-of-Thought Prompting
Paper
•
2311.09277
•
Published
•
34
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Paper
•
2311.10702
•
Published
•
18
Orca 2: Teaching Small Language Models How to Reason
Paper
•
2311.11045
•
Published
•
70
System 2 Attention (is something you might need too)
Paper
•
2311.11829
•
Published
•
39
GAIA: a benchmark for General AI Assistants
Paper
•
2311.12983
•
Published
•
184
Fine-tuning Language Models for Factuality
Paper
•
2311.08401
•
Published
•
28
TinyGSM: achieving >80% on GSM8k with small language models
Paper
•
2312.09241
•
Published
•
37
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak
Supervision
Paper
•
2312.09390
•
Published
•
32
LLM in a flash: Efficient Large Language Model Inference with Limited
Memory
Paper
•
2312.11514
•
Published
•
258
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Paper
•
2312.00752
•
Published
•
138
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective
Depth Up-Scaling
Paper
•
2312.15166
•
Published
•
56
Paper
•
2401.04088
•
Published
•
159
Self-Rewarding Language Models
Paper
•
2401.10020
•
Published
•
144
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language
Modeling
Paper
•
2401.16380
•
Published
•
48