Papers - a TangoDJ Collection

Papers

updated 5 days ago

Text-to-3D using Gaussian Splatting

Paper • 2309.16585 • Published Sep 28, 2023 • 31

FP8-LM: Training FP8 Large Language Models

Paper • 2310.18313 • Published Oct 27, 2023 • 31

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Paper • 2312.06585 • Published Dec 11, 2023 • 28

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5 • 70

A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Paper • 2402.09727 • Published Feb 15 • 36

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 101

Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20 • 95

Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Paper • 2402.14083 • Published Feb 21 • 47

Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22 • 23

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8 • 60

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28 • 104

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 254

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1 • 27

Scaling Laws for Pre-training Agents and World Models

Paper • 2411.04434 • Published 19 days ago

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

Paper • 2310.05492 • Published Oct 9, 2023 • 2

DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

Paper • 2310.03714 • Published Oct 5, 2023 • 30

Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

Paper • 2408.07852 • Published Aug 14 • 15

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 47

Towards Understanding Sycophancy in Language Models

Paper • 2310.13548 • Published Oct 20, 2023 • 4

Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Paper • 2406.10162 • Published Jun 14

RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9 • 34

Scalable MatMul-free Language Modeling

Paper • 2406.02528 • Published Jun 4 • 11

Training language models to follow instructions with human feedback

Paper • 2203.02155 • Published Mar 4, 2022 • 16

Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations

Paper • 2411.00640 • Published 24 days ago • 3