AndrewB

aboundy

AI & ML interests

None yet

upvoted a paper 8 months ago

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 122

upvoted a collection 8 months ago

Deepseek Papers

Deepseek papers collection • 25 items • Updated 1 day ago • 273

upvoted an article 9 months ago

Open-R1: Update #1

Article • 8 authors • Feb 2 • 305

upvoted 3 papers over 1 year ago

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Paper • 2407.12077 • Published Jul 16, 2024 • 57

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 69

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Paper • 2403.07816 • Published Mar 12, 2024 • 44

upvoted a collection over 1 year ago

Model Merging

Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 248