Younes Belkada's picture

Younes Belkada

ybelkada

·

AI & ML interests

Large Language Models, Quantization, Vision, Multimodality, Diffusion models

Recent Activity

New activity about 11 hours ago

ybelkada/t5-11b-sharded

New activity 4 days ago

ybelkada/mpt-7b-bf16-sharded

New activity 6 days ago

mlx-community/falcon-mamba-7b-bf16

Articles

Welcome FalconMamba: The first strong attention-free 7B model

Welcome Llama 3 - Meta's new open LLM

GaLore: Advancing Large Model Training on Consumer-grade Hardware

quanto: a pytorch quantization toolkit

Fine-Tuning Gemma Models in Hugging Face

Mixture of Experts Explained

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Overview of natively supported quantization schemes in 🤗 Transformers

Making LLMs lighter with AutoGPTQ and transformers

Fine-tune Llama 2 with DPO

The Falcon has landed in the Hugging Face ecosystem

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Introducing RWKV — An RNN with the advantages of a transformer

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Organizations

ybelkada's activity

upvoted a paper about 1 month ago

Falcon Mamba: The First Competitive Attention-free 7B Language Model

Paper • 2410.05355 • Published Oct 7 • 29

upvoted an article 3 months ago

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 102

upvoted a collection 3 months ago

🦅 🐍 FalconMamba 7B

This collection features the FalconMamba 7B base model, the instruction-tuned version, their 4-bit and GGUF variants, and the demo. • 15 items • Updated Oct 10 • 29

upvoted a collection 5 months ago

4M Models

Multimodal models from https://4m.epfl.ch/ • 14 items • Updated Jun 14 • 29

upvoted 2 papers 5 months ago

Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning

Paper • 2303.02861 • Published Mar 6, 2023 • 2

XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model

Paper • 2406.04904 • Published Jun 7 • 4

upvoted a collection 6 months ago

AQLM+PV

Official AQLM quantizations for "PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression": https://arxiv.org/abs/2405.14852 • 25 items • Updated 13 days ago • 18

upvoted a paper 6 months ago

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published May 28 • 12

upvoted 2 articles 7 months ago

Article

Overview of natively supported quantization schemes in 🤗 Transformers

Sep 12, 2023

• 10

Article

Mixture of Experts Explained

Dec 11, 2023

• 195

upvoted a collection 7 months ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Sep 25 • 683

upvoted 3 papers 7 months ago

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Paper • 2404.10719 • Published Apr 16 • 4

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Paper • 2402.09844 • Published Feb 15 • 20

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 254

upvoted a collection 7 months ago

Pile-T5

T5 trained on the Pile with Llama Tokenizer • 4 items • Updated Jul 6 • 17

upvoted a paper 7 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 62

upvoted 4 articles 8 months ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 93

Article

Fine-Tuning Gemma Models in Hugging Face

Feb 23

• 23

Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Mar 20

• 25

Article

quanto: a pytorch quantization toolkit

Mar 18

• 31