Maxime Labonne PRO

mlabonne

https://mlabonne.github.io/blog

AI & ML interests

Post-training, model editing, quantization

Recent Activity

liked a dataset about 2 hours ago

cognitivecomputations/dolphin-r1

liked a dataset about 16 hours ago

rubenroy/GammaCorpus-v2-5m

liked a dataset about 17 hours ago

bespokelabs/Bespoke-Stratos-17k

Articles

Organizations

mlabonne's activity

upvoted an article 2 days ago

Reverse-engineering Custom-GPT prompts

Article • 3 days ago • 5

upvoted an article 7 days ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Article • 8 days ago • 92

upvoted an article 9 days ago

Making ML-powered web games with Transformers.js

Article • Jul 5, 2023 • 11

upvoted an article 14 days ago

The Large Language Model Course

Article • 14 days ago • 83

upvoted a paper 20 days ago

Enhancing Human-Like Responses in Large Language Models

Paper • 2501.05032 • Published 21 days ago • 49

upvoted an article 2 months ago

The Beginners Guide to Cleaning a Dataset

Article • Nov 18, 2024 • 24

upvoted 2 articles 3 months ago

Releasing the largest multilingual open pretraining dataset

Article • Nov 13, 2024 • 98

Decoding Strategies in Large Language Models

Article • Oct 29, 2024 • 39

upvoted a paper 3 months ago

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22, 2024 • 13

upvoted 2 articles 4 months ago

VLM Art Analysis

Article • Oct 4, 2024 • 11

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Article • Sep 18, 2024 • 216

upvoted a collection 6 months ago

🧠 Abliteration

Collection

Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration • 7 items • Updated Nov 18, 2024 • 27

upvoted an article 6 months ago

Introduction to ggml

Article • Aug 13, 2024 • 135

upvoted a paper 6 months ago

The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines

Paper • 2408.01050 • Published Aug 2, 2024 • 8

upvoted an article 6 months ago

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

Article • Aug 4, 2024 • 28

upvoted a paper 6 months ago

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1, 2024 • 24

upvoted a collection 6 months ago

Probably function calling datasets

Collection

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17, 2024 • 37

upvoted 2 papers 6 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 45

Understanding Reference Policies in Direct Preference Optimization

Paper • 2407.13709 • Published Jul 18, 2024 • 17

upvoted a collection 6 months ago

Bad Data Toolbox

Collection

PleIAs collection of models for the data processing of challenging document and data sources. • 5 items • Updated Jul 18, 2024 • 15

Articles

The Large Language Model Course

Decoding Strategies in Large Language Models

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

The Rise of Agentic Data Generation

Uncensor any LLM with abliteration

Fine-tune Llama 3 with ORPO

Create Mixtures of Experts with MergeKit

Merge Large Language Models with mergekit
