Quentin Gallouédec's picture

Quentin Gallouédec PRO

qgallouedec

·

AI & ML interests

None yet

Recent Activity

updated a dataset 2 days ago

hf-doc-build/doc-build

updated a dataset 2 days ago

hf-doc-build/doc-build-dev

updated a dataset 2 days ago

hf-doc-build/doc-build

View all activity

Organizations

upvoted 3 changelogs 9 days ago

Changelog

Custom Domains for Spaces

about 1 month ago

• 74

Changelog

Repositories total file size is now displayed

30 days ago

• 160

Changelog

GGUF Metadata Editor

11 days ago

• 60

upvoted a paper 23 days ago

ARE: Scaling Up Agent Environments and Evaluations

Paper • 2509.17158 • Published 26 days ago • 34

upvoted an article about 1 month ago

Article

🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware

Feb 10, 2023

• 106

upvoted a paper about 1 month ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 187

upvoted 3 papers about 2 months ago

SLiC-HF: Sequence Likelihood Calibration with Human Feedback

Paper • 2305.10425 • Published May 17, 2023 • 6

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published Aug 11 • 47

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 177

upvoted a collection 2 months ago

Testing datasets

5 items • Updated Aug 18 • 1

upvoted 4 papers 2 months ago

panda-gym: Open-source goal-conditioned environments for robotic learning

Paper • 2106.13687 • Published Jun 25, 2021 • 3

Cell-Free Latent Go-Explore

Paper • 2208.14928 • Published Aug 31, 2022 • 1

Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

Paper • 2402.03046 • Published Feb 5, 2024 • 7

Distributional Preference Alignment of LLMs via Optimal Transport

Paper • 2406.05882 • Published Jun 9, 2024 • 2

upvoted an article 2 months ago

Article

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

Aug 8

• 71

upvoted 2 papers 2 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 186

EMA Without the Lag: Bias-Corrected Iterate Averaging Schemes

Paper • 2508.00180 • Published Jul 31 • 1

upvoted a collection 2 months ago

Gemma 3 Release

28 items • Updated Aug 11 • 515

upvoted an article 2 months ago

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7

• 96

upvoted a collection 2 months ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 361