43 140 289

Gabriel Martín Blázquez

gabrielmbmb

https://gabrielmb.com

AI & ML interests

ML Engineer

Recent Activity

upvoted an article about 22 hours ago

We now support VLMs in smolagents!

upvoted an article 1 day ago

Welcome to Inference Providers on the Hub 🔥

upvoted an article 2 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

View all activity

Articles

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Jul 16, 2024

• 32

Organizations

gabrielmbmb's activity

upvoted an article about 22 hours ago

Article

We now support VLMs in smolagents!

6 days ago

• 63

upvoted an article 1 day ago

Article

Welcome to Inference Providers on the Hub 🔥

2 days ago

• 153

upvoted an article 2 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

2 days ago

• 414

upvoted an article 7 days ago

Article

Mastering Long Contexts in LLMs with KVPress

•

7 days ago

• 56

upvoted an article 9 days ago

Article

Fine-tune ModernBERT for RAG with Synthetic Data

•

10 days ago

• 28

upvoted an article 15 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

15 days ago

• 128

upvoted a paper 17 days ago

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

Paper • 2410.01560 • Published Oct 2, 2024 • 4

upvoted a paper 20 days ago

Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP

Paper • 2408.04303 • Published Aug 8, 2024 • 17

upvoted an article 20 days ago

Article

🌁#81: Key AI Concepts to Follow in 2025

•

Dec 23, 2024

• 24

upvoted an article 22 days ago

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

•

27 days ago

• 32

upvoted 3 papers about 1 month ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 344

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 125

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 18

upvoted a paper about 2 months ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 47

upvoted an article about 2 months ago

Article

They Said It Couldn’t Be Done

•

Dec 5, 2024

• 77

upvoted an article 2 months ago

Article

Use Models from the Hugging Face Hub in LM Studio

•

Nov 28, 2024

• 134

upvoted a collection 2 months ago

Tulu 3 Datasets

Collection

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 1 day ago • 64

upvoted a paper 2 months ago

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14, 2024 • 18

upvoted a paper 3 months ago

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12, 2024 • 63

upvoted a collection 3 months ago

Qwen2.5-Coder

Collection

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 268