Juan CM's picture

Juan CM PRO

jucamohedano

·

AI & ML interests

AI Systems MSc at Trento 🚀🤖

Recent Activity

updated a dataset 5 days ago

jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_predict

published a dataset 5 days ago

jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_predict

updated a dataset 5 days ago

jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_cot

View all activity

Organizations

updated a dataset 5 days ago

jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_predict

Viewer • Updated 5 days ago • 60 • 10

published a dataset 5 days ago

jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_predict

Viewer • Updated 5 days ago • 60 • 10

updated a dataset 5 days ago

jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_cot

Viewer • Updated 5 days ago • 60 • 9

published a dataset 5 days ago

jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_cot

Viewer • Updated 5 days ago • 60 • 9

updated 2 collections 4 months ago

Model merging

2 items • Updated Nov 1, 2025

Model search via model weights

2 items • Updated Nov 1, 2025

liked a Space 4 months ago

The Smol Training Playbook

The secrets to building world-class LLMs

updated a collection 5 months ago

Model merging

2 items • Updated Nov 1, 2025

upvoted 4 articles 5 months ago

Article

Vision Language Model Alignment in TRL ⚡️

+3

Aug 7, 2025

•

109

Article

KV Cache from scratch in nanoVLM

+3

Jun 4, 2025

•

112

Article

Vision Language Models (Better, faster, stronger)

+3

May 12, 2025

•

595

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

+5

May 21, 2025

•

251

liked a Space 7 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 8 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 250

updated a collection 9 months ago

Model search via model weights

2 items • Updated Nov 1, 2025

upvoted 2 papers 9 months ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published Feb 13, 2025 • 36

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28, 2025 • 46

upvoted a collection about 1 year ago

🤖 Agents

21 items • Updated Dec 31, 2024 • 173

upvoted an article about 1 year ago

Article

Introducing smolagents: simple agents that write actions in code.

+1

Dec 31, 2024

•

1.17k