jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_predict Viewer β’ Updated 5 days ago β’ 60 β’ 10
jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_predict Viewer β’ Updated 5 days ago β’ 60 β’ 10
Running on CPU Upgrade Featured 3k The Smol Training Playbook π 3k The secrets to building world-class LLMs
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 β’ 251
Running 3.7k The Ultra-Scale Playbook π 3.7k The ultimate guide to training LLM on large GPU Clusters
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! β’ 30 items β’ Updated Jun 12, 2024 β’ 250
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights Paper β’ 2502.09619 β’ Published Feb 13, 2025 β’ 36
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO Paper β’ 2505.22453 β’ Published May 28, 2025 β’ 46
view article Article Introducing smolagents: simple agents that write actions in code. +1 Dec 31, 2024 β’ 1.17k