The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 26 days ago • 87
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Dec 22, 2024 • 213
Running 3 😻 Anthropic Citations With Gradio Metadata Key Anthropic's Citations API with a Gradio chatbot and tool use
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback Paper • 2501.10799 • Published 17 days ago • 14
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated about 6 hours ago • 139
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought Paper • 2501.04682 • Published 27 days ago • 90