Public Domain 12M: A Highly Aesthetic Image-Text Dataset with Novel Governance Mechanisms Paper • 2410.23144 • Published 22 days ago • 4
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated about 8 hours ago • 172
💻 Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 46
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 198
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module Paper • 2311.05556 • Published Nov 9, 2023 • 81
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper • 2406.16860 • Published Jun 24 • 57
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance Paper • 2307.00522 • Published Jul 2, 2023 • 32
Learning and Leveraging World Models in Visual Representation Learning Paper • 2403.00504 • Published Mar 1 • 31
Probing the 3D Awareness of Visual Foundation Models Paper • 2404.08636 • Published Apr 12 • 12
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations Paper • 2407.03471 • Published Jul 3 • 28
Article Breaking Barriers: The Critical Role of Art and Design in Advancing AI Capabilities By fffiloni • Jan 15 • 3
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25 • 86
LEDITS++: Limitless Image Editing using Text-to-Image Models Paper • 2311.16711 • Published Nov 28, 2023 • 22
TripoSR: Fast 3D Object Reconstruction from a Single Image Paper • 2403.02151 • Published Mar 4 • 12