MagicQuill: An Intelligent Interactive Image Editing System Paper β’ 2411.09703 β’ Published 9 days ago β’ 52
OpenCulture Collection A multilingual dataset of public domain books and newspapers. β’ 27 items β’ Updated 16 days ago β’ 117
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais β’ 10 days ago β’ 94
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M β’ 10 items β’ Updated 2 days ago β’ 173
CLEAR: Character Unlearning in Textual and Visual Modalities Paper β’ 2410.18057 β’ Published about 1 month ago β’ 199
Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions Paper β’ 2410.17655 β’ Published about 1 month ago β’ 5
How Many Van Goghs Does It Take to Van Gogh? Finding the Imitation Threshold Paper β’ 2410.15002 β’ Published Oct 19 β’ 6
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. β’ 3 items β’ Updated 30 days ago β’ 26
πͺ SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos β’ 12 items β’ Updated Aug 18 β’ 198
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines Paper β’ 2410.12705 β’ Published Oct 16 β’ 29
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation Paper β’ 2410.13848 β’ Published Oct 17 β’ 27
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. β’ 6 items β’ Updated Oct 15 β’ 141
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. β’ 45 items β’ Updated Sep 18 β’ 379
CursorCore: Assist Programming through Aligning Anything Paper β’ 2410.07002 β’ Published Oct 9 β’ 13