Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated about 14 hours ago • 44
AICC: Parse HTML Finer, Make Models Better -- A 7.3T AI-Ready Corpus Built by a Model-Based HTML Parser Paper • 2511.16397 • Published 13 days ago • 7
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published 14 days ago • 219
Article OpenRAIL: Towards open and responsible AI licensing frameworks Aug 31, 2022 • 12
Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning Paper • 2510.20150 • Published Oct 23 • 4
DR Tulu Collection Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 5 items • Updated 8 days ago • 29
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 302
The Bestiary Collection Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated 17 days ago • 68
Holo2 Collection Holo2 - Cost-Efficient Models for Cross-Platform Computer-Use Agents • 3 items • Updated 20 days ago • 21
Jan-v2-VL Collection Jan-v2-VL: an 8B VLM focused on reliable, many-step task execution. • 6 items • Updated 20 days ago • 36
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace Paper • 2303.17580 • Published Mar 30, 2023 • 14
gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built upon gpt-oss • 2 items • Updated Oct 29 • 58