Alan Tseng
Today, we announce Mistral 3, the next generation of Mistral models. Mistral 3 includes three state-of-the-art small, dense models (14B, 8B, and 3B) and Mistral Large 3 – our most capable model to date – a sparse mixture-of-experts trained with 41B active and 675B total parameters.
All models are released under the Apache 2.0 license.
Ministrals:
https://huggingface.co/collections/mistralai/ministral-3
Mistral Large 3:
https://huggingface.co/collections/mistralai/mistral-large-3
I've been waiting for this. Very exciting!
By the way, here are the links to the models, because I find the naming and the Hugging Face layout confusing.
3B
mistralai/Ministral-3-3B-Base-2512
mistralai/Ministral-3-3B-Instruct-2512
mistralai/Ministral-3-3B-Instruct-2512-BF16
mistralai/Ministral-3-3B-Instruct-2512-GGUF
mistralai/Ministral-3-3B-Instruct-2512-ONNX
mistralai/Ministral-3-3B-Reasoning-2512
mistralai/Ministral-3-3B-Reasoning-2512-GGUF
8B
mistralai/Ministral-3-8B-Base-2512
mistralai/Ministral-3-8B-Instruct-2512
mistralai/Ministral-3-8B-Instruct-2512-BF16
mistralai/Ministral-3-8B-Instruct-2512-GGUF
mistralai/Ministral-3-8B-Reasoning-2512
mistralai/Ministral-3-8B-Reasoning-2512-GGUF
14B
mistralai/Ministral-3-14B-Base-2512
mistralai/Ministral-3-14B-Instruct-2512
mistralai/Ministral-3-14B-Instruct-2512-BF16
mistralai/Ministral-3-14B-Instruct-2512-GGUF
mistralai/Ministral-3-14B-Reasoning-2512
mistralai/Ministral-3-14B-Reasoning-2512-GGUF
Large
mistralai/Mistral-Large-3-675B-Base-2512
mistralai/Mistral-Large-3-675B-Instruct-2512
mistralai/Mistral-Large-3-675B-Instruct-2512-Eagle
mistralai/Mistral-Large-3-675B-Instruct-2512-NVFP4
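
Since the naming is the confusing part, here is a rough sketch of how the repo IDs above seem to break down: family, parameter count, variant (Base / Instruct / Reasoning), the `2512` tag, and an optional format suffix such as `GGUF` or `NVFP4`. The pattern is inferred only from the names in this list, not from any official Mistral naming scheme, and `ModelName`, `parse_repo_id`, and the field names (including calling `2512` a "release" tag) are my own hypothetical labels.

```python
import re
from typing import NamedTuple, Optional


class ModelName(NamedTuple):
    """Pieces of a repo ID, as inferred from the list above (hypothetical labels)."""
    family: str          # "Ministral-3" or "Mistral-Large-3"
    size: str            # parameter count, e.g. "3B", "675B"
    variant: str         # "Base", "Instruct", or "Reasoning"
    release: str         # the "2512" tag; its exact meaning is an assumption
    fmt: Optional[str]   # optional suffix such as "GGUF", "BF16", "ONNX"


# Pattern guessed from the repo IDs listed in this post only.
PATTERN = re.compile(
    r"^mistralai/(?P<family>Ministral-3|Mistral-Large-3)"
    r"-(?P<size>\d+B)"
    r"-(?P<variant>Base|Instruct|Reasoning)"
    r"-(?P<release>\d{4})"
    r"(?:-(?P<fmt>[A-Za-z0-9]+))?$"
)


def parse_repo_id(repo_id: str) -> ModelName:
    """Split a repo ID into its parts, or raise ValueError if it does not fit."""
    match = PATTERN.match(repo_id)
    if match is None:
        raise ValueError(f"unrecognised repo ID: {repo_id!r}")
    return ModelName(**match.groupdict())


if __name__ == "__main__":
    for repo in (
        "mistralai/Ministral-3-3B-Base-2512",
        "mistralai/Ministral-3-14B-Reasoning-2512-GGUF",
        "mistralai/Mistral-Large-3-675B-Instruct-2512-NVFP4",
    ):
        print(parse_repo_id(repo))
```

Repos without a suffix (for example the `Base` ones) come back with `fmt=None`. Any repo that does not follow this pattern raises `ValueError`, so the sketch will need adjusting if other naming variants show up.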