Pristine-Mini-8B-A1B-Base (8.6B A1.4B)

Pristine is a model series / dataset based on a pruned version of Ring-Mini-2.0 (16B A1.4B --> 8.6B A1.4B). The main motivation behind this project is that Granite 4.0 7B A1B has been pretty disappointing and with a high-performance 8B-A1B these small MoEs can be avenged.

Note: This model hasn't been further trained. Do not use it as is or create quantizations of it!

Downloads last month
8
Safetensors
Model size
9B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for qingy2024/Pristine-Mini-8B-A1B-Base

Finetuned
(1)
this model
Quantizations
1 model