Edit model card

4 (full) experts used for inference, instead of 2

Downloads last month
2
GGUF
Model size
3.38B params
Architecture
llama

4-bit

5-bit

Inference API
Unable to determine this model's library. Check the docs .

Collection including indischepartij/TinyUltra-4x1.1B-Base-Alpha-4experts-GGUF