indischepartij
/

TinyUltra-4x1.1B-Base-Alpha-4experts-GGUF

Inference Endpoints

Model card Files Files and versions Community

Edit model card

4 (full) experts used for inference, instead of 2

Downloads last month: 2

GGUF

Model size

3.38B params

Architecture

llama

4-bit

5-bit

Inference API

Unable to determine this model's library. Check the docs .

Collection including indischepartij/TinyUltra-4x1.1B-Base-Alpha-4experts-GGUF

GGUFs

8 items • Updated Mar 10