Quantized with llama.cpp release b4077.

Format: GGUF
Model size: 70.6B params
Architecture: llama
Quantization levels: 4-bit, 16-bit


Model tree for Mimi-333/Llama-3.1-70B-Japanese-Instruct-2407-GGUF

Quantized (2): this model