iMatrix GGUFs for https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

iMat generated using Kalomaze's groups_merged.txt

Downloads last month
76
GGUF
Model size
70.6B params
Architecture
llama
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model authors have turned it off explicitly.

Model tree for MarsupialAI/Llama-3.1-Nemotron-70B-Instruct_iMat_GGUF

Quantized
(105)
this model

Dataset used to train MarsupialAI/Llama-3.1-Nemotron-70B-Instruct_iMat_GGUF