This model is optimized for peformance on the Nvidia Jetson Orin Nano.

Downloads last month
13
Safetensors
Model size
393M params
Tensor type
I32
·
FP16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for jsbaicenter/Llama-3.2-1b-Instruct-AWQ-4bit-GEMM

Quantized
(215)
this model