--- language: - en - de - fr - it - pt - hi - es - th license: llama3.2 base_model: NousResearch/Hermes-3-Llama-3.2-3B base_model_relation: quantized library_name: mlc-llm pipeline_tag: text-generation tags: - Llama-3 - instruct - finetune - chatml - gpt4 - synthetic data - distillation - function calling - json mode - axolotl - roleplaying - chat --- 4-bit [OmniQuant](https://arxiv.org/abs/2308.13137) quantized version of [Hermes-3-Llama-3.2-3B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.2-3B) for inference with [Private LLM](http://privatellm.app).