Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
numen-tech
/
gemma-2-9b-it-w4a16g128asym
like
0
Text Generation
MLC-LLM
conversational
4-bit precision
arxiv:
2308.13137
License:
gemma
Model card
Files
Files and versions
Community
Use this model
Edit model card
4-bit
OmniQuant
quantized version of
gemma-2-9b-it
.
Downloads last month
0
Inference Examples
Text Generation
Inference API (serverless) does not yet support mlc-llm models for this pipeline type.
Model tree for
numen-tech/gemma-2-9b-it-w4a16g128asym
Base model
google/gemma-2-9b
Finetuned
google/gemma-2-9b-it
Quantized
(
93
)
this model