Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
neuralmagic
/
Meta-Llama-3.1-8B-quantized.w8a16
like
1
Follow
Neural Magic
176
Text Generation
Transformers
Safetensors
8 languages
llama
int8
vllm
text-generation-inference
Inference Endpoints
compressed-tensors
arxiv:
2210.17323
License:
llama3.1
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
302fdc8
Meta-Llama-3.1-8B-quantized.w8a16
/
README.md
Commit History
Update README.md
302fdc8
verified
alexmarques
commited on
Aug 13
Update README.md
ddadfa9
verified
alexmarques
commited on
Jul 31
Update README.md
b5146ed
verified
alexmarques
commited on
Jul 31
Create README.md
257f5ca
verified
alexmarques
commited on
Jul 31