neuralmagic/Meta-Llama-3.1-8B-quantized.w8a16
Text Generation
Transformers
Safetensors
8 languages
llama
int8
vllm
text-generation-inference
Inference Endpoints
compressed-tensors
arxiv: 2210.17323
License: llama3.1
Commit History
Update README.md
ddadfa9 (verified), alexmarques committed on Jul 31
Update README.md
b5146ed (verified), alexmarques committed on Jul 31
Create README.md
257f5ca (verified), alexmarques committed on Jul 31
Upload folder using huggingface_hub
2f1d7c5 (verified), alexmarques committed on Jul 31
initial commit
6bcdb67 (verified), alexmarques committed on Jul 31