neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8-dynamic
Text Generation
Transformers
Safetensors
8 languages
llama
fp8
vllm
conversational
text-generation-inference
Inference Endpoints
compressed-tensors
License: llama3.1
main
Commit History
Update README | 2063612 | verified | ekurtic | committed on Oct 19
Update README.md | b4793e6 | verified | alexmarques | committed on Oct 10
Updated compression_config to quantization_config | fc3ee56 | verified | mgoin | committed on Oct 9
Update README.md | 019d944 | verified | Lin-K76 | committed on Aug 23
Upload folder using huggingface_hub | 17e44d2 | verified | Lin-K76 | committed on Aug 22
Update README.md | fc4ffcb | verified | alexmarques | committed on Aug 13
Update README.md | 893683d | verified | alexmarques | committed on Jul 30
Update README.md | b9e995e | verified | Lin-K76 | committed on Jul 27
Update README.md | b589a15 | verified | Lin-K76 | committed on Jul 26
Update README.md | 3459f3c | verified | Lin-K76 | committed on Jul 26
Update README.md | 0c7579a | verified | Lin-K76 | committed on Jul 26
Update README.md | a69a373 | verified | Lin-K76 | committed on Jul 26
Upload folder using huggingface_hub | 313cd77 | verified | Lin-K76 | committed on Jul 26
Upload folder using huggingface_hub | 2d2c5f6 | verified | Lin-K76 | committed on Jul 26
Update README.md | 5b64234 | verified | Lin-K76 | committed on Jul 25
Update README.md | 0cca697 | verified | Lin-K76 | committed on Jul 24
Create README.md | 5fe96cf | verified | Lin-K76 | committed on Jul 24
Upload folder using huggingface_hub | eb04e5c | verified | Lin-K76 | committed on Jul 23
initial commit | 3beaf1e | verified | Lin-K76 | committed on Jul 23