Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
m33393
/
llama-65b-gptq-cuda-4bit-32g-safetensors
like
2
Text Generation
Transformers
Safetensors
llama
Inference Endpoints
License:
other
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
llama-65b-gptq-cuda-4bit-32g-safetensors
2 contributors
History:
9 commits
m33393
Update README.md
73b449d
over 1 year ago
.gitattributes
Safe
1.48 kB
initial commit
over 1 year ago
4bit-32g.safetensors
Safe
38.5 GB
LFS
Initial Commit
over 1 year ago
README.md
Safe
614 Bytes
Update README.md
over 1 year ago
config.json
Safe
507 Bytes
Initial Commit
over 1 year ago
generation_config.json
Safe
137 Bytes
Initial Commit
over 1 year ago
special_tokens_map.json
Safe
411 Bytes
Initial Commit
over 1 year ago
tokenizer.json
Safe
1.84 MB
Initial Commit
over 1 year ago
tokenizer.model
Safe
500 kB
LFS
Initial Commit
over 1 year ago
tokenizer_config.json
Safe
727 Bytes
Initial Commit
over 1 year ago