m33393
/

llama-65b-gptq-cuda-4bit-32g-safetensors

Text Generation

Inference Endpoints

Model card Files Files and versions Community

llama-65b-gptq-cuda-4bit-32g-safetensors

2 contributors

History: 9 commits

m33393's picture

Update README.md

73b449d over 1 year ago