riczhou
/

Llama-3-70B-Instruct-awq-int8-kv-cache-trt-llm

Inference Endpoints

Model card Files Files and versions Community

Llama-3-70B-Instruct-awq-int8-kv-cache-trt-llm

1 contributor

History: 2 commits

riczhou's picture

Upload folder using huggingface_hub

0fdf245 verified 8 months ago