mysticbeing/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-DYNAMIC Text Generation • Updated 16 days ago • 474 • 4
FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated Oct 17 • 58