nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • Updated about 1 month ago • 205k • 1.72k
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices Paper • 2410.00531 • Published Oct 1 • 29
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated 6 days ago • 229