neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w8a8 Text Generation • Updated 15 days ago • 369 • 2
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16_channel-e2e Text Generation • Updated about 5 hours ago • 448
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16_channel-e2e Text Generation • Updated about 4 hours ago • 458
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8A16_channel-e2e Text Generation • Updated about 5 hours ago • 31
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8A16_tensor-e2e Text Generation • Updated about 5 hours ago • 27