Dan Alistarh

dalistarh

https://people.csail.mit.edu/alistarh

AI & ML interests

NLP, efficiency

upvoted 2 papers about 1 year ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 51

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

Paper • 2409.11055 • Published Sep 17, 2024 • 17

upvoted a collection over 1 year ago

Llama 3.1 GPTQ, AWQ, and BNB Quants

Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗 • 9 items • Updated Sep 26, 2024 • 57