Qwen2.5 quantization_benchmark

#2
by panxiongfei138 - opened

Can we get this benchmark updated for Qwen2.5? https://qwen.readthedocs.io/en/latest/benchmark/quantization_benchmark.html

Or has anyone tested Qwen2.5 72B Int8 against Qwen2.5 32B? These two models need about the same GPU resources, and we want to pick the better one.
Any suggestions?
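
For context, here is a rough sketch of why the two setups land in a similar memory range. It assumes nominal parameter counts of 72e9 and 32e9, that the 32B model runs in 16-bit precision, and that only the weights are counted (no KV cache, activations, or quantization overhead). The helper name `weight_memory_gb` is made up for illustration.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate weight storage in GB (1e9 bytes).

    Counts weights only; KV cache, activations, and quantization
    metadata (scales, zero points) are ignored.
    """
    return num_params * bits_per_param / 8 / 1e9


# Assumed nominal sizes, not exact parameter counts.
int8_72b = weight_memory_gb(72e9, 8)    # 72B model, 8-bit weights
bf16_32b = weight_memory_gb(32e9, 16)   # 32B model, 16-bit weights

print(f"72B Int8 weights: ~{int8_72b:.0f} GB")
print(f"32B 16-bit weights: ~{bf16_32b:.0f} GB")
```

Under these assumptions the weights alone differ by about 8 GB, so actual memory use would still need to be measured on the target hardware.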
