Qwen2.5 quantization_benchmark
#2
by
panxiongfei138
- opened
Can we get this benchmark update for Qwen 2.5 ? https://qwen.readthedocs.io/en/latest/benchmark/quantization_benchmark.html
Or any one tested Qwen2.5 72B Int 8 vs Qwen2.5 32B ? These two models have the same GPU resources ,but we want a better one.
Any suggestions ?