Rank,Model,Organization,College Mathematics,College Medicine,College Economics,College Computer Science,College Physics,College Chemistry,College Philosophy,College Management,Average Accuracy
🥇,Tongyi Qianwen 2 (qwen-max),Alibaba,39.60%,79.00%,77.00%,79.61%,55.00%,65.22%,83.00%,78.15%,69.57%
🥈,ERNIE Bot 4 (ERNIEBot-4),Baidu,45.54%,72.00%,75.00%,84.47%,51.25%,54.35%,80.00%,73.95%,67.07%
🥉,GPT4-turbo,OpenAI,44.55%,79.00%,73.00%,80.58%,45.00%,54.35%,72.00%,71.43%,64.99%
4,GPT4,OpenAI,46.53%,75.00%,72.00%,77.67%,47.50%,60.87%,67.00%,73.11%,64.96%
5,iFLYTEK Spark v3.0,iFLYTEK,42.57%,79.00%,64.00%,63.11%,45.00%,50.00%,73.00%,72.27%,61.12%
6,SenseTime SenseNova (Sensenova),SenseTime,39.60%,62.00%,79.00%,74.76%,37.50%,36.96%,75.00%,59.66%,58.06%
7,MiniMax (abab5.5-chat),MiniMax,31.68%,59.00%,60.00%,64.08%,40.00%,41.30%,72.00%,64.71%,54.10%
8,360 Zhinao (360GPT_S2_V9),360,38.61%,57.00%,60.00%,54.37%,43.75%,52.17%,59.00%,62.18%,53.39%
9,Baichuan (baichuan2-13b-chat-v1),Baichuan AI,17.82%,49.00%,59.00%,51.46%,17.50%,30.43%,63.00%,59.66%,43.48%
10,Qianfan-llama2,Meta/Baidu Qianfan,33.66%,44.00%,49.00%,35.92%,28.75%,28.26%,55.00%,57.14%,41.47%
11,ChatGLM3-6B,Tsinghua & Zhipu AI,21.78%,45.00%,46.00%,47.57%,30.00%,21.74%,55.00%,62.18%,41.16%
12,GPT-3.5-turbo,OpenAI,18.81%,54.00%,48.00%,55.34%,16.25%,34.78%,48.00%,49.58%,40.60%
13,BLOOMZ-7B,BigScience,22.77%,29.00%,25.00%,31.07%,23.75%,23.91%,35.00%,35.29%,28.22%
14,WuDao Aquila (AquilaChat-7B),BAAI,22.77%,24.00%,26.00%,22.33%,17.50%,21.74%,36.00%,33.61%,25.49%