Rank,Model,Organization,Secondary-school question accuracy,University question accuracy,Average accuracy
🥇,Tongyi Qianwen 2 (qwen-max),Alibaba,84.80%,69.57%,77.19%
🥈,Wenxin Yiyan 4 (ERNIE-Bot4.0),Baidu,79.07%,67.07%,73.07%
🥉,GPT4-Turbo,OpenAI,70.65%,64.99%,67.82%
4,iFlytek Spark v3.0,iFlytek,72.21%,61.12%,66.66%
5,GPT4,OpenAI,66.62%,64.96%,65.79%
6,SenseTime Ririxin (Sensenova),SenseTime,68.07%,58.06%,63.07%
7,MiniMax (abab5.5-chat),MiniMax,62.35%,54.10%,58.23%
8,360 Zhinao (360GPT_S2_V9),360,52.17%,53.39%,52.78%
9,Baichuan (baichuan2-13b-chat-v1),Baichuan Intelligence,57.68%,43.48%,50.58%
10,ChatGLM3-6B,Tsinghua & Zhipu,54.83%,41.16%,48.00%
11,Qianfan-llama2,Meta/Baidu Qianfan,51.27%,41.47%,46.37%
12,GPT3.5-Turbo,OpenAI,25.73%,40.60%,33.17%
13,BLOOMZ-7B,BigScience,32.32%,28.22%,30.27%
14,Wudao Aquila (AquilaChat-7B),BAAI,22.98%,25.49%,24.24%