LLM_leaderboard / 学科.csv
Li
Update 学科.csv
9224dcc verified
raw
history blame
866 Bytes
排名,大模型,机构,中学试题正确率,大学试题正确率,平均正确率
🥇,通义千问2(qwen-max),阿里巴巴,84.80%,69.57%,77.19%
🥈,文心一言4(ERNIE-Bot4.0),百度,79.07%,67.07%,73.07%
🥉,GPT4-Turbo,OpenAI,70.65%,64.99%,67.82%
4,讯飞星火v3.0,科大讯飞,72.21%,61.12%,66.66%
5,GPT4,OpenAI,66.62%,64.96%,65.79%
6,商汤日日新(Sensenova),商汤科技,68.07%,58.06%,63.07%
7,MiniMax(abab5.5-chat),MiniMax,62.35%,54.10%,58.23%
8,360智脑(360GPT_S2_V9),360,52.17%,53.39%,52.78%
9,百川(baichuan2-13b-chat-v1),百川智能,57.68%,43.48%,50.58%
10,ChatGLM3-6B,清华&智谱,54.83%,41.16%,48.00%
11,千帆-llama2,Meta/百度千帆,51.27%,41.47%,46.37%
12,GPT3.5-Turbo,OpenAI,25.73%,40.60%,33.17%
13,BLOOMZ-7B,BigScience,32.32%,28.22%,30.27%
14,悟道・天鹰(AquilaChat-7B),智源研究院,22.98%,25.49%,24.24%