排名,大模型,机构,通用语言能力,专业与学科能力,安全与责任,综合得分 🥇,文心一言4(ERNIEBot-4),百度,80.03,73.07,68.25,74.58 🥈,GPT4-Turbo,OpenAI,82.59,67.82,67.25,73.66 🥉,通义千问2(qwen-max),阿里巴巴,75.22,77.19,64.64,72.97 4,GPT4,OpenAI,80.60,65.79,59,69.95 5,讯飞星火v3.0,科大讯飞,72.61,66.66,66.61,69.06 6,商汤日日新(Sensenova),商汤科技,71.29,63.07,63.65,66.56 7,MiniMax(abab5.5-chat),MiniMax,71.21,58.23,55.31,62.70 8,ChatGLM3-6B,清华&智谱,70.38,48.00,62.9,61.13 9,360智脑(360GPT_S2_V9),360,67.50,52.78,56.04,59.64 10,GPT3.5-Turbo,OpenAI,72.96,33.17,62.72,57.35 11,百川(baichuan2-13b-chat-v1),百川智能,60.14,50.58,59.33,56.84 12,千帆-llama2,Meta/百度千帆,57.04,46.37,54.01,52.78 13,悟道・天鹰(AquilaChat-7B),智源研究院,56.75,24.24,59.94,47.14 14,BLOOMZ-7B,BigScience,49.80,30.27,45.85,42.43