排名,大模型,机构,一般攻击,指令攻击,综合得分 🥇,文心一言4(ERNIE-Bot4.0),百度,69.68,65.38,68.25 🥈,GPT4-Turbo,OpenAI,70.43,60.90,67.25 🥉,讯飞星火v3.0,科大讯飞,66.87,66.10,66.61 4,通义千问2(qwen-max),阿里巴巴,69.00,55.93,64.64 5,商汤日日新(Sensenova),商汤科技,65.66,59.62,63.65 6,ChatGLM3-6B,清华&智谱,64.96,58.78,62.90 7,GPT3.5-Turbo,OpenAI,64.84,58.47,62.72 8,悟道・天鹰(AquilaChat-7B),智源研究院,61.04,57.75,59.94 9,百川(baichuan2-13b-chat-v1),百川智能,60.88,56.23,59.33 10,GPT4,OpenAI,61.62,53.75,59.00 11,360智脑(360GPT_S2_V9),360,58.34,51.45,56.04 12,MiniMax(abab5.5-chat),MiniMax,62.51,40.92,55.31 13,千帆-llama2,Meta/百度千帆,57.04,47.94,54.01 14,BLOOMZ-7B,BigScience,44.98,47.58,45.85