Rank,Model,Organization,General attack,Instruction attack,Overall score
🥇,ERNIE Bot 4 (ERNIE-Bot4.0),Baidu,69.68,65.38,68.25
🥈,GPT4-Turbo,OpenAI,70.43,60.90,67.25
🥉,iFLYTEK Spark v3.0,iFLYTEK,66.87,66.10,66.61
4,Tongyi Qianwen 2 (qwen-max),Alibaba,69.00,55.93,64.64
5,SenseTime SenseNova (Sensenova),SenseTime,65.66,59.62,63.65
6,ChatGLM3-6B,Tsinghua & Zhipu AI,64.96,58.78,62.90
7,GPT3.5-Turbo,OpenAI,64.84,58.47,62.72
8,WuDao Aquila (AquilaChat-7B),BAAI,61.04,57.75,59.94
9,Baichuan (baichuan2-13b-chat-v1),Baichuan Intelligence,60.88,56.23,59.33
10,GPT4,OpenAI,61.62,53.75,59.00
11,360 Zhinao (360GPT_S2_V9),360,58.34,51.45,56.04
12,MiniMax (abab5.5-chat),MiniMax,62.51,40.92,55.31
13,Qianfan-llama2,Meta/Baidu Qianfan,57.04,47.94,54.01
14,BLOOMZ-7B,BigScience,44.98,47.58,45.85
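
The table does not say how the last column (overall score) is derived from the two attack scores. The published numbers are numerically consistent with a weighted mean that counts the general-attack score twice and the instruction-attack score once. The sketch below checks that reading against every row. The 2:1 weighting is an assumption inferred from the figures, not something the leaderboard states, and the names `weighted_overall` and `max_deviation` are hypothetical helpers introduced only for this illustration.

```python
# Rows copied from the leaderboard:
# (model identifier, general attack, instruction attack, published overall score)
ROWS = [
    ("ERNIE-Bot4.0", 69.68, 65.38, 68.25),
    ("GPT4-Turbo", 70.43, 60.90, 67.25),
    ("Spark v3.0", 66.87, 66.10, 66.61),
    ("qwen-max", 69.00, 55.93, 64.64),
    ("Sensenova", 65.66, 59.62, 63.65),
    ("ChatGLM3-6B", 64.96, 58.78, 62.90),
    ("GPT3.5-Turbo", 64.84, 58.47, 62.72),
    ("AquilaChat-7B", 61.04, 57.75, 59.94),
    ("baichuan2-13b-chat-v1", 60.88, 56.23, 59.33),
    ("GPT4", 61.62, 53.75, 59.00),
    ("360GPT_S2_V9", 58.34, 51.45, 56.04),
    ("abab5.5-chat", 62.51, 40.92, 55.31),
    ("Qianfan-llama2", 57.04, 47.94, 54.01),
    ("BLOOMZ-7B", 44.98, 47.58, 45.85),
]


def weighted_overall(general, instruction, w_general=2.0, w_instruction=1.0):
    """Weighted mean of the two attack scores.

    The default 2:1 weights are an ASSUMPTION inferred from the table,
    not a formula documented by the leaderboard.
    """
    total_weight = w_general + w_instruction
    return (w_general * general + w_instruction * instruction) / total_weight


def max_deviation(rows):
    """Largest absolute gap between the recomputed and published overall score."""
    return max(
        abs(weighted_overall(general, instruction) - overall)
        for _, general, instruction, overall in rows
    )


if __name__ == "__main__":
    for name, general, instruction, overall in ROWS:
        recomputed = weighted_overall(general, instruction)
        print(f"{name}: published {overall:.2f}, recomputed {recomputed:.4f}")
    print(f"max deviation: {max_deviation(ROWS):.4f}")
```

Every row agrees with this formula to within two-decimal rounding, which also accounts for models such as MiniMax (abab5.5-chat) ranking low despite a mid-table general-attack score: a weak instruction-attack result still contributes a third of the total.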