T,Models,ARC,HellaSwag,MMLU,TruthfulQA,Winogrande,GSM8K,Reference Model
🟢,roneneldan/TinyStories-3M,0.06,0.1,0.13,0.2,0.01,0,huggyllama/llama-7b
🟢,roneneldan/TinyStories-1M,0.05,0.11,0.09,0.17,0.01,0,huggyllama/llama-7b
🔶,Fredithefish/ReasonixPajama-3B-HF,0.15,0.24,0.21,0.94,0.01,0.44,huggyllama/llama-7b
🟢,mistralai/Mistral-7B-v0.1,0.54,0.51,0.46,0.75,0,0.91,huggyllama/llama-7b
🔶,rishiraj/meow,0.11,0.49,0.28,0.36,0.02,0.95,huggyllama/llama-7b
🔶,Q-bert/MetaMath-Cybertron-Starling,0.52,0.64,0.51,0.75,0.01,0.99,huggyllama/llama-7b
🔶,AIDC-ai-business/Marcoroni-7B-v3,0.1,0.14,0.2,0.41,0.0,0.95,mistralai/Mistral-7B-v0.1
🔶,amazon/MistralLite,0.09,0.14,0.2,0.43,0.0,0.73,mistralai/Mistral-7B-v0.1
🔶,openchat/openchat_3.5,0.13,0.13,0.23,0.45,0.0,0.97,mistralai/Mistral-7B-v0.1
🔶,meta-math/MetaMath-Mistral-7B,0.08,0.1,0.17,0.42,0.0,0.97,mistralai/Mistral-7B-v0.1
🔶,teknium/OpenHermes-2.5-Mistral-7B,0.07,0.13,0.23,0.39,0.0,0.96,mistralai/Mistral-7B-v0.1
🔶,microsoft/Orca-2-7b,0.88,0.8,0.77,0.91,0.0,1.0,mistralai/Mistral-7B-v0.1
🔶,WizardLM/WizardMath-7B-V1.1,0.1,0.11,0.21,0.4,0.0,0.99,mistralai/Mistral-7B-v0.1
🔶,01-ai/Yi-6B-200K,0.19,0.3,0.3,0.6,0.0,0.93,mistralai/Mistral-7B-v0.1
🔶,mistralai/Mistral-7B-Instruct-v0.2,0.06,0.21,0.17,0.48,0.0,0.95,mistralai/Mistral-7B-v0.1
🔶,Yhyu13/LMCocktail-10.7B-v1,0.1,0.44,0.23,0.51,0.0,0.97,mistralai/Mistral-7B-v0.1
🔶,ehartford/dolphin-2.1-mistral-7b,0.08,0.1,0.2,0.4,0.0,0.92,mistralai/Mistral-7B-v0.1
🔶,openchat/openchat-3.5-1210,0.1,0.12,0.2,0.4,0.0,0.98,mistralai/Mistral-7B-v0.1
🔶,HuggingFaceH4/zephyr-7b-beta,0.06,0.15,0.18,0.37,0.0,0.82,mistralai/Mistral-7B-v0.1
🔶,berkeley-nest/Starling-LM-7B-alpha,0.1,0.13,0.19,0.39,0.0,0.97,mistralai/Mistral-7B-v0.1
🔶,Sao10K/Ana-v1-m7,0.11,0.12,0.19,0.41,0.0,0.84,mistralai/Mistral-7B-v0.1
🔶,Open-Orca/Mistral-7B-OpenOrca,0.08,0.14,0.17,0.36,0.0,0.92,mistralai/Mistral-7B-v0.1
🔶,jondurbin/bagel-dpo-7b-v0.1,0.12,0.14,0.21,0.47,0.0,0.91,mistralai/Mistral-7B-v0.1
🔶,rwitz/go-bruins-v2,0.09,0.13,0.18,0.4,0.0,0.95,mistralai/Mistral-7B-v0.1
🔶,EmbeddedLLM/Mistral-7B-Merge-14-v0.3,0.09,0.11,0.18,0.39,0.0,0.95,mistralai/Mistral-7B-v0.1
🔶,chargoddard/loyal-piano-m7,0.11,0.13,0.19,0.45,0.0,0.97,mistralai/Mistral-7B-v0.1
🔶,rishiraj/CatPPT,0.09,0.12,0.19,0.44,0.0,0.98,mistralai/Mistral-7B-v0.1
🔶,togethercomputer/RedPajama-INCITE-Instruct-3B-v1,0.08,0.12,0.19,0.43,0.0,0.77,mistralai/Mistral-7B-v0.1
🔶,jan-hq/trinity-v1,0.07,0.16,0.18,0.35,0.0,0.95,mistralai/Mistral-7B-v0.1
🔶,lmsys/vicuna-7b-v1.5,0.13,0.16,0.22,0.62,0.0,0.96,mistralai/Mistral-7B-v0.1
🟢,huggyllama/llama-7b,0.11,0.17,0.22,0.46,0.0,0.79,mistralai/Mistral-7B-v0.1
🟢,tiiuae/falcon-7b-instruct,0.06,0.16,0.19,0.56,0.0,0.98,mistralai/Mistral-7B-v0.1
🔶,NousResearch/Nous-Hermes-llama-2-7b,0.09,0.18,0.26,0.5,0.0,0.96,mistralai/Mistral-7B-v0.1
🔶,openaccess-ai-collective/DPOpenHermes-7B-v2,0.08,0.11,0.22,0.41,0.0,0.96,mistralai/Mistral-7B-v0.1
🟢,01-ai/Yi-6B,0.28,0.32,0.3,0.62,0.02,0.94,mistralai/Mistral-7B-v0.1
🔶,Intel/neural-chat-7b-v3-1,0.1,0.15,0.18,0.49,0.0,0.81,mistralai/Mistral-7B-v0.1
🔶,fblgit/juanako-7b-UNA,0.09,0.15,0.18,0.46,0.0,0.81,mistralai/Mistral-7B-v0.1
🔶,Intel/neural-chat-7b-v3-2,0.12,0.14,0.2,0.5,0.0,0.93,mistralai/Mistral-7B-v0.1
🔶,fblgit/una-cybertron-7b-v2-bf16,0.1,0.12,0.21,0.46,0.0,0.92,mistralai/Mistral-7B-v0.1
🔶,Intel/neural-chat-7b-v3-3,0.06,0.15,0.18,0.47,0.0,0.98,mistralai/Mistral-7B-v0.1
🔶,fblgit/una-cybertron-7b-v3-OMA,0.04,0.16,0.17,0.36,0.0,0.94,mistralai/Mistral-7B-v0.1
🔶,fblgit/una-xaberius-34b-v1beta,0.37,0.54,0.33,0.61,0.04,0.96,huggyllama/llama-7b
🔶,upstage/SOLAR-10.7B-Instruct-v1.0,0.11,0.49,0.28,0.36,0.01,0.96,huggyllama/llama-7b
🔶,VAGOsolutions/SauerkrautLM-SOLAR-Instruct,0.12,0.54,0.32,0.34,0.01,0.96,huggyllama/llama-7b