SebastianSchramm
/

Cerebras-GPT-111M-instruction

Text Generation

text-generation-inference

Model card Files Files and versions Community

SebastianSchramm commited on Jun 1, 2023

Commit

09f1ec7

•

1 Parent(s): ff12284

fix eval score table

Files changed (1) hide show

README.md +6 -6

README.md CHANGED Viewed

@@ -20,13 +20,13 @@ Instruction fine-tuned [cerebras-GPT-111M](https://huggingface.co/cerebras/Cereb
 ## Evaluation
 The model has been evaluated with Huggingface's Open LLM leaderboard. Have a look at the leaderboard for more details: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
-The performance of the instruction fine-tuned model does improve compared to the cerebras base model:
-| Model                                          	| Average ⬆️ 	| ARC (25-shot) ⬆️ 	| HellaSwag (10-shot) ⬆️ 	| MMLU (5-shot) ⬆️ 	| TruthfulQA (0-shot) ⬆️ 	|
-|------------------------------------------------	|-----------	|-----------------	|-----------------------	|-----------------	|-----------------------	|
-| SebastianSchramm/Cerebras-GPT-111M-instruction 	| 31.6      	| 24.3            	| 26.2                  	| 26.5            	| 49.5                  	|
-| cerebras/Cerebras-GPT-111M                     	| 29.9      	| 20              	| 26.7                  	| 26.7            	| 46.3                  	|
-|                                                	|           	|                 	|                       	|                 	|                       	|
 ## Training data

 ## Evaluation
 The model has been evaluated with Huggingface's Open LLM leaderboard. Have a look at the leaderboard for more details: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
+The performance of the instruction fine-tuned model does improve compared to the cerebras base model by about 5.7% (average score):
+Model                                          	| Average 	| ARC (25-shot) 	| HellaSwag (10-shot)  	| MMLU (5-shot)  	| TruthfulQA (0-shot)
+--- | --- | --- | --- | --- | ---
+SebastianSchramm/Cerebras-GPT-111M-instruction 	| 31.6      	| 24.3            	| 26.2                  	| 26.5            	| 49.5
+cerebras/Cerebras-GPT-111M                     	| 29.9      	| 20              	| 26.7                  	| 26.7            	| 46.3
+||||||
 ## Training data