jan-hq leaderboard-pr-bot commited on
Commit
faae4d4
1 Parent(s): 309e861

Adding Evaluation Results (#1)

Browse files

- Adding Evaluation Results (74333fdcf128edd5a5af07dbafee485e7c0d415d)


Co-authored-by: Open LLM Leaderboard PR Bot <leaderboard-pr-bot@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +15 -1
README.md CHANGED
@@ -1,6 +1,5 @@
1
  ---
2
  license: apache-2.0
3
- base_model: TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T
4
  tags:
5
  - alignment-handbook
6
  - generated_from_trainer
@@ -11,6 +10,7 @@ datasets:
11
  - jan-hq/bagel_sft_binarized
12
  - jan-hq/dolphin_binarized
13
  - jan-hq/openhermes_binarized
 
14
  model-index:
15
  - name: LlamaCorn-sft-adapter
16
  results: []
@@ -117,3 +117,17 @@ The following hyperparameters were used during training:
117
  - Pytorch 2.1.0+cu121
118
  - Datasets 2.14.6
119
  - Tokenizers 0.15.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
 
3
  tags:
4
  - alignment-handbook
5
  - generated_from_trainer
 
10
  - jan-hq/bagel_sft_binarized
11
  - jan-hq/dolphin_binarized
12
  - jan-hq/openhermes_binarized
13
+ base_model: TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T
14
  model-index:
15
  - name: LlamaCorn-sft-adapter
16
  results: []
 
117
  - Pytorch 2.1.0+cu121
118
  - Datasets 2.14.6
119
  - Tokenizers 0.15.0
120
+
121
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
122
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_jan-hq__LlamaCorn-1.1B)
123
+
124
+ | Metric |Value|
125
+ |---------------------------------|----:|
126
+ |Avg. |36.94|
127
+ |AI2 Reasoning Challenge (25-Shot)|34.13|
128
+ |HellaSwag (10-Shot) |59.33|
129
+ |MMLU (5-Shot) |29.01|
130
+ |TruthfulQA (0-shot) |36.78|
131
+ |Winogrande (5-shot) |61.96|
132
+ |GSM8k (5-shot) | 0.45|
133
+