ericzzz committed on
Commit 68d6f7b • 1 Parent(s): 300e555

Update README.md

Files changed (1)
  1. README.md +3 -4
README.md CHANGED
@@ -18,7 +18,7 @@ Falcon-RW-1B-Instruct-OpenOrca is a 1B parameter, causal decoder-only model base
 
 **📊 Evaluation Results**
 
-Falcon-RW-1B-Instruct-OpenOrca is the #1 ranking model on [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard) in ~1.5B parameters category! A detailed result can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_ericzzz__falcon-rw-1b-instruct-openorca).
+Falcon-RW-1B-Instruct-OpenOrca was the #1 ranking model (unfortunately not anymore) on [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard) in ~1.5B parameters category! A detailed result can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_ericzzz__falcon-rw-1b-instruct-openorca).
 
 | Metric     | falcon-rw-1b-instruct-openorca | falcon-rw-1b |
 |------------|-------------------------------:|-------------:|
@@ -27,9 +27,8 @@ Falcon-RW-1B-Instruct-OpenOrca is the #1 ranking model on [Open LLM Leaderboard]
 | MMLU       | 28.77 | 25.28 |
 | TruthfulQA | 37.42 | 35.96 |
 | Winogrande | 60.69 | 62.04 |
-| GSM8K      | 1.21 | 0.53 |
-| DROP       | 21.94 | 4.64 |
-| **Average**| **35.08** | **32.44** |
+| GSM8K      | 3.41 | 0.53 |
+| **Average**| **37.63** | **37.07** |
 
 **🚀 Motivations**
 1. To create a smaller, open-source, instruction-finetuned, ready-to-use model accessible for users with limited computational resources (lower-end consumer GPUs).