Update README.md

README.md (changed)

````diff
@@ -143,13 +143,12 @@ print(res[0]["text"])
 ```
 # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_OEvortex__vortex-3b-v2)
-
-
-|Avg.
-|AI2 Reasoning Challenge (25-Shot)|39.68|
-|HellaSwag (10-Shot)
-|MMLU (5-Shot)
-|TruthfulQA (0-shot)
-|Winogrande (5-shot)
-|GSM8k (5-shot)
-|
+| Metric                            | vortex 3b | vortex 3b-v2 | dolly-v2-3b | pythia-2.8b-deduped |
+|-----------------------------------|----------:|-------------:|------------:|--------------------:|
+| Avg.                              |     35.76 |        37.46 |       25.26 |               36.72 |
+| AI2 Reasoning Challenge (25-Shot) |     31.91 |        39.68 |       22.83 |               36.26 |
+| HellaSwag (10-Shot)               |     56.89 |        65.04 |       26.55 |               60.66 |
+| MMLU (5-Shot)                     |     27.32 |        25.09 |        24.7 |               26.78 |
+| TruthfulQA (0-shot)               |     37.39 |        33.80 |           0 |               35.56 |
+| Winogrande (5-shot)               |     60.14 |        59.12 |       59.43 |               60.22 |
+| GSM8k (5-shot)                    |      0.91 |         2.05 |        1.86 |                0.83 |
````