princeton-nlp
/

gemma-2-9b-it-SimPO

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

princeton-nlp commited on Jul 16

Commit

16c27b0

•

1 Parent(s): c580d90

Update README.md

Files changed (1) hide show

README.md +4 -25

README.md CHANGED Viewed

@@ -61,32 +61,11 @@ Fine-tuning the [google/gemma-2-9b-it](https://huggingface.co/google/gemma-2-9b-
 ## Evaluation
-### Testing Data, Factors & Metrics
-#### Testing Data
-<!-- This should link to a Dataset Card if possible. -->
-[More Information Needed]
-#### Factors
-<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
-[More Information Needed]
-#### Metrics
-<!-- These are the evaluation metrics being used, ideally with a description of why. -->
-[More Information Needed]
-### Results
-[More Information Needed]
-#### Summary
 ## Technical Specifications

 ## Evaluation
+| Model    | AlpacaEval 2 LC Win Rate | AlpacaEval 2 Raw Win Rate | Arena-Hard Win Rate | WildBench Elo |
+| :-------- | :------- | :------- | :------- | :------- |
+| gemma-2-9b-it | 51.1 | 38.1 | 40.8 | 1049.5 |
+| gemma-2-9b-it-SimPO |  |  |  |  |
 ## Technical Specifications