DeepKarkhanis's picture
Update README.md
eee432c verified
metadata
license: apache-2.0
datasets:
  - abacusai/MetaMathFewshot

image/png

DPO finetune of our MetaMath SFT Model on the Truthy DPO dataset

Evaluation Results

Average ARC HellaSwag MMLU TruthfulQA Winogrande GSM8K
75.54 69.20 84.34 76.46 67.58 82.87 72.78