DamiFass's picture
Update README.md
a7ce2f3 verified
---
base_model:
- meta-llama/Llama-3.2-1B
---
## Model Description
This is the meta-llama/Llama-3.2-1B base model fine tuned on the mlabonne/orpo-dpo-mix-40k dataset.
## Evaluation Results
We used lm-evalutation-harness from EleutherAI to evaluate this fine-tuned version of meta-llama/Llama-3.2-1B on the 'Hellaswag' benchmark.
### Hellaswag
| Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
|---------|------:|------|-----:|--------|---|-----:|---|-----:|
|hellaswag| 1|none | 0|acc |↑ |0.4773|± |0.0050|
| | |none | 0|acc_norm|↑ |0.6358|± |0.0048|