---
|
base_model: |
|
- meta-llama/Llama-3.2-1B |
|
--- |
|
|
|
## Model Description |
|
|
|
This model is meta-llama/Llama-3.2-1B fine-tuned on the mlabonne/orpo-dpo-mix-40k dataset.
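A minimal usage sketch with the `transformers` library. The hub id below is a placeholder, not the actual repository name of this fine-tune; substitute wherever the checkpoint is hosted.

```python
# Minimal inference sketch with Hugging Face transformers.
# "your-username/llama-3.2-1b-orpo" is a PLACEHOLDER hub id for this
# fine-tune; replace it with the actual repository name or local path.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-username/llama-3.2-1b-orpo"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

inputs = tokenizer("The capital of France is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```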
|
|
|
## Evaluation Results |
|
|
|
We evaluated this fine-tuned version of meta-llama/Llama-3.2-1B on the HellaSwag benchmark using EleutherAI's lm-evaluation-harness.
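The run can be reproduced with the harness CLI along these lines. The model path is a placeholder for this fine-tune's checkpoint, and the batch size is an assumption, not the setting used here.

```shell
# Reproduce the zero-shot HellaSwag evaluation with lm-evaluation-harness.
# "path/to/finetuned-model" is a PLACEHOLDER for the local checkpoint
# or hub id of this fine-tune.
pip install lm-eval
lm_eval --model hf \
    --model_args pretrained=path/to/finetuned-model \
    --tasks hellaswag \
    --num_fewshot 0 \
    --batch_size 8
```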
|
|
|
### HellaSwag
|
|
|
| Tasks | Version | Filter | n-shot | Metric | Value | Stderr |
|-----------|--------:|--------|-------:|----------|-------:|----------|
| hellaswag | 1 | none | 0 | acc | 0.4773 | ± 0.0050 |
| | | none | 0 | acc_norm | 0.6358 | ± 0.0048 |
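The difference between `acc` and `acc_norm` above comes from length normalization: `acc` picks the candidate ending with the highest raw log-likelihood, while `acc_norm` divides each log-likelihood by the completion's byte length, which removes the bias against longer endings. A toy sketch (the function name and numbers are illustrative, not the harness API):

```python
# Toy illustration of multiple-choice scoring as used for HellaSwag.
# pick_answer and the example numbers are illustrative only.

def pick_answer(logliks, completions, normalize=False):
    """Return the index of the chosen completion.

    acc      -> argmax of raw summed log-likelihood
    acc_norm -> argmax of log-likelihood per completion byte
    """
    scores = []
    for ll, text in zip(logliks, completions):
        if normalize:
            scores.append(ll / len(text.encode("utf-8")))  # length-normalized
        else:
            scores.append(ll)
    return max(range(len(scores)), key=scores.__getitem__)

completions = ["a short ending.", "a much longer but more plausible ending."]
logliks = [-12.0, -18.0]  # longer endings accumulate more negative mass

print(pick_answer(logliks, completions))                  # raw score: index 0
print(pick_answer(logliks, completions, normalize=True))  # normalized: index 1
```

Under raw scoring the short ending wins (-12.0 > -18.0); after byte-length normalization the longer ending wins (-18.0/40 = -0.45 > -12.0/15 = -0.80).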