llama-3.2-1B-orpo / README.md
d4niel92's picture
Update README.md
4e86726 verified
metadata
library_name: transformers
datasets:
  - mlabonne/orpo-dpo-mix-40k
base_model:
  - meta-llama/Llama-3.2-1B

Model Card

Model Description

This is a Large Language Model (LLM) trained on a subset of the dataset "mlabonne/orpo-dpo-mix-40k".

Evaluation Results

Hellaswag

Metric Value
Accuracy 0.4517

How to Use

To use this model, simply download the checkpoint and load it into your preferred deep learning framework.