metadata

library_name: transformers
datasets:
  - mlabonne/orpo-dpo-mix-40k
base_model:
  - meta-llama/Llama-3.2-1B

Model Card

Model Description

This is a Large Language Model (LLM) trained on a subset of the dataset "mlabonne/orpo-dpo-mix-40k".

Metric	Value
Accuracy	0.4517

To use this model, simply download the checkpoint and load it into your preferred deep learning framework.