Model Card for Model ID

Just testing out LLM Finetuning. Finetuned on upstage/SOLAR-10.7B-Instruct-v1.0 using argilla/distilabel-intel-orca-dpo-pairs. Followed the Google Colab mentioned in this article: https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	74.08
AI2 Reasoning Challenge (25-Shot)	71.25
HellaSwag (10-Shot)	88.34
MMLU (5-Shot)	66.04
TruthfulQA (0-shot)	71.36
Winogrande (5-shot)	83.19
GSM8k (5-shot)	64.29

Downloads last month: 193

Safetensors

Model size

10.7B params

Tensor type

FP16

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.