nbeerbower
/

llama3.1-cc-8B

Text Generation

text-generation-inference

Model card Files Files and versions

llama3.1-cc-8B

mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated finetuned on flammenai/casual-conversation-DPO.

This is an experimental finetune that formats the conversation data sequentially with the Llama 3 template.

Method

Finetuned using an A100 on Google Colab for 3 epochs.

Fine-tune Llama 3 with ORPO

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	20.13
IFEval (0-Shot)	50.68
BBH (3-Shot)	26.48
MATH Lvl 5 (4-Shot)	6.34
GPQA (0-shot)	4.70
MuSR (0-shot)	6.50
MMLU-PRO (5-shot)	26.08

Downloads last month: 1

Safetensors

Model size

8B params

Tensor type

BF16

·

Model tree for nbeerbower/llama3.1-cc-8B

Base model

meta-llama/Llama-3.1-8B

Finetuned

meta-llama/Llama-3.1-8B-Instruct

Finetuned

mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated

Finetuned

(8)

this model

Merges

1 model

Quantizations

Dataset used to train nbeerbower/llama3.1-cc-8B

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

50.680
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

26.480
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

6.340
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

4.700
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

6.500
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

26.080

View on Papers With Code