
Qwen2.5-7B-Instruct-Uncensored

This model is an uncensored fine-tune of Qwen2.5-7B-Instruct. Although it is uncensored, I have noticed that it still fails to generate detailed descriptions of certain extreme scenarios, which may be because some data was removed from the datasets used in Qwen's pretraining stage.

Check out my roleplay- and writing-enhanced model based on this one: Orion-zhen/Meissa-Qwen2.5-7B-Instruct

Training details

I used SFT followed by DPO to uncensor the model while trying to preserve the original model's capabilities.

SFT:
- NobodyExistsOnTheInternet/ToxicQAFinal
- anthracite-org/kalo-opus-instruct-22k-no-refusal
DPO:
- Orion-zhen/dpo-toxic-zh
- unalignment/toxic-dpo-v0.2
- Crystalcareai/Intel-DPO-Pairs-Norefusals
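
For readers unfamiliar with the DPO stage, the following is a minimal sketch of the standard DPO objective for a single preference pair, written in plain Python. It is not the training code used for this model. The function name `dpo_loss`, the argument names, and the value `beta=0.1` are illustrative assumptions; the card does not state the hyperparameters that were actually used.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair.

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    beta=0.1 is an assumed value, not one taken from this model card.
    """
    # Implicit rewards: how much more likely the policy makes each response
    # compared with the reference model.
    chosen_reward = policy_chosen_logp - ref_chosen_logp
    rejected_reward = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_reward - rejected_reward)

    # -log(sigmoid(margin)), computed in a numerically stable way.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# If the policy equals the reference, the margin is 0 and the loss is log(2).
baseline = dpo_loss(-10.0, -12.0, -10.0, -12.0)

# If the policy raises the chosen response and lowers the rejected one,
# the loss drops below that baseline.
improved = dpo_loss(-8.0, -15.0, -10.0, -12.0)
```

With preference datasets like the ones listed above, the "chosen" response would be the non-refusing answer and the "rejected" response the refusal, so minimizing this loss pushes the model away from refusals relative to the reference model.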

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	27.99
IFEval (0-Shot)	72.04
BBH (3-Shot)	35.83
MATH Lvl 5 (4-Shot)	1.36
GPQA (0-shot)	7.05
MuSR (0-shot)	13.58
MMLU-PRO (5-shot)	38.07
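
The reported average appears to be the unweighted mean of the six benchmark scores in the table. A quick check in Python, assuming that simple averaging is how the leaderboard computes it:

```python
# Scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 72.04,
    "BBH (3-Shot)": 35.83,
    "MATH Lvl 5 (4-Shot)": 1.36,
    "GPQA (0-shot)": 7.05,
    "MuSR (0-shot)": 13.58,
    "MMLU-PRO (5-shot)": 38.07,
}

# Unweighted mean across the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 27.99
```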

Safetensors

Model size: 8B params
Tensor type: BF16

Model tree for Orion-zhen/Qwen2.5-7B-Instruct-Uncensored

Base model: Qwen/Qwen2.5-7B
Finetuned: Qwen/Qwen2.5-7B-Instruct
Finetuned: this model

Evaluation results

Benchmark	Metric	Value	Source
IFEval (0-Shot)	strict accuracy	72.040	Open LLM Leaderboard
BBH (3-Shot)	normalized accuracy	35.830	Open LLM Leaderboard
MATH Lvl 5 (4-Shot)	exact match	1.360	Open LLM Leaderboard
GPQA (0-shot)	acc_norm	7.050	Open LLM Leaderboard
MuSR (0-shot)	acc_norm	13.580	Open LLM Leaderboard
MMLU-PRO (5-shot), test set	accuracy	38.070	Open LLM Leaderboard
