Triangle104
/

Qwen2.5-7B-Instruct-Uncensored-Q4_K_S-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Triangle104 commited on 8 days ago

Commit

653be60

•

1 Parent(s): ad1ea08

Update README.md

Files changed (1) hide show

README.md +23 -0

README.md CHANGED Viewed

@@ -117,6 +117,29 @@ model-index:
 This model was converted to GGUF format from [`Orion-zhen/Qwen2.5-7B-Instruct-Uncensored`](https://huggingface.co/Orion-zhen/Qwen2.5-7B-Instruct-Uncensored) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/Orion-zhen/Qwen2.5-7B-Instruct-Uncensored) for more details on the model.
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)

 This model was converted to GGUF format from [`Orion-zhen/Qwen2.5-7B-Instruct-Uncensored`](https://huggingface.co/Orion-zhen/Qwen2.5-7B-Instruct-Uncensored) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/Orion-zhen/Qwen2.5-7B-Instruct-Uncensored) for more details on the model.
+---
+Model details:
+-
+This model is an uncensored fine-tune version of Qwen2.5-7B-Instruct.
+However, I can still notice that though uncensored, the model fails to
+generate detailed descriptions on certain extreme scenarios, which might
+ be associated with deletion on some pretrain datasets in Qwen's
+pretraining stage.
+Traning details
+-
+I used SFT + DPO to ensure uncensorment as well as trying to maintain original model's capabilities.
+SFT:
+NobodyExistsOnTheInternet/ToxicQAFinal
+anthracite-org/kalo-opus-instruct-22k-no-refusal
+DPO:
+Orion-zhen/dpo-toxic-zh
+unalignment/toxic-dpo-v0.2
+Crystalcareai/Intel-DPO-Pairs-Norefusals
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)