georgesung commited on
Commit
0dfa493
·
1 Parent(s): 24ab6ee

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -0
README.md ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ datasets:
4
+ - georgesung/wizard_vicuna_70k_unfiltered
5
+ ---
6
+
7
+ # Overview
8
+ Fine-tuned [Llama-3 8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) with an uncensored/unfiltered Wizard-Vicuna conversation dataset.
9
+ Used QLoRA for fine-tuning.
10
+
11
+ The model here includes the fp32 HuggingFace version, as well as a [4-bit q4_0 gguf version](https://huggingface.co/georgesung/llama3_8b_chat_uncensored/resolve/main/llama3_8b_chat_uncensored_q4_0.gguf?download=true).
12
+
13
+ # Prompt style
14
+ The model was trained with the following prompt style:
15
+ ```
16
+ ### HUMAN:
17
+ Hello
18
+
19
+ ### RESPONSE:
20
+ Hi, how are you?
21
+
22
+ ### HUMAN:
23
+ I'm fine.
24
+
25
+ ### RESPONSE:
26
+ How can I help you?
27
+ ...
28
+ ```
29
+
30
+ # Training code
31
+ Code used to train the model is available [here](https://github.com/georgesung/llm_qlora).
32
+
33
+ To reproduce the results:
34
+ ```
35
+ git clone https://github.com/georgesung/llm_qlora
36
+ cd llm_qlora
37
+ pip install -r requirements.txt
38
+ python train.py configs/llama3_8b_chat_uncensored.yaml
39
+ ```
40
+
41
+ # Fine-tuning guide
42
+ https://georgesung.github.io/ai/qlora-ift/