Commit 224c8da by hieunguyenminh
1 Parent(s): a259e63
Update README.md
README.md CHANGED
@@ -17,19 +17,15 @@ should probably proofread and complete it, then remove this comment. -->
 
 # v3
 
-This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on the
+This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on the [hieunguyenminh/roleplay](hieunguyenminh/roleplay) dataset.
 
 ## Model description
 
-
-
-## Intended uses & limitations
-
-More information needed
+This model can adapt to any type of characters and provide answer that personalize that character.
 
 ## Training and evaluation data
 
-
+It is trained with supervised learning and will be trained with DPO in the future.
 
 ## Training procedure
 
@@ -47,7 +43,7 @@ The following hyperparameters were used during training:
 
 ### Training results
 
-
+Loss after 1 epochs: 0.6
 
 ### Framework versions
 