Update README.md
Browse files
README.md
CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
16 |
|
17 |
# tiny-chatbot-dpo
|
18 |
|
19 |
-
This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0) on the
|
20 |
|
21 |
## Model description
|
22 |
|
@@ -54,4 +54,5 @@ The following hyperparameters were used during training:
|
|
54 |
- Transformers 4.40.2
|
55 |
- Pytorch 2.2.1+cu121
|
56 |
- Datasets 2.19.1
|
57 |
-
- Tokenizers 0.19.1
|
|
|
|
16 |
|
17 |
# tiny-chatbot-dpo
|
18 |
|
19 |
+
This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0) on the Anthropic/hh-rlhf dataset.
|
20 |
|
21 |
## Model description
|
22 |
|
|
|
54 |
- Transformers 4.40.2
|
55 |
- Pytorch 2.2.1+cu121
|
56 |
- Datasets 2.19.1
|
57 |
+
- Tokenizers 0.19.1
|
58 |
+
- Lora,qLora
|