Lakshmi12
/

tiny-chatbot-dpo

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

Lakshmi12 commited on May 19

Commit

59bce53

•

1 Parent(s): f5cee7f

Update README.md

Files changed (1) hide show

README.md +3 -2

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 # tiny-chatbot-dpo
-This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0) on the None dataset.
 ## Model description
@@ -54,4 +54,5 @@ The following hyperparameters were used during training:
 - Transformers 4.40.2
 - Pytorch 2.2.1+cu121
 - Datasets 2.19.1
-- Tokenizers 0.19.1

 # tiny-chatbot-dpo
+This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v1.0](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0) on the Anthropic/hh-rlhf dataset.
 ## Model description
 - Transformers 4.40.2
 - Pytorch 2.2.1+cu121
 - Datasets 2.19.1
+- Tokenizers 0.19.1
+- Lora,qLora