rathi2023 committed on
Commit c51e1cd
1 Parent(s): e7f13a4

End of training

Files changed (1)
  1. README.md +5 -5
README.md CHANGED
@@ -1,6 +1,6 @@
  ---
  license: mit
- base_model: TheBloke/zephyr-7B-beta-GPTQ
+ base_model: TheBloke/zephyr-7B-alpha-GPTQ
  tags:
  - generated_from_trainer
  model-index:
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  # zephy_finetuned_nvidia_chatbot
 
- This model is a fine-tuned version of [TheBloke/zephyr-7B-beta-GPTQ](https://huggingface.co/TheBloke/zephyr-7B-beta-GPTQ) on the None dataset.
+ This model is a fine-tuned version of [TheBloke/zephyr-7B-alpha-GPTQ](https://huggingface.co/TheBloke/zephyr-7B-alpha-GPTQ) on the None dataset.
 
  ## Model description
 
@@ -38,7 +38,7 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: cosine
- - training_steps: 250
+ - training_steps: 10
 
  ### Training results
 
@@ -46,7 +46,7 @@ The following hyperparameters were used during training:
 
  ### Framework versions
 
- - Transformers 4.34.1
- - Pytorch 2.1.0+cu118
+ - Transformers 4.34.0
+ - Pytorch 2.0.1+cu117
  - Datasets 2.14.6
  - Tokenizers 0.14.1