meoo225 committed on
Commit
03a9de3
1 Parent(s): c679b62

End of training

README.md CHANGED
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
   
  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.0809
- - Bleu Score: 25.4116
- - Gen Len: 18.8124
+ - Loss: 0.1032
+ - Bleu Score: 24.5858
+ - Gen Len: 18.81
   
  ## Model description
   
@@ -37,22 +37,21 @@ More information needed
  ### Training hyperparameters
   
  The following hyperparameters were used during training:
- - learning_rate: 0.0002
+ - learning_rate: 0.0001
  - train_batch_size: 8
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
- - num_epochs: 4
+ - num_epochs: 3
   
  ### Training results
   
  | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:----------:|:-------:|
- | 0.2003 | 1.0 | 838 | 0.1201 | 24.1793 | 18.8017 |
- | 0.11 | 2.0 | 1676 | 0.0926 | 24.8581 | 18.81 |
- | 0.0832 | 3.0 | 2514 | 0.0833 | 25.3868 | 18.8136 |
- | 0.0674 | 4.0 | 3352 | 0.0809 | 25.4116 | 18.8124 |
+ | 0.2198 | 1.0 | 838 | 0.1347 | 23.608 | 18.7897 |
+ | 0.1309 | 2.0 | 1676 | 0.1099 | 24.3086 | 18.8088 |
+ | 0.1079 | 3.0 | 2514 | 0.1032 | 24.5858 | 18.81 |
   
   
  ### Framework versions
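
To make the effect of the changed hyperparameters easier to see, here is a minimal Python sketch that uses only the numbers in the README.md diff above. It computes the learning rate a linear schedule would give at each epoch boundary and the change in the final evaluation metrics. The `linear_lr` helper is hypothetical and is not the training code from this repository, and `warmup_steps=0` is an assumption, since the model card lists no warmup setting.

```python
# Values copied from the "+" side of the README.md diff.
LEARNING_RATE = 1e-4    # learning_rate: 0.0001
STEPS_PER_EPOCH = 838   # step count logged at epoch 1.0
NUM_EPOCHS = 3          # num_epochs: 3
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS


def linear_lr(step, base_lr=LEARNING_RATE, total_steps=TOTAL_STEPS, warmup_steps=0):
    """Linear warmup followed by linear decay to zero.

    warmup_steps=0 is an assumption; the model card does not state a warmup.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Final evaluation metrics before (-) and after (+) this commit.
before = {"loss": 0.0809, "bleu": 25.4116, "gen_len": 18.8124}
after = {"loss": 0.1032, "bleu": 24.5858, "gen_len": 18.81}
delta = {name: round(after[name] - before[name], 4) for name in before}

if __name__ == "__main__":
    for epoch in range(NUM_EPOCHS + 1):
        step = epoch * STEPS_PER_EPOCH
        print(f"step {step:4d}: lr = {linear_lr(step):.2e}")
    print(delta)
```

Under these assumptions the run covers 2514 optimizer steps, and the new card reports a higher evaluation loss and a lower BLEU score than the previous 4-epoch run at learning rate 0.0002.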
logs/events.out.tfevents.1731548527.feef9f44965a.800.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:7b00decd1122e272a72b73fa702eb7237e02cf94ee83b4ca9464c15a4bbe0320
- size 7459
+ oid sha256:aea16b4662e6afa8d5883d88ecb82eeadaede33a7472ac461b5ae8267dae8e21
+ size 8189
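
The event-log change is a Git LFS pointer update: only the object hash and the size differ. As a rough illustration, the Python sketch below parses the two pointer versions shown in this diff and reports how much the tracked file grew. `parse_lfs_pointer` is a hypothetical helper written for this example, not part of Git LFS or any Hugging Face tooling.

```python
def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size in bytes of the real file
    }


# Pointer contents copied from the "-" and "+" sides of the diff.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:7b00decd1122e272a72b73fa702eb7237e02cf94ee83b4ca9464c15a4bbe0320
size 7459
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:aea16b4662e6afa8d5883d88ecb82eeadaede33a7472ac461b5ae8267dae8e21
size 8189
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
growth = new["size"] - old["size"]

if __name__ == "__main__":
    print(f"{old['hash_algo']} digest changed: {old['digest'] != new['digest']}")
    print(f"size grew by {growth} bytes")  # 8189 - 7459 = 730
```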