meoo225
/

FLANT5_base

@@ -19,12 +19,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0809
-- Bleu Score: 25.4116
-- Precision: 6.3321
-- Recall: 6.3321
-- Gen Len: 18.8124
-- Err: 6.3321
 ## Model description
@@ -43,22 +43,21 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0002
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 4
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Precision | Recall | Gen Len | Err    |
 |:-------------:|:-----:|:----:|:---------------:|:----------:|:---------:|:------:|:-------:|:------:|
-| 0.2003        | 1.0   | 838  | 0.1201          | 24.1793    | 5.0179    | 5.0179 | 18.8017 | 5.0179 |
-| 0.11          | 2.0   | 1676 | 0.0926          | 24.8581    | 5.7348    | 5.7348 | 18.81   | 5.7348 |
-| 0.0832        | 3.0   | 2514 | 0.0833          | 25.3868    | 6.0932    | 6.0932 | 18.8136 | 6.0932 |
-| 0.0674        | 4.0   | 3352 | 0.0809          | 25.4116    | 6.3321    | 6.3321 | 18.8124 | 6.3321 |
 ### Framework versions

 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1032
+- Bleu Score: 24.5858
+- Precision: 5.6153
+- Recall: 5.6153
+- Gen Len: 18.81
+- Err: 5.6153
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0001
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Precision | Recall | Gen Len | Err    |
 |:-------------:|:-----:|:----:|:---------------:|:----------:|:---------:|:------:|:-------:|:------:|
+| 0.2198        | 1.0   | 838  | 0.1347          | 23.608     | 4.779     | 4.779  | 18.7897 | 4.779  |
+| 0.1309        | 2.0   | 1676 | 0.1099          | 24.3086    | 5.3763    | 5.3763 | 18.8088 | 5.3763 |
+| 0.1079        | 3.0   | 2514 | 0.1032          | 24.5858    | 5.6153    | 5.6153 | 18.81   | 5.6153 |
 ### Framework versions

logs/events.out.tfevents.1728103469.cf12d7ff8fb4.1938.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f4b00bf16a711a877124400333135b44e9c5225e6c0af23741ba160720080a5f
-size 7424

 version https://git-lfs.github.com/spec/v1
+oid sha256:7f639452e0e88934890e5dd4c10d22e2ce1330e61b4ccae33c6f4cc8ee658532
+size 8515