pakawadeep
/

mt5-small-finetuned-ctfl-backtranslation_7k

Text2Text Generation

generated_from_keras_callback

Inference Endpoints

Model card Files Files and versions Community

pakawadeep commited on Sep 16

Commit

d26a940

•

1 Parent(s): 2eba4f2

Training in progress epoch 14

Files changed (2) hide show

README.md +5 -4
tf_model.h5 +1 -1

README.md CHANGED Viewed

@@ -16,11 +16,11 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 5.5347
-- Validation Loss: 4.7700
-- Train Bleu: 0.0
 - Train Gen Len: 127.0
-- Epoch: 13
 ## Model description
@@ -60,6 +60,7 @@ The following hyperparameters were used during training:
 | 5.9571     | 4.9619          | 0.0003     | 127.0         | 11    |
 | 5.7344     | 4.8588          | 0.0        | 7.0           | 12    |
 | 5.5347     | 4.7700          | 0.0        | 127.0         | 13    |
 ### Framework versions

 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 5.4059
+- Validation Loss: 4.7055
+- Train Bleu: 0.0005
 - Train Gen Len: 127.0
+- Epoch: 14
 ## Model description
 | 5.9571     | 4.9619          | 0.0003     | 127.0         | 11    |
 | 5.7344     | 4.8588          | 0.0        | 7.0           | 12    |
 | 5.5347     | 4.7700          | 0.0        | 127.0         | 13    |
+| 5.4059     | 4.7055          | 0.0005     | 127.0         | 14    |
 ### Framework versions

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ea79c52f785696cb5265a1c0a04aca6f3638857a0d468d48d1c1c7284a5758b9
 size 2225556280

 version https://git-lfs.github.com/spec/v1
+oid sha256:9b12081175657a084ca3f3fd78e4b71829a9c193e171f1e9f30b4e653a9ad6bf
 size 2225556280