madatnlp/sk-kogptv2-kormath-causal

Text Generation

generated_from_keras_callback


madatnlp committed on May 16, 2022

Commit becb1cb · 1 Parent(s): f647f95

End of training

Files changed (2)

README.md +18 -4
tf_model.h5 +1 -1

README.md CHANGED

@@ -14,9 +14,9 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [skt/kogpt2-base-v2](https://huggingface.co/skt/kogpt2-base-v2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 3.1786
-- Validation Loss: 2.0114
-- Epoch: 0
 ## Model description
@@ -35,7 +35,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'Adam', 'learning_rate': 1e-04, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
 - training_precision: float32
 ### Training results
@@ -43,6 +43,20 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
 | 3.1786     | 2.0114          | 0     |
 ### Framework versions

 This model is a fine-tuned version of [skt/kogpt2-base-v2](https://huggingface.co/skt/kogpt2-base-v2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 0.2604
+- Validation Loss: 1.7970
+- Epoch: 14
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'Adam', 'learning_rate': 0.00094450003, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
 - training_precision: float32
 ### Training results
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
 | 3.1786     | 2.0114          | 0     |
+| 1.7468     | 1.6093          | 1     |
+| 1.3809     | 1.3738          | 2     |
+| 1.1486     | 1.4108          | 3     |
+| 0.9985     | 1.3539          | 4     |
+| 0.8685     | 1.4534          | 5     |
+| 0.7873     | 1.4854          | 6     |
+| 0.7021     | 1.5091          | 7     |
+| 0.6443     | 1.5531          | 8     |
+| 0.6021     | 1.6022          | 9     |
+| 0.5328     | 1.6028          | 10    |
+| 0.4563     | 1.4180          | 11    |
+| 0.3806     | 1.6496          | 12    |
+| 0.3180     | 1.8589          | 13    |
+| 0.2604     | 1.7970          | 14    |
 ### Framework versions
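
The updated "Training results" table can be read programmatically to see where validation loss bottoms out relative to the final epoch. The following is a minimal sketch, not part of the commit: the `HISTORY` list is transcribed by hand from the table above, and the helper names `best_epoch` and `generalization_gap` are hypothetical.

```python
# Rows transcribed from the "Training results" table in this diff:
# (epoch, train_loss, validation_loss)
HISTORY = [
    (0, 3.1786, 2.0114),
    (1, 1.7468, 1.6093),
    (2, 1.3809, 1.3738),
    (3, 1.1486, 1.4108),
    (4, 0.9985, 1.3539),
    (5, 0.8685, 1.4534),
    (6, 0.7873, 1.4854),
    (7, 0.7021, 1.5091),
    (8, 0.6443, 1.5531),
    (9, 0.6021, 1.6022),
    (10, 0.5328, 1.6028),
    (11, 0.4563, 1.4180),
    (12, 0.3806, 1.6496),
    (13, 0.3180, 1.8589),
    (14, 0.2604, 1.7970),
]


def best_epoch(history):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(history, key=lambda row: row[2])


def generalization_gap(history):
    """Return (epoch, validation_loss - train_loss) per epoch, rounded to 4 places."""
    return [(epoch, round(val - train, 4)) for epoch, train, val in history]


if __name__ == "__main__":
    epoch, train, val = best_epoch(HISTORY)
    print(f"lowest validation loss: {val} at epoch {epoch} (train loss {train})")
    print(f"final-epoch gap: {generalization_gap(HISTORY)[-1]}")
```

On these numbers the lowest validation loss is 1.3539 at epoch 4, while the reported final epoch (14) has a validation loss of 1.7970 and a train loss of 0.2604. A widening gap of this kind is commonly read as overfitting, though the commit itself does not comment on it.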

tf_model.h5 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f99bc6a434a26b1968e35ee9afb085b547368b8cf87cdb947ab3cb15fb1f6143
 size 658153136

 version https://git-lfs.github.com/spec/v1
+oid sha256:205a27cf2a6c547b92d23588b76d6050cb9ff85fed78db650f5f5d3f45de70c5
 size 658153136
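
Only the `oid` line of the Git LFS pointer changes here; the `size` stays at 658153136 bytes, as expected when the same architecture is saved with new weights. A downloaded `tf_model.h5` can be checked against the new pointer along these lines. This is a sketch under stated assumptions: `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the pointer text is copied from the diff above.

```python
import hashlib

# New pointer contents, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:205a27cf2a6c547b92d23588b76d6050cb9ff85fed78db650f5f5d3f45de70c5
size 658153136
"""


def parse_lfs_pointer(text):
    """Parse 'key value' lines of a Git LFS pointer into a dict (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and hash recorded in the pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as handle:
        # Read in chunks so a ~650 MB weights file is never loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]
```

Usage would look like `matches_pointer("tf_model.h5", parse_lfs_pointer(NEW_POINTER))`, where the local path is an assumption about where the file was saved.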