Training complete

Browse files

Files changed (5) hide show

README.md +71 -0
generation_config.json +6 -0
model.safetensors +1 -1
runs/May16_17-38-22_5bf747baa948/events.out.tfevents.1715881117.5bf747baa948.2031.0 +2 -2
runs/May16_17-38-22_5bf747baa948/events.out.tfevents.1715883574.5bf747baa948.2031.1 +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,71 @@

+---
+license: apache-2.0
+base_model: google/mt5-small
+tags:
+- summarization
+- generated_from_trainer
+metrics:
+- rouge
+model-index:
+- name: mt5-small-finetuned-news-summary-kaggle
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# mt5-small-finetuned-news-summary-kaggle
+This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 2.5691
+- Rouge1: 29.8831
+- Rouge2: 11.6462
+- Rougel: 26.8481
+- Rougelsum: 26.8856
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5.6e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 8
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
+| 8.1234        | 1.0   | 440  | 3.3123          | 18.1687 | 5.9565  | 16.7581 | 16.7495   |
+| 4.2107        | 2.0   | 880  | 2.8404          | 23.0004 | 8.3723  | 20.8424 | 20.9312   |
+| 3.738         | 3.0   | 1320 | 2.7354          | 26.5882 | 10.1061 | 23.9299 | 24.0001   |
+| 3.4864        | 4.0   | 1760 | 2.6756          | 27.2242 | 10.1775 | 24.4504 | 24.5062   |
+| 3.3642        | 5.0   | 2200 | 2.6224          | 28.7857 | 11.5222 | 26.2568 | 26.3167   |
+| 3.269         | 6.0   | 2640 | 2.5883          | 29.6623 | 11.7765 | 26.8117 | 26.906    |
+| 3.212         | 7.0   | 3080 | 2.5677          | 29.7811 | 11.635  | 26.5844 | 26.6327   |
+| 3.186         | 8.0   | 3520 | 2.5691          | 29.8831 | 11.6462 | 26.8481 | 26.8856   |
+### Framework versions
+- Transformers 4.40.2
+- Pytorch 2.2.1+cu121
+- Datasets 2.19.1
+- Tokenizers 0.19.1

generation_config.json ADDED Viewed

	@@ -0,0 +1,6 @@

+{
+  "decoder_start_token_id": 0,
+  "eos_token_id": 1,
+  "pad_token_id": 0,
+  "transformers_version": "4.40.2"
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:479ce86521f75c3566400c11ce807b89bf097fc7a093793837325073898d83be
 size 1200729512

 version https://git-lfs.github.com/spec/v1
+oid sha256:3a1c524335d2a2ab2ba424c43409c7c92be5d39590628041f06e52b61e2af869
 size 1200729512

runs/May16_17-38-22_5bf747baa948/events.out.tfevents.1715881117.5bf747baa948.2031.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9ecb484714eaf2f74aead82d3f24dca7fd0590a6704a3377b7883334adcf42a6
-size 10028

 version https://git-lfs.github.com/spec/v1
+oid sha256:1cdeaea66c17a9d673abca794a99e6ac5d66b856a2245eb61fb10b3d1780813a
+size 10856

runs/May16_17-38-22_5bf747baa948/events.out.tfevents.1715883574.5bf747baa948.2031.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2c3943bb24727f30eb95e979a7604531b31adcd8696001538035c714ef5ff1a8
+size 562