patrixtano committed on
Commit
0e5b67a
1 Parent(s): b1bab76

End of training

Files changed (2)
  1. README.md +9 -9
  2. model.safetensors +1 -1
README.md CHANGED
@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5256
-- Score: 43.6341
+- Loss: 0.0560
+- Score: 28.8160
 - Char Order: 6
 - Word Order: 0
 - Beta: 2
@@ -40,8 +40,8 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
+- train_batch_size: 2
+- eval_batch_size: 2
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -49,11 +49,11 @@ The following hyperparameters were used during training:
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | Score | Char Order | Word Order | Beta |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|:----------:|:----------:|:----:|
-| 1.2999 | 1.0 | 2105 | 0.7759 | 36.9398 | 6 | 0 | 2 |
-| 0.87 | 2.0 | 4210 | 0.5735 | 41.0183 | 6 | 0 | 2 |
-| 0.7796 | 3.0 | 6315 | 0.5256 | 43.6341 | 6 | 0 | 2 |
+| Training Loss | Epoch | Step | Validation Loss | Score | Char Order | Word Order | Beta |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:----------:|:----------:|:----:|
+| 0.1671 | 1.0 | 23181 | 0.0741 | 28.6976 | 6 | 0 | 2 |
+| 0.1169 | 2.0 | 46362 | 0.0598 | 28.7935 | 6 | 0 | 2 |
+| 0.1072 | 3.0 | 69543 | 0.0560 | 28.8160 | 6 | 0 | 2 |
 
 
 ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c43f372203633cca77b599fdcc503eba5c6592663f83f17575d36e6dc9ad48e2
+oid sha256:a686c0a08a1e4aa831b99de81bf96fc95d655b4b97cc3dffc19c6ad834862cf2
 size 1200729512
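
Reviewer note: the step counts in the README diff can be sanity-checked against the batch sizes. The sketch below is not part of the commit. It assumes one optimizer step per batch, a single device, no gradient accumulation, and a possibly partial final batch; none of these is stated in the model card. The helper name `example_range` is hypothetical.

```python
def example_range(steps_per_epoch: int, batch_size: int) -> tuple[int, int]:
    """Return (min, max) training examples per epoch implied by the step count.

    Assumes one step per batch and that only the last batch may be partial.
    """
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return low, high


# Steps per epoch and batch sizes taken from the two versions of the README.
old = example_range(2105, 16)   # previous run: train_batch_size 16
new = example_range(23181, 2)   # this commit:  train_batch_size 2

print("previous run:", old)
print("this commit: ", new)
```

Under these assumptions the two ranges do not overlap, so the batch size is probably not the only thing that changed between the runs (for example, the training set may differ), or one of the assumptions does not hold. The card itself only says "unknown dataset", so this stays an open question.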