johnpaulett committed on
Commit e0fe23b · verified · 1 Parent(s): a2e2267

End of training

Files changed (2)
  1. README.md +22 -6
  2. model.safetensors +1 -1
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.6788
+ - Loss: 1.6936

  ## Model description

@@ -42,16 +42,32 @@ The following hyperparameters were used during training:
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_ratio: 0.1
- - num_epochs: 10
+ - num_epochs: 20

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 1.867 | 1.0 | 248 | 1.6655 |
- | 1.7168 | 2.0 | 496 | 1.7754 |
- | 1.6528 | 3.0 | 744 | 1.7000 |
- | 1.5864 | 4.0 | 992 | 1.6788 |
+ | 1.8693 | 1.0 | 248 | 1.5996 |
+ | 1.6968 | 2.0 | 496 | 1.7973 |
+ | 1.7187 | 3.0 | 744 | 1.7232 |
+ | 1.6518 | 4.0 | 992 | 1.7343 |
+ | 1.5003 | 5.0 | 1240 | 1.7727 |
+ | 1.3346 | 6.0 | 1488 | 1.7357 |
+ | 1.4029 | 7.0 | 1736 | 1.7164 |
+ | 1.2762 | 8.0 | 1984 | 1.7123 |
+ | 1.2441 | 9.0 | 2232 | 1.6978 |
+ | 1.2016 | 10.0 | 2480 | 1.7374 |
+ | 1.1887 | 11.0 | 2728 | 1.7076 |
+ | 1.0205 | 12.0 | 2976 | 1.6736 |
+ | 1.0771 | 13.0 | 3224 | 1.7209 |
+ | 1.0607 | 14.0 | 3472 | 1.6753 |
+ | 0.909 | 15.0 | 3720 | 1.6172 |
+ | 0.9255 | 16.0 | 3968 | 1.7418 |
+ | 0.8676 | 17.0 | 4216 | 1.6914 |
+ | 0.8533 | 18.0 | 4464 | 1.7310 |
+ | 0.845 | 19.0 | 4712 | 1.7893 |
+ | 0.869 | 20.0 | 4960 | 1.6936 |


  ### Framework versions
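
The added rows of the training results table can be read more easily with a few lines of Python. The following is a minimal sketch, assuming the values are transcribed by hand from the `+` rows of the README.md diff; the variable names are illustrative and not part of the repository.

```python
# (epoch, training_loss, validation_loss), transcribed from the "+" rows
# of the training results table in the README.md diff.
rows = [
    (1, 1.8693, 1.5996),
    (2, 1.6968, 1.7973),
    (3, 1.7187, 1.7232),
    (4, 1.6518, 1.7343),
    (5, 1.5003, 1.7727),
    (6, 1.3346, 1.7357),
    (7, 1.4029, 1.7164),
    (8, 1.2762, 1.7123),
    (9, 1.2441, 1.6978),
    (10, 1.2016, 1.7374),
    (11, 1.1887, 1.7076),
    (12, 1.0205, 1.6736),
    (13, 1.0771, 1.7209),
    (14, 1.0607, 1.6753),
    (15, 0.909, 1.6172),
    (16, 0.9255, 1.7418),
    (17, 0.8676, 1.6914),
    (18, 0.8533, 1.7310),
    (19, 0.845, 1.7893),
    (20, 0.869, 1.6936),
]

# Epoch with the lowest validation loss versus the final epoch.
best_epoch, _, best_val = min(rows, key=lambda r: r[2])
final_epoch, final_train, final_val = rows[-1]
first_train = rows[0][1]

print(f"lowest validation loss: {best_val:.4f} at epoch {best_epoch}")
print(f"final validation loss:  {final_val:.4f} at epoch {final_epoch}")
print(f"training loss change:   {first_train:.4f} -> {final_train:.4f}")
```

The lowest validation loss in the table is the epoch 1 value (1.5996), while the reported evaluation loss of 1.6936 matches the final epoch. Training loss falls from 1.8693 to 0.869 over the same run, which is consistent with overfitting, although the diff does not state which checkpoint the committed weights come from.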
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:94c0fce2fec27a03ea900f30cf9d324d275d8eb88159e744e8c1cfbf2a0d4db5
+ oid sha256:df839427b714f54cab0653758b1049138e221b2ff1bc609e44a8aef69b3e2891
  size 598635032
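
In a Git LFS pointer, `oid sha256:` is the SHA-256 digest of the actual file content and `size` is its length in bytes, so a downloaded copy of the weights can be checked against the new pointer. The following is a rough sketch of that check; the local path `model.safetensors` and the helper names `file_sha256` and `matches_pointer` are assumptions made for illustration, not something the commit defines.

```python
import hashlib
import os

# Values copied from the "+" side of the model.safetensors pointer diff.
EXPECTED_OID = "df839427b714f54cab0653758b1049138e221b2ff1bc609e44a8aef69b3e2891"
EXPECTED_SIZE = 598635032


def file_sha256(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_pointer(path, oid, size):
    """True if the file's byte size and SHA-256 both match the pointer."""
    return os.path.getsize(path) == size and file_sha256(path) == oid


if __name__ == "__main__":
    local_path = "model.safetensors"  # hypothetical local download location
    if os.path.exists(local_path):
        print(matches_pointer(local_path, EXPECTED_OID, EXPECTED_SIZE))
```

The size is unchanged between the two pointers (598635032 bytes), so only the digest distinguishes the retrained weights from the previous ones.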