CocoRoF commited on
Commit
8fa218a
·
verified ·
1 Parent(s): ce687f0

cc-100_12-14 Done

Browse files
Files changed (1) hide show
  1. README.md +16 -17
README.md CHANGED
@@ -1,22 +1,22 @@
1
  ---
2
  library_name: transformers
3
  license: apache-2.0
4
- base_model: CocoRoF/KoModernBERT-chp-02
5
  tags:
6
  - generated_from_trainer
7
  model-index:
8
- - name: KoModernBERT-chp-03
9
  results: []
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
- # KoModernBERT-chp-03
16
 
17
- This model is a fine-tuned version of [CocoRoF/KoModernBERT-chp-02](https://huggingface.co/CocoRoF/KoModernBERT-chp-02) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 2.1413
20
 
21
  ## Model description
22
 
@@ -46,24 +46,23 @@ The following hyperparameters were used during training:
46
  - total_eval_batch_size: 64
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
49
- - lr_scheduler_warmup_ratio: 0.1
50
  - num_epochs: 1.0
51
 
52
  ### Training results
53
 
54
  | Training Loss | Epoch | Step | Validation Loss |
55
  |:-------------:|:------:|:-----:|:---------------:|
56
- | 17.3702 | 0.0904 | 5000 | 2.1672 |
57
- | 17.2707 | 0.1808 | 10000 | 2.1613 |
58
- | 17.384 | 0.2712 | 15000 | 2.1601 |
59
- | 17.1693 | 0.3616 | 20000 | 2.1567 |
60
- | 17.2089 | 0.4520 | 25000 | 2.1509 |
61
- | 17.1292 | 0.5424 | 30000 | 2.1547 |
62
- | 16.9682 | 0.6329 | 35000 | 2.1470 |
63
- | 17.2477 | 0.7233 | 40000 | 2.1433 |
64
- | 17.2785 | 0.8137 | 45000 | 2.1427 |
65
- | 16.9302 | 0.9041 | 50000 | 2.1432 |
66
- | 17.1989 | 0.9945 | 55000 | 2.1413 |
67
 
68
 
69
  ### Framework versions
 
1
  ---
2
  library_name: transformers
3
  license: apache-2.0
4
+ base_model: CocoRoF/KoModernBERT-chp-03
5
  tags:
6
  - generated_from_trainer
7
  model-index:
8
+ - name: KoModernBERT-chp-04
9
  results: []
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
+ # KoModernBERT-chp-04
16
 
17
+ This model is a fine-tuned version of [CocoRoF/KoModernBERT-chp-03](https://huggingface.co/CocoRoF/KoModernBERT-chp-03) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 2.1738
20
 
21
  ## Model description
22
 
 
46
  - total_eval_batch_size: 64
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
 
49
  - num_epochs: 1.0
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:------:|:-----:|:---------------:|
55
+ | 18.7395 | 0.0904 | 5000 | 2.3312 |
56
+ | 18.4567 | 0.1808 | 10000 | 2.3132 |
57
+ | 18.0597 | 0.2712 | 15000 | 2.2958 |
58
+ | 18.0592 | 0.3616 | 20000 | 2.2780 |
59
+ | 18.1118 | 0.4520 | 25000 | 2.2565 |
60
+ | 18.1207 | 0.5424 | 30000 | 2.2429 |
61
+ | 17.3473 | 0.6329 | 35000 | 2.2228 |
62
+ | 17.8666 | 0.7233 | 40000 | 2.2078 |
63
+ | 17.909 | 0.8137 | 45000 | 2.1974 |
64
+ | 17.1812 | 0.9041 | 50000 | 2.1855 |
65
+ | 17.5599 | 0.9945 | 55000 | 2.1738 |
66
 
67
 
68
  ### Framework versions