Commit 0d3c35d · committed by cgt · 1 parent: 5190663

update model card README.md

Files changed (1)
  1. README.md +5 -3
README.md CHANGED
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [hfl/chinese-pert-large](https://huggingface.co/hfl/chinese-pert-large) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9521
+- Loss: 1.2148
 
 ## Model description
 
@@ -39,13 +39,15 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
+- num_epochs: 3
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.8543 | 1.0 | 3016 | 0.9521 |
+| 0.8657 | 1.0 | 3016 | 0.9520 |
+| 0.5472 | 2.0 | 6032 | 1.0019 |
+| 0.3045 | 3.0 | 9048 | 1.2148 |
 
 
 ### Framework versions