Akaisora commited on
Commit
cf9102d
1 Parent(s): 37d4e72

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -7
README.md CHANGED
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  This model is a fine-tuned version of [Salesforce/codegen-350M-multi](https://huggingface.co/Salesforce/codegen-350M-multi) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 0.0518
18
 
19
  ## Model description
20
 
@@ -34,10 +34,10 @@ More information needed
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 5e-05
37
- - train_batch_size: 4
38
- - eval_batch_size: 4
39
  - seed: 42
40
- - gradient_accumulation_steps: 8
41
  - total_train_batch_size: 32
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
@@ -47,9 +47,9 @@ The following hyperparameters were used during training:
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
- | No log | 0.99 | 68 | 0.0652 |
51
- | No log | 1.99 | 137 | 0.0545 |
52
- | No log | 2.97 | 204 | 0.0518 |
53
 
54
 
55
  ### Framework versions
 
14
 
15
  This model is a fine-tuned version of [Salesforce/codegen-350M-multi](https://huggingface.co/Salesforce/codegen-350M-multi) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 0.0266
18
 
19
  ## Model description
20
 
 
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 5e-05
37
+ - train_batch_size: 1
38
+ - eval_batch_size: 1
39
  - seed: 42
40
+ - gradient_accumulation_steps: 32
41
  - total_train_batch_size: 32
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
 
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
+ | No log | 0.99 | 85 | 0.0406 |
51
+ | No log | 1.99 | 170 | 0.0297 |
52
+ | No log | 2.98 | 255 | 0.0266 |
53
 
54
 
55
  ### Framework versions