khairi commited on
Commit
11732bc
1 Parent(s): 428c2b9

Model save

Browse files
Files changed (1) hide show
  1. README.md +5 -4
README.md CHANGED
@@ -1,6 +1,5 @@
1
  ---
2
  library_name: transformers
3
- base_model: khairi/seq2seq-plm
4
  tags:
5
  - generated_from_trainer
6
  model-index:
@@ -11,10 +10,10 @@ model-index:
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/flursky/ProteinLanguageModeling/runs/iev7z03n)
15
  # seq2seq-plm
16
 
17
- This model is a fine-tuned version of [khairi/seq2seq-plm](https://huggingface.co/khairi/seq2seq-plm) on an unknown dataset.
18
 
19
  ## Model description
20
 
@@ -37,10 +36,12 @@ The following hyperparameters were used during training:
37
  - train_batch_size: 112
38
  - eval_batch_size: 112
39
  - seed: 42
 
 
40
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
41
  - lr_scheduler_type: linear
42
  - lr_scheduler_warmup_steps: 6000
43
- - num_epochs: 3
44
 
45
  ### Framework versions
46
 
 
1
  ---
2
  library_name: transformers
 
3
  tags:
4
  - generated_from_trainer
5
  model-index:
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
  should probably proofread and complete it, then remove this comment. -->
12
 
13
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/flursky/ProteinLanguageModeling/runs/474iqrlv)
14
  # seq2seq-plm
15
 
16
+ This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
17
 
18
  ## Model description
19
 
 
36
  - train_batch_size: 112
37
  - eval_batch_size: 112
38
  - seed: 42
39
+ - gradient_accumulation_steps: 2
40
+ - total_train_batch_size: 224
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
  - lr_scheduler_warmup_steps: 6000
44
+ - num_epochs: 5
45
 
46
  ### Framework versions
47