uer commited on
Commit
f26cf2e
1 Parent(s): 629fe90

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -9
README.md CHANGED
@@ -92,7 +92,7 @@ Stage1:
92
  python3 preprocess.py --corpus_path corpora/cluecorpussmall.txt \
93
  --vocab_path models/google_zh_vocab.txt \
94
  --dataset_path cluecorpussmall_albert_seq128_dataset.pt \
95
- --seq_length 128 --processes_num 32 --target albert
96
  ```
97
 
98
  ```
@@ -102,9 +102,7 @@ python3 pretrain.py --dataset_path cluecorpussmall_albert_seq128_dataset.pt \
102
  --output_model_path models/cluecorpussmall_albert_base_seq128_model.bin \
103
  --world_size 8 --gpu_ranks 0 1 2 3 4 5 6 7 \
104
  --total_steps 1000000 --save_checkpoint_steps 100000 --report_steps 50000 \
105
- --learning_rate 1e-4 --batch_size 64 \
106
- --factorized_embedding_parameterization --parameter_sharing \
107
- --embedding word_pos_seg --encoder transformer --mask fully_visible --target albert
108
  ```
109
 
110
  Stage2:
@@ -113,20 +111,18 @@ Stage2:
113
  python3 preprocess.py --corpus_path corpora/cluecorpussmall.txt \
114
  --vocab_path models/google_zh_vocab.txt \
115
  --dataset_path cluecorpussmall_albert_seq512_dataset.pt \
116
- --seq_length 512 --processes_num 32 --target albert
117
  ```
118
 
119
  ```
120
  python3 pretrain.py --dataset_path cluecorpussmall_albert_seq512_dataset.pt \
121
- --pretrained_model_path models/cluecorpussmall_albert_base_seq128_model.bin-1000000 \
122
  --vocab_path models/google_zh_vocab.txt \
 
123
  --config_path models/albert/base_config.json \
124
  --output_model_path models/cluecorpussmall_albert_base_seq512_model.bin \
125
  --world_size 8 --gpu_ranks 0 1 2 3 4 5 6 7 \
126
  --total_steps 1000000 --save_checkpoint_steps 100000 --report_steps 50000 \
127
- --learning_rate 1e-4 --batch_size 64 \
128
- --factorized_embedding_parameterization --parameter_sharing \
129
- --embedding word_pos_seg --encoder transformer --mask fully_visible --target albert
130
  ```
131
 
132
  Finally, we convert the pre-trained model into Huggingface's format:
@@ -154,5 +150,6 @@ python3 scripts/convert_albert_from_uer_to_huggingface.py --input_model_path clu
154
  year={2019}
155
  }
156
  ```
 
157
  [base]:https://huggingface.co/uer/albert-base-chinese-cluecorpussmall
158
  [large]:https://huggingface.co/uer/albert-large-chinese-cluecorpussmall
 
92
  python3 preprocess.py --corpus_path corpora/cluecorpussmall.txt \
93
  --vocab_path models/google_zh_vocab.txt \
94
  --dataset_path cluecorpussmall_albert_seq128_dataset.pt \
95
+ --seq_length 128 --processes_num 32 --data_processor albert
96
  ```
97
 
98
  ```
 
102
  --output_model_path models/cluecorpussmall_albert_base_seq128_model.bin \
103
  --world_size 8 --gpu_ranks 0 1 2 3 4 5 6 7 \
104
  --total_steps 1000000 --save_checkpoint_steps 100000 --report_steps 50000 \
105
+ --learning_rate 1e-4 --batch_size 64
 
 
106
  ```
107
 
108
  Stage2:
 
111
  python3 preprocess.py --corpus_path corpora/cluecorpussmall.txt \
112
  --vocab_path models/google_zh_vocab.txt \
113
  --dataset_path cluecorpussmall_albert_seq512_dataset.pt \
114
+ --seq_length 512 --processes_num 32 --data_processor albert
115
  ```
116
 
117
  ```
118
  python3 pretrain.py --dataset_path cluecorpussmall_albert_seq512_dataset.pt \
 
119
  --vocab_path models/google_zh_vocab.txt \
120
+ --pretrained_model_path models/cluecorpussmall_albert_base_seq128_model.bin-1000000 \
121
  --config_path models/albert/base_config.json \
122
  --output_model_path models/cluecorpussmall_albert_base_seq512_model.bin \
123
  --world_size 8 --gpu_ranks 0 1 2 3 4 5 6 7 \
124
  --total_steps 1000000 --save_checkpoint_steps 100000 --report_steps 50000 \
125
+ --learning_rate 1e-4 --batch_size 64
 
 
126
  ```
127
 
128
  Finally, we convert the pre-trained model into Huggingface's format:
 
150
  year={2019}
151
  }
152
  ```
153
+
154
  [base]:https://huggingface.co/uer/albert-base-chinese-cluecorpussmall
155
  [large]:https://huggingface.co/uer/albert-large-chinese-cluecorpussmall