Summarization
tuanio commited on
Commit
e54a86a
1 Parent(s): 042778f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +69 -3
README.md CHANGED
@@ -108,7 +108,7 @@ Use the code below to get started with the model.
108
 
109
  ### Training Data
110
 
111
- <!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
112
 
113
  [More Information Needed]
114
 
@@ -123,8 +123,74 @@ Use the code below to get started with the model.
123
 
124
  #### Training Hyperparameters
125
 
126
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
127
-
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
128
  #### Speeds, Sizes, Times [optional]
129
 
130
  <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
 
108
 
109
  ### Training Data
110
 
111
+ - IndictTTS: https://www.kaggle.com/datasets/tuannguyenvananh/indictts-english
112
 
113
  [More Information Needed]
114
 
 
123
 
124
  #### Training Hyperparameters
125
 
126
+ ```yaml
127
+ data_cfg:
128
+ dataset:
129
+ processor:
130
+ feat_extractor_id: ${model_cfg.model.encoder_id}
131
+ tokenizer_id: ${model_cfg.tokenizer_id}
132
+ path:
133
+ base:
134
+ indict_tts: ../IndicTTS
135
+ cv: ../
136
+ train:
137
+ - train_data/indict_tts_train.jsonl
138
+ # - train_data/cv_train.jsonl
139
+ test:
140
+ - train_data/indict_tts_test.jsonl
141
+ # - train_data/cv_test.jsonl
142
+ dev:
143
+ - train_data/indict_tts_dev.jsonl
144
+ # - train_data/cv_dev.jsonl
145
+ dataloader:
146
+ batch_size: 46
147
+ num_workers: 8
148
+ pin_memory: True
149
+
150
+ model_cfg:
151
+ tokenizer_id: tuanio/wav2vec2-phoneme-ipa-ctc
152
+ model:
153
+ dropout: 0.1
154
+ encoder_id: tuanio/whisper-encoder.medium.en
155
+ optim:
156
+ lr: 1.25e-05
157
+ betas: [0.9, 0.998]
158
+ weight_decay: 0.01
159
+ scheduler:
160
+ name: linear
161
+ total_steps: -1
162
+ warmup_ratio: 0.05
163
+ interval: step
164
+ frequency: 1
165
+
166
+ trainer_cfg:
167
+ log:
168
+ wandb: True
169
+ logger_wandb:
170
+ project: aped_indian-lish
171
+ name: whisper-medium-indict-tts-only-from-epoch1
172
+ log_model: all
173
+ arguments:
174
+ accelerator: gpu
175
+ devices: -1
176
+ max_epochs: 10
177
+ log_every_n_steps: 1
178
+ enable_checkpointing: True
179
+ accumulate_grad_batches: 2
180
+ inference_mode: True
181
+ gradient_clip_val: 5.0
182
+ check_val_every_n_epoch: 1
183
+ val_check_interval: null
184
+
185
+
186
+ experiment_cfg:
187
+ train: True
188
+ valid: True
189
+ test: True
190
+ ckpt:
191
+ resume_ckpt: True
192
+ ckpt_path: ckpt/medium.epoch3.ckpt
193
+ ```
194
  #### Speeds, Sizes, Times [optional]
195
 
196
  <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->