naqi72 committed
Commit d17518e (parent: df06f12)

End of training

Files changed (1): README.md (+11 −26)
README.md CHANGED
@@ -19,7 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts) on the TTS_English_data dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.4244
+ - Loss: 0.4859
 
  ## Model description
 
@@ -38,45 +38,30 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 3e-05
- - train_batch_size: 12
+ - learning_rate: 2e-05
+ - train_batch_size: 14
  - eval_batch_size: 10
  - seed: 42
- - gradient_accumulation_steps: 2
- - total_train_batch_size: 24
+ - gradient_accumulation_steps: 3
+ - total_train_batch_size: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 1500
- - training_steps: 3000
+ - lr_scheduler_warmup_steps: 500
+ - training_steps: 1500
  - mixed_precision_training: Native AMP
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-------:|:----:|:---------------:|
- | No log | 1.0 | 176 | 0.7486 |
- | 0.9327 | 2.0 | 352 | 0.5466 |
- | 0.6633 | 3.0 | 528 | 0.5009 |
- | 0.6633 | 4.0 | 704 | 0.4828 |
- | 0.5635 | 5.0 | 880 | 0.4694 |
- | 0.5364 | 6.0 | 1056 | 0.4601 |
- | 0.5364 | 7.0 | 1232 | 0.4609 |
- | 0.5155 | 8.0 | 1408 | 0.4463 |
- | 0.5025 | 9.0 | 1584 | 0.4459 |
- | 0.4883 | 10.0 | 1760 | 0.4399 |
- | 0.4883 | 11.0 | 1936 | 0.4342 |
- | 0.4792 | 12.0 | 2112 | 0.4359 |
- | 0.4715 | 13.0 | 2288 | 0.4264 |
- | 0.4715 | 14.0 | 2464 | 0.4273 |
- | 0.4646 | 15.0 | 2640 | 0.4238 |
- | 0.4598 | 16.0 | 2816 | 0.4231 |
- | 0.4598 | 17.0 | 2992 | 0.4227 |
- | 0.452 | 17.0455 | 3000 | 0.4244 |
+ | 0.5661 | 4.3353 | 500 | 0.5135 |
+ | 0.5339 | 8.6705 | 1000 | 0.4927 |
+ | 0.5155 | 13.0058 | 1500 | 0.4859 |
 
 
  ### Framework versions
 
  - Transformers 4.44.2
  - Pytorch 2.5.0+cu121
- - Datasets 3.0.2
+ - Datasets 3.1.0
  - Tokenizers 0.19.1
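
For reference, a minimal inference sketch for this checkpoint, following the standard SpeechT5 text-to-speech flow in Transformers. The repository id, output filename, and speaker-embedding index are assumptions (the card does not state them); the x-vectors come from the commonly used Matthijs/cmu-arctic-xvectors dataset.

```python
import torch
import soundfile as sf
from datasets import load_dataset
from transformers import SpeechT5ForTextToSpeech, SpeechT5HifiGan, SpeechT5Processor

# Hypothetical repo id -- replace with this model's actual id on the Hub.
repo_id = "naqi72/speecht5_finetuned_tts_english"

processor = SpeechT5Processor.from_pretrained(repo_id)
model = SpeechT5ForTextToSpeech.from_pretrained(repo_id)
vocoder = SpeechT5HifiGan.from_pretrained("microsoft/speecht5_hifigan")

# SpeechT5 conditions generation on a 512-dim speaker x-vector.
xvectors = load_dataset("Matthijs/cmu-arctic-xvectors", split="validation")
speaker_embeddings = torch.tensor(xvectors[7306]["xvector"]).unsqueeze(0)

inputs = processor(text="Hello, this is a test sentence.", return_tensors="pt")
speech = model.generate_speech(inputs["input_ids"], speaker_embeddings, vocoder=vocoder)
sf.write("speech.wav", speech.numpy(), samplerate=16000)  # SpeechT5 outputs 16 kHz audio
```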
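
Likewise, a sketch of how the updated hyperparameters map onto Seq2SeqTrainingArguments. The output_dir and evaluation cadence are assumptions (the 500-step interval is inferred from the results table, not stated in the commit); the listed Adam betas/epsilon correspond to the Trainer defaults and are not passed explicitly.

```python
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="speecht5_finetuned_tts_english",  # hypothetical output path
    learning_rate=2e-5,
    per_device_train_batch_size=14,
    per_device_eval_batch_size=10,
    gradient_accumulation_steps=3,  # effective train batch size: 14 * 3 = 42
    warmup_steps=500,
    max_steps=1500,
    lr_scheduler_type="linear",
    fp16=True,                      # "Native AMP" mixed precision
    eval_strategy="steps",          # evaluation every 500 steps, per the results table
    eval_steps=500,
    seed=42,
)
```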