JSWOOK commited on
Commit
592ad85
1 Parent(s): fdf9f16

End of training

Browse files
Files changed (4) hide show
  1. README.md +78 -0
  2. config.yaml +14 -0
  3. model.safetensors +1 -1
  4. pytorch_model.bin +3 -0
README.md ADDED
@@ -0,0 +1,78 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ language:
4
+ - en
5
+ license: mit
6
+ base_model: pyannote/speaker-diarization-3.1
7
+ tags:
8
+ - speaker-diarization
9
+ - speaker-segmentation
10
+ - generated_from_trainer
11
+ datasets:
12
+ - diarizers-community/voxconverse
13
+ model-index:
14
+ - name: JSWOOK/pyannote_3_fine_tuning
15
+ results: []
16
+ ---
17
+
18
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
19
+ should probably proofread and complete it, then remove this comment. -->
20
+
21
+ # JSWOOK/pyannote_3_fine_tuning
22
+
23
+ This model is a fine-tuned version of [pyannote/speaker-diarization-3.1](https://huggingface.co/pyannote/speaker-diarization-3.1) on the diarizers-community/voxconverse dataset.
24
+ It achieves the following results on the evaluation set:
25
+ - Loss: 0.3134
26
+ - Model Preparation Time: 0.0048
27
+ - Der: 0.0888
28
+ - False Alarm: 0.0134
29
+ - Missed Detection: 0.0337
30
+ - Confusion: 0.0417
31
+
32
+ ## Model description
33
+
34
+ More information needed
35
+
36
+ ## Intended uses & limitations
37
+
38
+ More information needed
39
+
40
+ ## Training and evaluation data
41
+
42
+ More information needed
43
+
44
+ ## Training procedure
45
+
46
+ ### Training hyperparameters
47
+
48
+ The following hyperparameters were used during training:
49
+ - learning_rate: 5e-05
50
+ - train_batch_size: 32
51
+ - eval_batch_size: 32
52
+ - seed: 42
53
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
54
+ - lr_scheduler_type: cosine
55
+ - num_epochs: 10
56
+
57
+ ### Training results
58
+
59
+ | Training Loss | Epoch | Step | Validation Loss | Model Preparation Time | Der | False Alarm | Missed Detection | Confusion |
60
+ |:-------------:|:-----:|:----:|:---------------:|:----------------------:|:------:|:-----------:|:----------------:|:---------:|
61
+ | No log | 1.0 | 24 | 0.3180 | 0.0048 | 0.0915 | 0.0119 | 0.0385 | 0.0410 |
62
+ | 0.1903 | 2.0 | 48 | 0.3116 | 0.0048 | 0.0903 | 0.0125 | 0.0369 | 0.0409 |
63
+ | 0.1839 | 3.0 | 72 | 0.3089 | 0.0048 | 0.0896 | 0.0128 | 0.0357 | 0.0411 |
64
+ | 0.1825 | 4.0 | 96 | 0.3176 | 0.0048 | 0.0896 | 0.0131 | 0.0352 | 0.0413 |
65
+ | 0.1797 | 5.0 | 120 | 0.3148 | 0.0048 | 0.0892 | 0.0132 | 0.0346 | 0.0413 |
66
+ | 0.1801 | 6.0 | 144 | 0.3141 | 0.0048 | 0.0890 | 0.0133 | 0.0342 | 0.0415 |
67
+ | 0.1735 | 7.0 | 168 | 0.3137 | 0.0048 | 0.0887 | 0.0134 | 0.0338 | 0.0416 |
68
+ | 0.1705 | 8.0 | 192 | 0.3133 | 0.0048 | 0.0887 | 0.0134 | 0.0337 | 0.0416 |
69
+ | 0.1796 | 9.0 | 216 | 0.3133 | 0.0048 | 0.0887 | 0.0134 | 0.0337 | 0.0417 |
70
+ | 0.1644 | 10.0 | 240 | 0.3134 | 0.0048 | 0.0888 | 0.0134 | 0.0337 | 0.0417 |
71
+
72
+
73
+ ### Framework versions
74
+
75
+ - Transformers 4.44.2
76
+ - Pytorch 2.5.0+cu121
77
+ - Datasets 3.1.0
78
+ - Tokenizers 0.19.1
config.yaml ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ architectures:
2
+ - SegmentationModel
3
+ chunk_duration: 10.0
4
+ max_speakers_per_chunk: 3
5
+ max_speakers_per_frame: 2
6
+ min_duration: null
7
+ model_type: pyannet
8
+ sample_rate: 16000
9
+ torch_dtype: float32
10
+ transformers_version: 4.44.2
11
+ warm_up:
12
+ - 0.0
13
+ - 0.0
14
+ weigh_by_cardinality: false
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d60062e5585036f0140e9c8faf403ea4c5f0f741aff45386b9c71ceff5abd394
3
  size 5899124
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0851ed0984dbbc22a62ba602a1cc18eeb3fbbf2a1deb515812f41056a04b9303
3
  size 5899124
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:511ceac6e3c008f19fbe4b6b944cada16c75921dc5b1f3d4d2cc01ebc87b0206
3
+ size 5905907