Commit 9456804 by sarpba (1 parent: 2059af9)

Update README.md

  1. README.md +0 -66
README.md CHANGED
@@ -10,72 +10,6 @@ pinned: true
 
  We decided to create an organization to collect the latest usable Hungarian-specific fine-tuned models (Whisper, BART, Llama, etc.). Feel free to join our organization and push your models.
 
- ## About the models
- Hungarian-specific comparison test results (on google/fleurs):
-
- | Original models | WER | CER | Normalized_WER | Normalized_CER | Database | Split | Runtime |
- |:--------------------------------------------------------------------------------------------------------|:-------|:------|:-----------------|:-----------------|:--------------|:--------|:----------|
- | [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) | 102.46 | 50.31 | 103.37 | 50.19 | google/fleurs | test | 60.44 |
- | [openai/whisper-base](https://huggingface.co/openai/whisper-base) | 89.08 | 41.3 | 93.13 | 41.56 | google/fleurs | test | 89.66 |
- | [openai/whisper-small](https://huggingface.co/openai/whisper-small) | 48.67 | 15.1 | 45.55 | 15.39 | google/fleurs | test | 175.03 |
- | [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) | 32.49 | 9.58 | 29.04 | 10.05 | google/fleurs | test | 393.56 |
- | [openai/whisper-large](https://huggingface.co/openai/whisper-large) | 28.2 | 7.77 | 24.76 | 8.31 | google/fleurs | test | 675.77 |
- | [openai/whisper-large-v2](https://huggingface.co/openai/whisper-large-v2) | 23.14 | 5.94 | 19.83 | 6.48 | google/fleurs | test | 772.64 |
- | [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) | 18.88 | 4.56 | 15.48 | 5.2 | google/fleurs | test | 667.66 |
- | Finetuned models | | | | | | | |
- | [Hungarians/whisper-small-cv17-hu](https://huggingface.co/Hungarians/whisper-small-cv17-hu) | 188.94 | 75.87 | 188.21 | 77.32 | google/fleurs | test | 472.43 |
- | [Hungarians/whisper-tiny-cv16-hu-v3](https://huggingface.co/Hungarians/whisper-tiny-cv16-hu-v3) | 75.9 | 50.61 | 85.55 | 50.91 | google/fleurs | test | 65.17 |
- | [Hungarians/whisper-tiny-cv16-hu-v2](https://huggingface.co/Hungarians/whisper-tiny-cv16-hu-v2) | 72.13 | 41.71 | 71.13 | 41.45 | google/fleurs | test | 50.48 |
- | [Hungarians/whisper-tiny-cv16-hu-final](https://huggingface.co/Hungarians/whisper-tiny-cv16-hu-final) | 68.43 | 38.24 | 64.07 | 38.14 | google/fleurs | test | 41.48 |
- | [Hungarians/whisper-tiny-cv16-hu](https://huggingface.co/Hungarians/whisper-tiny-cv16-hu) | 64.7 | 28.02 | 60.9 | 27.7 | google/fleurs | test | 42.35 |
- | [Hungarians/whisper-tiny-hu-cleaned](https://huggingface.co/Hungarians/whisper-tiny-hu-cleaned) | 59.67 | 26.01 | 54.72 | 25.73 | google/fleurs | test | 33.72 |
- | [Hungarians/whisper-tiny-cv17-hu](https://huggingface.co/Hungarians/whisper-tiny-cv17-hu) | 58.76 | 24.86 | 56.1 | 24.72 | google/fleurs | test | 39.57 |
- | [sarpba/whisper-tiny-cv18-hu-cleaned](https://huggingface.co/sarpba/whisper-tiny-cv18-hu-cleaned) | 52.74 | 24.02 | 50.09 | 23.91 | google/fleurs | test | 40.16 |
- | [Hungarians/whisper-base-cv16-hu-v2](https://huggingface.co/Hungarians/whisper-base-cv16-hu-v2) | 51.41 | 20.97 | 46.79 | 20.93 | google/fleurs | test | 70.57 |
- | [Hungarians/whisper-base-hu-cleaned](https://huggingface.co/Hungarians/whisper-base-hu-cleaned) | 51.38 | 20.05 | 46.54 | 20.14 | google/fleurs | test | 70.84 |
- | [Hungarians/whisper-base-cv16-hu](https://huggingface.co/Hungarians/whisper-base-cv16-hu) | 50.06 | 17.71 | 44.83 | 17.44 | google/fleurs | test | 65.49 |
- | [Hungarians/whisper-medium-cv16-hu](https://huggingface.co/Hungarians/whisper-medium-cv16-hu) | 49.77 | 24.98 | 47.79 | 25.4 | google/fleurs | test | 498.53 |
- | [Hungarians/whisper-base-cv16-hu-final](https://huggingface.co/Hungarians/whisper-base-cv16-hu-final) | 48.37 | 16.28 | 43.84 | 16.31 | google/fleurs | test | 67.07 |
- | [Hungarians/whisper-base-cv17-hu](https://huggingface.co/Hungarians/whisper-base-cv17-hu) | 45.61 | 14.95 | 40.79 | 14.94 | google/fleurs | test | 64.15 |
- | [sarpba/whisper-base-cv18-hu-cleaned](https://huggingface.co/sarpba/whisper-base-cv18-hu-cleaned) | 42.09 | 13.67 | 36.66 | 13.53 | google/fleurs | test | 54.7 |
- | [Hungarians/whisper-small-cv16-hu-v2](https://huggingface.co/Hungarians/whisper-small-cv16-hu-v2) | 41.07 | 13.16 | 36.59 | 13.21 | google/fleurs | test | 201.28 |
- | [Hungarians/Whisper-small-hu-cleaned](https://huggingface.co/Hungarians/Whisper-small-hu-cleaned) | 39.12 | 13.91 | 41.15 | 14.11 | google/fleurs | test | 274.09 |
- | [Hungarians/whisper-small-cv16-hu](https://huggingface.co/Hungarians/whisper-small-cv16-hu) | 37.5 | 11.31 | 32.54 | 11.35 | google/fleurs | test | 608.28 |
- | [Hungarians/whisper-small-cv16-hu-v1.5](https://huggingface.co/Hungarians/whisper-small-cv16-hu-v1.5) | 35.61 | 10.99 | 30.33 | 11.04 | google/fleurs | test | 605.69 |
- | [Hungarians/whisper-medium-hu-cleaned](https://huggingface.co/Hungarians/whisper-medium-hu-cleaned) | 26.26 | 6.8 | 21.97 | 7.31 | google/fleurs | test | 442.53 |
- | Our best models | | | | | | | |
- | [sarpba/whisper-tiny-cv18-hu-cleaned](https://huggingface.co/sarpba/whisper-tiny-cv18-hu-cleaned) | 52.74 | 24.02 | 50.09 | 23.91 | google/fleurs | test | 40.16 |
- | [sarpba/whisper-base-cv18-hu-cleaned](https://huggingface.co/sarpba/whisper-base-cv18-hu-cleaned) | 42.09 | 13.67 | 36.66 | 13.53 | google/fleurs | test | 54.7 |
- | [sarpba/whisper-small-cv18-hu-cleaned](https://huggingface.co/sarpba/whisper-small-cv18-hu-cleaned) | 29.75 | 9.23 | 25.19 | 9.38 | google/fleurs | test | 281.95 |
- | [sarpba/whisper-medium-cv18-hu-cleaned](https://huggingface.co/sarpba/whisper-medium-cv18-hu-cleaned) | 23.89 | 6.79 | 19.81 | 7.3 | google/fleurs | test | 541.17 |
- | [Hungarians/whisper-large-v2-hu-cleaned](https://huggingface.co/Hungarians/whisper-large-v2-hu-cleaned) | 21.82 | 5.51 | 18.39 | 6.15 | google/fleurs | test | 725.31 |
- The last three rows are the results of INT8-quantized models.
-
- ## Quantization loss example
- | Model | WER | CER | Normalized_WER | Normalized_CER | Database | Split | Runtime |
- |:----------------------------------------------------------|:------|:------|:-----------------|:-----------------|:--------------|:--------|:----------|
- | Hungarians/whisper-base-cv17-hu | 45.61 | 14.95 | 40.79 | 14.94 | google/fleurs | test | 243.97 |
- | float16 | 50.55 | 21.01 | 46.81 | 20.99 | google/fleurs | test | 301.41 |
- | float32 | 49.69 | 20.77 | 47.38 | 20.74 | google/fleurs | test | 339.15 |
- | int8_float32 | 46.71 | 16.67 | 42.51 | 16.51 | google/fleurs | test | 246.06 |
- | int8_float16 | 46.5 | 17.13 | 42.23 | 16.92 | google/fleurs | test | 242.12 |
- | int8_bfloat16 | 45.7 | 15.06 | 41.03 | 15.04 | google/fleurs | test | 148.05 |
- | bfloat16 | 45.6 | 15 | 40.88 | 14.97 | google/fleurs | test | 144.87 |
- | int8 | 45.54 | 16.55 | 42.4 | 16.44 | google/fleurs | test | 236.97 |
-
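- As you can see, the INT8 quantization scores marginally better than the original model on raw WER (45.54 vs. 45.61), while its CER and normalized metrics are slightly worse.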
-
-
- Lower values are better!
-
- For Home Assistant's faster-whisper, use the int8, fp16, or fp32 models from the subfolders.
-
- # Some Hungarian info below (translated):
-
- The finished models are always here; my own (sarpba) repo holds the half-finished or experimental ones.
-
- For Home Assistant faster-whisper, the easiest option is to use the int8, fp16, and fp32 CT2-quantized (I'm not sure how to write this in Hungarian :)) models in the subfolders with cociweb's [custom_whisper](https://github.com/cociweb/custom_whisper.git) add-on.
-
  ## Community
 
  If you would like to join our Hungarian-language discussion group, where you can ask questions, share your experiences, or become a member of a group of Hungarian LLM experts, join our Facebook group: [Hungarian-LLM](https://www.facebook.com/groups/hungarian.llm).
 