barto17 commited on
Commit
a369de3
1 Parent(s): 01368ce

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -2
README.md CHANGED
@@ -1,5 +1,5 @@
1
  ---
2
- title: Lang Id
3
  emoji: 🔥
4
  colorFrom: green
5
  colorTo: indigo
@@ -9,4 +9,14 @@ app_file: app.py
9
  pinned: false
10
  ---
11
 
12
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: Language Identification
3
  emoji: 🔥
4
  colorFrom: green
5
  colorTo: indigo
 
9
  pinned: false
10
  ---
11
 
12
+ This repository contains the code for audio transcription and language identification. Both tasks are connected in one pipeline with two models stacked on top of another:
13
+ * Roberta (https://huggingface.co/dominguesm/xlm-roberta-base-lora-language-detection) — Language Detection
14
+ * Whisper (https://huggingface.co/openai/whisper-large) — Transcription
15
+
16
+ Common-Language dataset (https://huggingface.co/datasets/common_language) was used for both tasks.
17
+
18
+ References to the specific code are included in the main app.py file.
19
+
20
+
21
+
22
+ Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference