File size: 775 Bytes
9736ec1
a369de3
9736ec1
 
 
 
 
 
 
 
 
a369de3
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
---
title: Language Identification
emoji: 🔥
colorFrom: green
colorTo: indigo
sdk: gradio
sdk_version: 3.44.4
app_file: app.py
pinned: false
---

This repository contains the code for audio transcription and language identification. Both tasks are connected in one pipeline with two models stacked on top of another: 
* Roberta (https://huggingface.co/dominguesm/xlm-roberta-base-lora-language-detection)  — Language Detection
* Whisper (https://huggingface.co/openai/whisper-large) — Transcription

Common-Language dataset (https://huggingface.co/datasets/common_language) was used for both tasks.

References to the specific code are included in the main app.py file.



Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference