Update README.md
Browse files
README.md
CHANGED
@@ -8,9 +8,10 @@ tags:
|
|
8 |
- audio
|
9 |
- GGUF
|
10 |
---
|
11 |
-
<img src="https://cdn-uploads.huggingface.co/production/uploads/6618e0424dbef6bd3c72f89a/d7Rzpm0cgCToXjtE7_U2u.png" alt="Example" style="width:200px;"/>
|
12 |
-
|
13 |
# OmniAudio-2.6B
|
|
|
|
|
|
|
14 |
OmniAudio is the world's fastest and most efficient audio-language model for on-device deployment - a 2.6B-parameter multimodal model that processes both text and audio inputs. It integrates three components: Gemma-2-2b, Whisper turbo, and a custom projector module, enabling secure, responsive audio-text processing directly on edge devices.
|
15 |
|
16 |
Unlike traditional approaches that chain ASR and LLM models together, OmniAudio-2.6B unifies both capabilities in a single efficient architecture for minimal latency and resource overhead.
|
|
|
8 |
- audio
|
9 |
- GGUF
|
10 |
---
|
|
|
|
|
11 |
# OmniAudio-2.6B
|
12 |
+
|
13 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/6618e0424dbef6bd3c72f89a/d7Rzpm0cgCToXjtE7_U2u.png" alt="Example" style="width:100px;"/>
|
14 |
+
|
15 |
OmniAudio is the world's fastest and most efficient audio-language model for on-device deployment - a 2.6B-parameter multimodal model that processes both text and audio inputs. It integrates three components: Gemma-2-2b, Whisper turbo, and a custom projector module, enabling secure, responsive audio-text processing directly on edge devices.
|
16 |
|
17 |
Unlike traditional approaches that chain ASR and LLM models together, OmniAudio-2.6B unifies both capabilities in a single efficient architecture for minimal latency and resource overhead.
|