Text-to-Speech
Safetensors
Chatterbox
MLX
mlx-audio
speech
speech generation
voice cloning
tts
5-bit
Instructions to use mlx-community/chatterbox-5bit with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use mlx-community/chatterbox-5bit with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir chatterbox-5bit mlx-community/chatterbox-5bit
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
File size: 2,486 Bytes
f27fa21 447f7a7 b125b98 447f7a7 f27fa21 447f7a7 f27fa21 447f7a7 f27fa21 447f7a7 b125b98 447f7a7 b125b98 447f7a7 f27fa21 447f7a7 f27fa21 60ce74a 447f7a7 f27fa21 447f7a7 b125b98 447f7a7 b125b98 447f7a7 b125b98 447f7a7 b125b98 f27fa21 447f7a7 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 | ---
license: apache-2.0
language:
- ar
- da
- de
- el
- en
- es
- fi
- fr
- he
- hi
- it
- ja
- ko
- ms
- nl
- no
- pl
- pt
- ru
- sv
- sw
- tr
- zh
base_model:
- ResembleAI/chatterbox
pipeline_tag: text-to-speech
library_name: mlx-audio
tags:
- text-to-speech
- speech
- speech generation
- voice cloning
- mlx
- tts
---
# mlx-community/chatterbox-5bit
This model was converted to MLX format from [`ResembleAI/chatterbox`](https://huggingface.co/ResembleAI/chatterbox) using mlx-audio version **0.2.7**.
Refer to the [original model card](https://huggingface.co/ResembleAI/chatterbox) for more details on the model.
## Use with mlx
```bash
pip install -U mlx-audio
```
### CLI Example:
```bash
python -m mlx_audio.tts.generate --model mlx-community/chatterbox-5bit --text "Hello, this is a test."
```
### Python Example:
```python
from mlx_audio.tts.utils import load_model
from mlx_audio.tts.generate import generate_audio
model = load_model("mlx-community/chatterbox-5bit")
generate_audio(
model=model, text="Hello, this is a test.",
file_prefix="test_audio",
)
```
### Voice Cloning:
```python
from mlx_audio.tts.generate import generate_audio
generate_audio(
text="Hello, this is a test of Chatterbox TTS.",
model="mlx-community/chatterbox-5bit",
ref_audio="path_to_file.wav",
ref_text="Text transcript", # Optional
file_prefix="vc_output",
)
```
## Multilingual Quickstart
Generate speech in 23 languages by specifying the `lang_code` parameter. The model automatically handles pronunciation, intonation, and language-specific phonemes.
```python
from mlx_audio.tts.generate import generate_audio
generate_audio(
text="Olá, tudo bem?",
model="mlx-community/chatterbox-5bit",
lang_code="pt",
file_prefix="multilingual_output",
)
```
### Supported Languages
| Code | Language | Code | Language | Code | Language |
|------|------------|------|------------|------|------------|
| `ar` | Arabic | `he` | Hebrew | `no` | Norwegian |
| `da` | Danish | `hi` | Hindi | `pl` | Polish |
| `de` | German | `it` | Italian | `pt` | Portuguese |
| `el` | Greek | `ja` | Japanese | `ru` | Russian |
| `en` | English | `ko` | Korean | `sv` | Swedish |
| `es` | Spanish | `ms` | Malay | `sw` | Swahili |
| `fi` | Finnish | `nl` | Dutch | `tr` | Turkish |
| `fr` | French | `zh` | Chinese | | |
|