---
license: cc-by-4.0
language:
- en
tags:
- speech
- asr
- coreml
- parakeet
- transducer
- ios
base_model: nvidia/parakeet-tdt-0.6b-v2
---
# Parakeet TDT v2: CoreML INT8 (iOS)
CoreML INT8 conversion of [NVIDIA Parakeet-TDT 0.6B v2](https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2), optimized for iOS deployment on the Apple Neural Engine. The encoder uses `EnumeratedShapes` so that it can accept variable-length audio input.
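A model compiled with enumerated shapes accepts only the specific input lengths chosen at conversion time, so a caller typically pads each clip up to the nearest allowed length. The sketch below illustrates that idea only. The lengths in `enumeratedLengths` and the helper `padToEnumeratedShape` are hypothetical, this card does not list the shapes actually compiled into `encoder.mlmodelc`, and the real encoder may enumerate feature-frame counts rather than raw sample counts.

```swift
// Hypothetical enumerated input lengths (1 s, 5 s, 10 s, 30 s at 16 kHz).
// Assumption: the real shapes are not given in this card.
let enumeratedLengths = [16_000, 80_000, 160_000, 480_000]

/// Zero-pads `samples` to the smallest enumerated length that fits.
/// Returns nil when the clip is longer than every enumerated length.
func padToEnumeratedShape(_ samples: [Float], lengths: [Int]) -> [Float]? {
    guard let target = lengths.sorted().first(where: { $0 >= samples.count }) else {
        return nil
    }
    return samples + [Float](repeating: 0, count: target - samples.count)
}

// A 50,000-sample clip is padded up to the 80,000-sample shape.
let clip = [Float](repeating: 0.1, count: 50_000)
if let padded = padToEnumeratedShape(clip, lengths: enumeratedLengths) {
    print(padded.count) // prints 80000
}
```

A clip longer than the largest shape returns `nil` here; a real pipeline would more likely split it into chunks, which this sketch does not attempt.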
## Models
| Model | Description | Compute | Quantization |
|-------|-------------|---------|-------------|
| `encoder.mlmodelc` | FastConformer encoder (24L, 1024 hidden) | Neural Engine | INT8 palettized |
| `decoder.mlmodelc` | LSTM prediction network (2L, 640 hidden) | Neural Engine | FP16 |
## Usage
These models are used by the `ParakeetASR` module of [speech-swift](https://github.com/soniqo/speech-swift):
```swift
// Load the CoreML models.
let model = try await ParakeetASRModel.fromPretrained()
// Transcribe `samples`, the caller's audio buffer at 16 kHz.
let text = try model.transcribeAudio(samples, sampleRate: 16000)
```
---
- **Guide**: [soniqo.audio/guides/parakeet](https://soniqo.audio/guides/parakeet)
- **Docs**: [soniqo.audio](https://soniqo.audio)
- **GitHub**: [soniqo/speech-swift](https://github.com/soniqo/speech-swift)