CoreML Speech Models
Collection
Speech AI models for Apple Neural Engine via CoreML. iOS/macOS ready. ASR, TTS, VAD, diarization. โข 22 items โข Updated โข 3
CoreML INT8 conversion of NVIDIA Parakeet-TDT 0.6B v2 for iOS, with encoder shape optimized for audio segments up to 5 seconds. Smaller and faster than the variable-length variant for short-form audio.
| Model | Description | Compute | Quantization |
|---|---|---|---|
encoder.mlmodelc |
FastConformer encoder (5s max) | Neural Engine | INT8 palettized |
decoder.mlmodelc |
LSTM prediction network | Neural Engine | FP16 |
Used by speech-swift ParakeetASR module:
let model = try await ParakeetASRModel.fromPretrained()
let text = try model.transcribeAudio(samples, sampleRate: 16000)
Base model
nvidia/parakeet-tdt-0.6b-v2