CoreML Speech Models
Collection
Speech AI models for Apple Neural Engine via CoreML. iOS/macOS ready. ASR, TTS, VAD, diarization. โข 22 items โข Updated โข 3
CoreML INT8 conversion of NVIDIA Parakeet-TDT 0.6B v2 optimized for iOS deployment on Neural Engine. Encoder uses EnumeratedShapes for variable-length audio input.
| Model | Description | Compute | Quantization |
|---|---|---|---|
encoder.mlmodelc |
FastConformer encoder (24L, 1024 hidden) | Neural Engine | INT8 palettized |
decoder.mlmodelc |
LSTM prediction network (2L, 640 hidden) | Neural Engine | FP16 |
Used by speech-swift ParakeetASR module:
let model = try await ParakeetASRModel.fromPretrained()
let text = try model.transcribeAudio(samples, sampleRate: 16000)
Base model
nvidia/parakeet-tdt-0.6b-v2