Parakeet TDT v3 โ€” CoreML INT8 (iOS, 5s)

CoreML INT8 conversion of NVIDIA Parakeet-TDT 0.6B v2 for iOS, with encoder shape optimized for audio segments up to 5 seconds. Smaller and faster than the variable-length variant for short-form audio.

Models

Model Description Compute Quantization
encoder.mlmodelc FastConformer encoder (5s max) Neural Engine INT8 palettized
decoder.mlmodelc LSTM prediction network Neural Engine FP16

Usage

Used by speech-swift ParakeetASR module:

let model = try await ParakeetASRModel.fromPretrained()
let text = try model.transcribeAudio(samples, sampleRate: 16000)

Downloads last month
707
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for aufklarer/Parakeet-TDT-v3-CoreML-INT8-iOS-5s

Finetuned
(30)
this model

Collection including aufklarer/Parakeet-TDT-v3-CoreML-INT8-iOS-5s