license: cc-by-4.0
language:
  - en
tags:
  - speech
  - asr
  - coreml
  - parakeet
  - transducer
  - ios
base_model: nvidia/parakeet-tdt-0.6b-v2

Parakeet TDT v3 — CoreML INT8 (iOS)

CoreML INT8 conversion of NVIDIA Parakeet-TDT 0.6B v2, optimized for iOS deployment on the Neural Engine. The encoder uses EnumeratedShapes, a fixed set of allowed input shapes, to accept variable-length audio input.
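Because an EnumeratedShapes input only accepts the shapes that were listed at conversion time, a caller usually has to pad the audio up to the nearest allowed length before running the encoder. The sketch below illustrates that bucketing step in plain Swift. The lengths in `enumeratedLengths`, the helper names, and the choice of zero padding are all assumptions made for illustration; this card does not state which shapes are compiled into encoder.mlmodelc or whether they are counted in samples or feature frames.

```swift
// Hypothetical enumerated input lengths (illustrative values only).
// The shapes actually compiled into encoder.mlmodelc are not stated in this card.
let enumeratedLengths = [80_000, 160_000, 320_000, 480_000]

/// Returns the smallest enumerated length that can hold `count` elements,
/// or nil if the input is longer than every enumerated shape.
func bucketLength(for count: Int, in lengths: [Int]) -> Int? {
    lengths.sorted().first { $0 >= count }
}

/// Zero-pads `samples` up to the chosen bucket so that the buffer length
/// matches one of the enumerated shapes exactly.
func padToBucket(_ samples: [Float], lengths: [Int]) -> [Float]? {
    guard let target = bucketLength(for: samples.count, in: lengths) else {
        return nil // caller must split the audio into shorter chunks
    }
    return samples + [Float](repeating: 0, count: target - samples.count)
}
```

When `padToBucket` returns nil, the input exceeds the largest shape and would need to be chunked; how the ParakeetASR module handles that case is not described here.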

Models

| Model | Description | Compute | Quantization |
|---|---|---|---|
| encoder.mlmodelc | FastConformer encoder (24L, 1024 hidden) | Neural Engine | INT8 palettized |
| decoder.mlmodelc | LSTM prediction network (2L, 640 hidden) | Neural Engine | FP16 |
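"INT8 palettized" means that each weight is stored as an 8-bit index into a small lookup table (the palette) of floating-point values, rather than as a full-precision number. The toy sketch below shows only the lookup idea; the function name, the 4-entry palette, and the flat array layout are illustrative assumptions, not the on-disk format of encoder.mlmodelc.

```swift
/// Reconstructs float weights from 8-bit palette indices.
/// With 8-bit indices the palette can hold up to 256 distinct values.
func dequantize(indices: [UInt8], palette: [Float]) -> [Float] {
    indices.map { palette[Int($0)] }
}

// Toy example: a 4-entry palette and five stored indices.
let palette: [Float] = [-0.5, 0.0, 0.25, 1.0]
let indices: [UInt8] = [3, 0, 2, 2, 1]
let weights = dequantize(indices: indices, palette: palette)
// weights is [1.0, -0.5, 0.25, 0.25, 0.0]
```

Storage drops to one byte per weight plus the palette itself, at the cost of restricting weights to the palette's values.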

Usage

Used by the ParakeetASR module in speech-swift:

let model = try await ParakeetASRModel.fromPretrained()
let text = try model.transcribeAudio(samples, sampleRate: 16000)
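The call above takes `samples` at a 16 kHz sample rate. If the audio starts out as 16-bit integer PCM, it typically needs converting to floating point first. The helper below is a minimal sketch of that conversion; the function name is hypothetical, and it assumes that `transcribeAudio` expects mono `Float` samples normalized to the range [-1, 1], which this card does not confirm.

```swift
/// Converts signed 16-bit PCM samples to Float values in [-1, 1).
/// Assumes mono audio that is already at the target sample rate.
func floatSamples(fromPCM16 pcm: [Int16]) -> [Float] {
    pcm.map { Float($0) / 32768.0 }
}

// Example: silence, half scale, and negative full scale.
let converted = floatSamples(fromPCM16: [0, 16384, -32768])
// converted is [0.0, 0.5, -1.0]
```

Audio recorded at another rate would also need resampling to 16 kHz before being passed with `sampleRate: 16000`.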