aufklarer
/

Parakeet-TDT-v3-CoreML-INT8-iOS

Model card Files Files and versions

Parakeet-TDT-v3-CoreML-INT8-iOS / README.md

aufklarer's picture

Upload README.md with huggingface_hub

6c45cf2 verified about 1 month ago

|

history blame contribute delete

1.15 kB

	---
	license: cc-by-4.0
	language:
	- en
	tags:
	- speech
	- asr
	- coreml
	- parakeet
	- transducer
	- ios
	base_model: nvidia/parakeet-tdt-0.6b-v2
	---

	# Parakeet TDT v3 — CoreML INT8 (iOS)

	CoreML INT8 conversion of [NVIDIA Parakeet-TDT 0.6B v2](https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2) optimized for iOS deployment on Neural Engine. Encoder uses `EnumeratedShapes` for variable-length audio input.

	## Models

	\| Model \| Description \| Compute \| Quantization \|
	\|-------\|-------------\|---------\|-------------\|
	\| `encoder.mlmodelc` \| FastConformer encoder (24L, 1024 hidden) \| Neural Engine \| INT8 palettized \|
	\| `decoder.mlmodelc` \| LSTM prediction network (2L, 640 hidden) \| Neural Engine \| FP16 \|

	## Usage

	Used by [speech-swift](https://github.com/soniqo/speech-swift) `ParakeetASR` module:

	```swift
	let model = try await ParakeetASRModel.fromPretrained()
	let text = try model.transcribeAudio(samples, sampleRate: 16000)
	```

	---

	- Guide: [soniqo.audio/guides/parakeet](https://soniqo.audio/guides/parakeet)
	- Docs: [soniqo.audio](https://soniqo.audio)
	- GitHub: [soniqo/speech-swift](https://github.com/soniqo/speech-swift)