Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs Paper • 2404.05719 • Published Apr 8 • 80
Whisper Collection Whisper models for automatic speech recognition (ASR) and speech translation, quantized for faster inference speeds. • 18 items • Updated Aug 23 • 2