Article • Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints • May 1 • 68
Vary: Scaling up the Vision Vocabulary for Large Vision-Language Models Paper • 2312.06109 • Published Dec 11, 2023 • 20