
vidore/colqwen-omni-v0.1
Visual Document Retrieval
•
Updated
•
3.38k
•
90
Generate speech from text using a reference audio
An interactive view of human heart
Real-time video captioning powered by FastVLM