ThinkSound
Generate audio for a video using captions and descriptions
Deeply interrogate audio file content
Zero-Shot Material Transfer from a Single Image
Multi-AI Expert Consensus Platform
AI generates PPT with diagrams and images from given topics
An agent that uses over 9000 vision models from the HF Hub.
Demo for Nanonets-OCR
EfficientVLM
camel doc ocr / core ocr / docscope ocr / monkey ocr
Dolphin Demo