gpt-omni/mini-omni2
Any-to-Any • Updated • 127 • 279
MoonDream 2 Vision Model on the Browser: Candle/Rust/WASM
a tiny vision language model
A unified multimodal understanding and generation model.
Interact with an AI by sending text, images, or audio
cosmos reason1 / docscopeocr / visionocr / captioner relaxed