InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Paper • 2412.09596 • Published • 87
Text Behind Image using birefnet-lite for background removal
Kolors Character to keep character developed with Flux
Expressive Portrait Animation w/ Hierarchical Motion Attention
Add vectors to Hub datasets and do in-memory vector search.