GUI-Vid is the first VideoLLM with strong GUI-oriented capabilities, retrained on the GUI-World dataset.
It was presented in the paper "GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents".
See the GitHub repository for how to use GUI-Vid for GUI understanding tasks.