One Vision-Language-Action Model for GUI Agent
Qinghong (Kevin) Lin PRO
KevinQHLin
AI & ML interests
Vision-Language Model, Video Understanding, Human-AI Interaction
Recent Activity
authored
a paper
1 day ago
VLog: Video-Language Models by Generative Retrieval of Narration
Vocabulary
liked
a dataset
1 day ago
lmms-lab/AISG_Challenge
commented on
a paper
1 day ago
VLog: Video-Language Models by Generative Retrieval of Narration
Vocabulary