Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 3 days ago • 67
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 125
Article LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning? Jul 25, 2024 • 18
Article PaliGemma – Google's Cutting-Edge Open Vision Language Model May 14, 2024 • 238