Rationale-aided Efficient 7B size Large Language and Vision Models. Let's enjoy it!
Byung-Kwan Lee
BK-Lee
AI & ML interests
Vision Language Models
Recent Activity
upvoted
a
paper
about 12 hours ago
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding
upvoted
a
paper
about 13 hours ago
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning