Hyogun Lee's picture

4 14 5

Hyogun Lee

Haawron

·

AI & ML interests

Video understanding, multi-modal LLMs

Recent Activity

upvoted a paper about 3 hours ago

Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding

upvoted a paper about 9 hours ago

Byte Latent Transformer: Patches Scale Better Than Tokens

commented a paper about 10 hours ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

View all activity

Organizations

None yet

Haawron's activity

commented a paper about 10 hours ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 5 days ago • 119 •

New activity in lmms-lab/llava-onevision-qwen2-0.5b-si 5 days ago

Training time

#3 opened 5 days ago by

commented a paper 3 months ago

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

Paper • 2409.13592 • Published Sep 20 • 48 •

commented 2 papers 7 months ago

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31 • 63 •

FIFO-Diffusion: Generating Infinite Videos from Text without Training

Paper • 2405.11473 • Published May 19 • 53 •