Byung-Kwan Lee

BK-Lee

https://sites.google.com/view/byungkwanlee

AI & ML interests

Vision Language Models

Recent Activity

upvoted a paper 15 days ago

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

upvoted a paper 15 days ago

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

upvoted a paper 21 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Organizations

commented on 3 papers 3 months ago

commented on a paper 8 months ago

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

Paper • 2506.15681 • Published Jun 18, 2025 • 41

commented on 3 papers 9 months ago

Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Paper • 2504.20157 • Published Apr 28, 2025 • 37

commented on a paper about 1 year ago

VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models

Paper • 2412.01822 • Published Dec 2, 2024 • 15

New activity in BK-Lee/Phantom-7B about 1 year ago

Open-sourcing fine-tuning code

#1 opened about 1 year ago by aayush14

New activity in BK-Lee/Meteor-MLM about 1 year ago

Add pipeline tag

#2 opened about 1 year ago by nielsr

commented on a paper over 1 year ago

Intriguing Properties of Large Language and Vision Models

Paper • 2410.04751 • Published Oct 7, 2024 • 16

New activity in BK-Lee/CoLLaVO-7B over 1 year ago

Link model to paper, add model card

#1 opened over 1 year ago by nielsr

New activity in BK-Lee/MoAI-7B over 1 year ago

Add link to paper, pipeline tag

#7 opened over 1 year ago by nielsr

commented on 4 papers over 1 year ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 121

Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge

Paper • 2407.03958 • Published Jul 4, 2024 • 21

TokenPacker: Efficient Visual Projector for Multimodal LLM

Paper • 2407.02392 • Published Jul 2, 2024 • 23

TroL: Traversal of Layers for Large Language and Vision Models

Paper • 2406.12246 • Published Jun 18, 2024 • 36

New activity in BK-Lee/Meteor over 1 year ago

Add example input

#1 opened over 1 year ago by merve

New activity in zero-gpu-explorers/README over 1 year ago

Error!

#63 opened over 1 year ago by BK-Lee

New activity in BK-Lee/Meteor-Mamba over 1 year ago

question on computation cost

#3 opened over 1 year ago by MicFizzy
