Lewei Lu's picture

Lewei Lu

luotto

·

ottolu

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

liked a dataset 2 days ago

nvidia/OpenScienceReasoning-2

upvoted a paper 11 days ago

Intern-S1: A Scientific Multimodal Foundation Model

View all activity

Organizations

upvoted a paper 1 day ago

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

Paper • 2508.21496 • Published 6 days ago • 52

upvoted a paper 11 days ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published 14 days ago • 242

upvoted a paper 16 days ago

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Paper • 2508.13142 • Published 17 days ago • 31

upvoted a paper about 1 month ago

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Paper • 2507.22827 • Published Jul 30 • 97

upvoted 4 papers about 2 months ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 90

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 113

PyVision: Agentic Vision with Dynamic Tooling

Paper • 2507.07998 • Published Jul 10 • 31

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30 • 86

upvoted 4 papers 2 months ago

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published Jun 11 • 32

CoMemo: LVLMs Need Image Context with Image Memory

Paper • 2506.06279 • Published Jun 6 • 9

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25 • 46

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25 • 63

upvoted 5 papers 3 months ago

Language-Image Alignment with Fixed Text Encoders

Paper • 2506.04209 • Published Jun 4 • 11

Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces

Paper • 2506.00123 • Published May 30 • 34

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Paper • 2505.23762 • Published May 29 • 46

Visual Planning: Let's Think Only with Images

Paper • 2505.11409 • Published May 16 • 57

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Paper • 2505.16410 • Published May 22 • 57

upvoted a collection 4 months ago

SigLIP2

36 items • Updated Jul 10 • 85

upvoted a paper 4 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 150

upvoted a collection 4 months ago

Qwen3

84 items • Updated 29 days ago • 1.19k