22 85 34

HAODONG DUAN

KennyUTC

https://kennymckormick.github.io

AI & ML interests

Video Understanding; Multi-Modal Learning

Recent Activity

liked a dataset 15 days ago

risashinoda/AgroBench

upvoted a paper about 1 month ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

upvoted a paper about 1 month ago

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

View all activity

Organizations

Posts 2

Post

1626

OPEN VLM LEADERBOARD JUST RELEASED the FULL EVALUATION RESULTS of GPT-4o

[TL;DR]
GPT-4o shows steady progress compared to GPT-4v (0419), with a 3% improvement on the average score (68.7% -> 72.1%). GPT-4o displays stronger perception and less hallucination.

opencompass/open_vlm_leaderboard