1 4

Lawrence Jang

ljang0

ljang0

AI & ML interests

None yet

Recent Activity

authored a paper about 24 hours ago

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

authored a paper about 24 hours ago

The BrowserGym Ecosystem for Web Agent Research

authored a paper 1 day ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

View all activity

Organizations

None yet

ljang0's activity

authored 2 papers about 24 hours ago

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

Paper • 2410.19100 • Published Oct 24 • 6

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published 14 days ago • 18

authored a paper 1 day ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published 3 days ago • 40

upvoted a paper 2 days ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published 3 days ago • 40

upvoted a paper 4 days ago

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published 14 days ago • 18

upvoted a paper about 2 months ago

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

Paper • 2410.19100 • Published Oct 24 • 6

commented a paper about 2 months ago

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

Paper • 2410.19100 • Published Oct 24 • 6 •

authored 3 papers 3 months ago

VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks

Paper • 2401.13649 • Published Jan 24 • 1

ICAL: Continual Learning of Multimodal Agents by Transforming Trajectories into Actionable Insights

Paper • 2406.14596 • Published Jun 20 • 5

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12 • 43

upvoted a paper 3 months ago

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12 • 43

updated a dataset 3 months ago

ljang0/code_ppo_java

Viewer • Updated Sep 7 • 966 • 40

updated a dataset 4 months ago

ljang0/code_ppo

Viewer • Updated Sep 7 • 974 • 36