Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Fan Zhou
koalazf99
AI & ML interests
Deep Learning; Natural Language Processing; Foundation Models
Recent Activity
upvoted
a
paper
11 days ago
VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos
upvoted
a
paper
about 2 months ago
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents