2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining • Paper • 2501.00958 • Published 8 days ago • 89
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis • Paper • 2412.19723 • Published 13 days ago • 75
PERSE: Personalized 3D Generative Avatars from A Single Portrait • Paper • 2412.21206 • Published 10 days ago • 15
Training Software Engineering Agents and Verifiers with SWE-Gym • Paper • 2412.21139 • Published 10 days ago • 19
Can Large Language Models Help Developers with Robotic Finite State Machine Modification? • Paper • 2412.05625 • Published Dec 7, 2024
MentalLLaMA: Interpretable Mental Health Analysis on Social Media with Large Language Models • Paper • 2309.13567 • Published Sep 24, 2023 • 3