Ryan Xu

imryanxu

AI & ML interests

fishing in lab

imryanxu's activity

upvoted a paper 12 days ago

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Paper • 2410.17856 • Published 17 days ago • 48

upvoted a paper 21 days ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published 23 days ago • 86

upvoted 2 papers 23 days ago

HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks

Paper • 2410.12381 • Published 25 days ago • 41

VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI

Paper • 2410.11623 • Published 25 days ago • 46

upvoted a paper 26 days ago

Baichuan-Omni Technical Report

Paper • 2410.08565 • Published 30 days ago • 82

upvoted 6 papers about 1 month ago

A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?

Paper • 2409.15277 • Published Sep 23 • 34

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25 • 99

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21 • 27

Pixel-Space Post-Training of Latent Diffusion Models

Paper • 2409.17565 • Published Sep 26 • 19

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

Paper • 2409.18125 • Published Sep 26 • 33

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59

upvoted 8 papers about 2 months ago

Imagine yourself: Tuning-Free Personalized Image Generation

Paper • 2409.13346 • Published Sep 20 • 67

MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines

Paper • 2409.12959 • Published Sep 19 • 36

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Paper • 2409.12568 • Published Sep 19 • 47

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 134

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12 • 61

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4 • 72

Agent Workflow Memory

Paper • 2409.07429 • Published Sep 11 • 27

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11 • 50

upvoted a paper 3 months ago

CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis

Paper • 2407.13301 • Published Jul 18 • 54