Quanquan Gu's picture

Quanquan Gu

thughost

·

QuanquanGu

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Best-of-Majority: Minimax-Optimal Strategy for Pass@k Inference Scaling

upvoted a paper about 1 month ago

Causal Attention with Lookahead Keys

commented on a paper about 1 month ago

Causal Attention with Lookahead Keys

View all activity

Organizations

Posts 2

Post

746

We've open-sourced the code and models for Self-Play Preference Optimization (SPPO)! 🚀🚀🚀
🤗paper: Self-Play Preference Optimization for Language Model Alignment (2405.00675)
⭐ code: https://github.com/uclaml/SPPO
🤗models: UCLA-AGI/sppo-6635fdd844f2b2e4a94d0b9a

Post

Check out the demo of SPIN-Diffusion made by @angelahzyuan at: UCLA-AGI/SPIN-Diffusion-demo-v1

Papers 29

arxiv:2501.06425

arxiv:2411.10438

arxiv:2410.13782

arxiv:2410.02712

models 0

None public yet

datasets 0

None public yet