10 16 24

Quanquan Gu

thughost

QuanquanGu

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Best-of-Majority: Minimax-Optimal Strategy for Pass@k Inference Scaling

upvoted a paper about 1 month ago

Causal Attention with Lookahead Keys

commented on a paper about 1 month ago

Causal Attention with Lookahead Keys

View all activity

Organizations

upvoted a paper 7 days ago

Best-of-Majority: Minimax-Optimal Strategy for Pass@k Inference Scaling

Paper • 2510.03199 • Published 10 days ago • 1

upvoted a paper about 1 month ago

Causal Attention with Lookahead Keys

Paper • 2509.07301 • Published Sep 9 • 21

upvoted a paper 5 months ago

On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

Paper • 2505.17508 • Published May 23 • 6

upvoted a paper 9 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 88

upvoted a paper 11 months ago

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Paper • 2411.10438 • Published Nov 15, 2024 • 13

upvoted a paper about 1 year ago

Accelerated Preference Optimization for Large Language Model Alignment

Paper • 2410.06293 • Published Oct 8, 2024 • 5

upvoted a collection about 1 year ago

LLaVA-Critic

Collection

as a general evaluator for assessing model performance • 6 items • Updated Oct 6, 2024 • 10

upvoted 3 papers about 1 year ago

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3, 2024 • 37

General Preference Modeling with Preference Representations for Aligning Language Models

Paper • 2410.02197 • Published Oct 3, 2024 • 9

ProteinBench: A Holistic Evaluation of Protein Foundation Models

Paper • 2409.06744 • Published Sep 10, 2024 • 8

upvoted a collection over 1 year ago

SPPO

Collection

Self-Play Preference Optimization • 10 items • Updated Jun 29, 2024 • 13

upvoted 4 papers over 1 year ago

Self-Play Preference Optimization for Language Model Alignment

Paper • 2405.00675 • Published May 1, 2024 • 27

Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP

Paper • 2310.00927 • Published Oct 2, 2023 • 1

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Paper • 2311.04205 • Published Nov 7, 2023 • 5

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Paper • 2308.05374 • Published Aug 10, 2023 • 28

upvoted a paper almost 2 years ago

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 68

Quanquan Gu

AI & ML interests

Recent Activity

Organizations

thughost's activity