Tiehua Mei's picture

Tiehua Mei

Mithas-01

https://github.com/Mithas-114

Mithas-114

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning

authored a paper 3 days ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

authored a paper 3 days ago

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

View all activity

Organizations

None yet

authored 3 papers 3 days ago

Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning

Paper • 2512.05591 • Published Dec 5, 2025 • 17

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Paper • 2605.19577 • Published 12 days ago • 58

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Paper • 2605.28293 • Published 4 days ago • 80

upvoted a paper 3 days ago

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Paper • 2605.28293 • Published 4 days ago • 80

submitted a paper to Daily Papers 3 days ago

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Paper • 2605.28293 • Published 4 days ago • 80

upvoted a paper 11 days ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Paper • 2605.19577 • Published 12 days ago • 58

upvoted a paper 7 months ago

Open Multimodal Retrieval-Augmented Factual Image Generation

Paper • 2510.22521 • Published Oct 26, 2025 • 31

authored a paper 12 months ago

GORACS: Group-level Optimal Transport-guided Coreset Selection for LLM-based Recommender Systems

Paper • 2506.04015 • Published Jun 4, 2025 • 1

upvoted a paper 12 months ago

GORACS: Group-level Optimal Transport-guided Coreset Selection for LLM-based Recommender Systems

Paper • 2506.04015 • Published Jun 4, 2025 • 1

liked a Space about 1 year ago

BookWorld

A demo for BOOKWORLD