Jiaxin Huang's picture

1 3

Jiaxin Huang

teapot123

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Efficient Test-Time Scaling via Self-Calibration

upvoted a paper about 2 months ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

upvoted a paper 5 months ago

Taming Overconfidence in LLMs: Reward Calibration in RLHF

View all activity

Organizations

teapot123's activity

upvoted a paper 8 days ago

Efficient Test-Time Scaling via Self-Calibration

Paper • 2503.00031 • Published 16 days ago • 13

upvoted a paper about 2 months ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 37

upvoted a paper 5 months ago

Taming Overconfidence in LLMs: Reward Calibration in RLHF

Paper • 2410.09724 • Published Oct 13, 2024 • 2