Jian Hu

chuyi777

https://hujian.website

hijkzzz

AI & ML interests

Reinforcement Learning

chuyi777's activity

liked a model 15 days ago

CohereForAI/c4ai-command-r7b-12-2024

Text Generation • Updated 2 days ago • 11.3k • 351

upvoted a paper 23 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 27 days ago • 89

commented on a paper 23 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 27 days ago • 89

liked a dataset 28 days ago

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 7.15k • 338

liked a dataset 29 days ago

yingyingzhang/metamath-qwen2-math

Viewer • Updated Oct 1, 2024 • 467k • 300 • 30

upvoted a paper about 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 79

updated 3 models 2 months ago

liked a model 3 months ago

O1-OPEN/OpenO1-LLama-8B-v0.1

Updated Oct 8, 2024 • 463 • 15

updated a model 3 months ago

OpenRLHF/Mistral-7b-PRM-Math-Shepherd

Updated Oct 30, 2024 • 4 • 1

New activity in OpenRLHF/Mistral-7b-PRM-Math-Shepherd 3 months ago

How do I download the model?

#1 opened 3 months ago by Yutong001

liked 2 models 3 months ago

AI-MO/NuminaMath-7B-TIR

Text Generation • Updated Aug 14, 2024 • 2.95k • 331

Nexusflow/Athene-70B

Text Generation • Updated Nov 15, 2024 • 7.03k • 194

liked a model 4 months ago

peiyi9979/mistral-7b-sft

Text Generation • Updated Jan 15, 2024 • 1.61k • 7

liked 2 datasets 4 months ago

nvidia/HelpSteer2

Viewer • Updated Dec 18, 2024 • 21.4k • 16.7k • 399

GAIR/o1-journey

Viewer • Updated Oct 16, 2024 • 327 • 364 • 132

liked 2 models 4 months ago

peiyi9979/math-shepherd-mistral-7b-rl

Text Generation • Updated Jan 15, 2024 • 193 • 5

peiyi9979/math-shepherd-mistral-7b-prm

Text Generation • Updated Jan 15, 2024 • 6.68k • 43

liked a dataset 4 months ago

peiyi9979/Math-Shepherd

Viewer • Updated Jan 3, 2024 • 445k • 581 • 87