Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
25
29
ZHANG HAO
26hzhang
Follow
btjhjeon's profile picture
RavRana's profile picture
21world's profile picture
12 followers
·
25 following
https://26hzhang.github.io/
hzhang26
26hzhang
hzhang26
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
4 days ago
Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model
upvoted
a
paper
8 days ago
Improving Data and Reward Design for Scientific Reasoning in Large Language Models
upvoted
a
paper
10 days ago
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration
View all activity
Organizations
26hzhang
's datasets
6
Sort: Recently updated
26hzhang/math_dapo_qwen2.5-math-7b_rollout_n_10
Viewer
•
Updated
Nov 15, 2025
•
17.4k
•
5
26hzhang/math_7.5k_qwen2.5-math-7b_rollout_n_10
Viewer
•
Updated
Nov 15, 2025
•
7.5k
•
6
26hzhang/math_dapo_qwen3-1.7b_rollout_n_10
Viewer
•
Updated
Nov 15, 2025
•
17.4k
•
15
26hzhang/math_7.5k_qwen3-1.7b_rollout_n_10
Viewer
•
Updated
Nov 14, 2025
•
7.5k
•
7
26hzhang/math_dapo_qwen3-4b_rollout_n_10
Viewer
•
Updated
Nov 11, 2025
•
17.4k
•
22
26hzhang/math_7.5k_qwen3-4b_rollout_n_10
Viewer
•
Updated
Nov 7, 2025
•
7.5k
•
10