Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Zihan Gao
Papercold
Follow
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
3 days ago
SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization
View all activity
Organizations
None yet
models
0
None public yet
datasets
0
None public yet