Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
5
6
Jiang
Dongwei
Follow
AZH04's profile picture
SikaStar's profile picture
21world's profile picture
5 followers
·
1 following
Some-random
AI & ML interests
None yet
Recent Activity
updated
a model
4 days ago
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_newdata
published
a model
5 days ago
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_newdata
new
activity
5 days ago
Dongwei/Qwen-2.5-7B_Base_Math_smalllr:
Wandb log not found
View all activity
Organizations
Papers
3
arxiv:
2410.01044
arxiv:
2409.12183
arxiv:
2407.09007
models
17
Sort: Recently updated
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_newdata
Text Generation
•
Updated
4 days ago
•
11
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_longer
Text Generation
•
Updated
5 days ago
•
17
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr
Text Generation
•
Updated
5 days ago
•
65
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata
Text Generation
•
Updated
11 days ago
•
24
Dongwei/Qwen-2.5-7B_Base_Math_smalllr
Text Generation
•
Updated
12 days ago
•
133
•
2
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr
Text Generation
•
Updated
12 days ago
•
13
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr
Text Generation
•
Updated
12 days ago
•
7
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr
Text Generation
•
Updated
12 days ago
•
31
Dongwei/Qwen-2.5-7B_Math_smalllr
Text Generation
•
Updated
12 days ago
•
35
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math
Text Generation
•
Updated
13 days ago
•
72
Expand 17 models
datasets
2
Sort: Recently updated
Dongwei/Math_8K_for_GRPO
Viewer
•
Updated
11 days ago
•
8.89k
•
41
•
1
Dongwei/reasoning_world_model
Viewer
•
Updated
Apr 22, 2024
•
15.2k
•
14
•
5