Qwen/Qwen2.5-Math-PRM-72B



Zhenru committed 3 days ago

Commit 0062179 · verified · 1 parent: 9851cb5

Update README.md

Files changed (1)

README.md +11 -5

@@ -24,6 +24,10 @@ In addition to the mathematical Outcome Reward Model (ORM) Qwen2.5-Math-RM-72B,
 ![](http://qianwen-res.oss-accelerate-overseas.aliyuncs.com/Qwen2.5/Qwen2.5-Math-PRM/Qwen2.5-Math-PRM.png)
+## Model Details
+For more details, please refer to our [paper](https://arxiv.org/pdf/2501.07301).
 ## Requirements
 * `transformers>=4.40.0` for Qwen2.5-Math models. The latest version is recommended.
@@ -122,10 +126,12 @@ print(step_reward)  # [[0.9921875, 0.0047607421875, 0.32421875, 0.8203125]]
 If you find our work helpful, feel free to give us a citation.
 ```
-@article{yang2024qwen25mathtechnicalreportmathematical,
-  title={Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement},
-  author={An Yang and Beichen Zhang and Binyuan Hui and Bofei Gao and Bowen Yu and Chengpeng Li and Dayiheng Liu and Jianhong Tu and Jingren Zhou and Junyang Lin and Keming Lu and Mingfeng Xue and Runji Lin and Tianyu Liu and Xingzhang Ren and Zhenru Zhang},
-  journal={arXiv preprint arXiv:2409.12122},
-  year={2024}
+@article{prmlessons,
+  title={The Lessons of Developing Process Reward Models in Mathematical Reasoning},
+  author={
+    Zhenru Zhang and Chujie Zheng and Yangzhen Wu and Beichen Zhang and Runji Lin and Bowen Yu and Dayiheng Liu and Jingren Zhou and Junyang Lin
+  },
+  journal={arXiv preprint arXiv:2501.07301},
+  year={2025}
 }
 ```