Update README.md
```python
responses = model.generate(input_prompts, sampling_params)
print(responses[0].outputs[0].text)
```
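The lines above are only the tail of the quick-start snippet. For context, a minimal self-contained sketch of the surrounding setup, assuming vLLM's `LLM`/`SamplingParams` API; the model id, prompt, and sampling values below are illustrative assumptions rather than values taken from this README:

```python
from vllm import LLM, SamplingParams

# Assumed checkpoint id; substitute the model you actually want to run.
model = LLM(model="RUCAIBox/STILL-3-1.5B-preview")

# Illustrative sampling settings for long, step-by-step generations.
sampling_params = SamplingParams(temperature=0.6, top_p=0.95, max_tokens=4096)

# A single example prompt; in practice pass a list of your own prompts.
input_prompts = ["Solve step by step: what is 17 * 24?"]

responses = model.generate(input_prompts, sampling_params)
print(responses[0].outputs[0].text)
```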
# Reference

Please cite our reports if they are helpful for your research.

```
@article{Slow_Thinking_with_LLMs_3_Preview,
  title={STILL-3-1.5B-preview: Enhancing Slow Thinking Abilities of Small Models through Reinforcement Learning},
  author={RUCAIBox STILL Team},
  url={https://github.com/RUCAIBox/Slow_Thinking_with_LLMs},
  year={2025}
}
```

```
@article{Slow_Thinking_with_LLMs_1,
  title={Enhancing LLM Reasoning with Reward-guided Tree Search},
  author={Jiang, Jinhao and Chen, Zhipeng and Min, Yingqian and Chen, Jie and Cheng, Xiaoxue and Wang, Jiapeng and Tang, Yiru and Sun, Haoxiang and Deng, Jia and Zhao, Wayne Xin and Liu, Zheng and Yan, Dong and Xie, Jian and Wang, Zhongyuan and Wen, Ji-Rong},
  journal={arXiv preprint arXiv:2411.11694},
  year={2024}
}
```

```
@article{Slow_Thinking_with_LLMs_2,
  title={Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems},
  author={Min, Yingqian and Chen, Zhipeng and Jiang, Jinhao and Chen, Jie and Deng, Jia and Hu, Yiwen and Tang, Yiru and Wang, Jiapeng and Cheng, Xiaoxue and Song, Huatong and Zhao, Wayne Xin and Liu, Zheng and Wang, Zhongyuan and Wen, Ji-Rong},
  journal={arXiv preprint arXiv:2412.09413},
  year={2024}
}
```