The supervised finetuned model for GSM8k in Alphazero-like tree-search can guide large language model decoding and training, ICML 2024
@article{feng2023alphazero,
title={Alphazero-like tree-search can guide large language model decoding and training},
author={Feng, Xidong and Wan, Ziyu and Wen, Muning and Wen, Ying and Zhang, Weinan and Wang, Jun},
journal={arXiv preprint arXiv:2309.17179},
year={2023}
}
- Downloads last month
- 675
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.