arxiv:2410.06508
Linfeng Song
freesunshine0316
AI & ML interests
Researcher @Tencent AI Lab working on reasoning and RLAIF with LLMs, especially search + RL. Working on NLP since 2010.
Recent Activity
authored a paper about 2 months ago
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
upvoted a paper about 2 months ago
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
commented a paper about 2 months ago
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Organizations
models
None public yet
datasets
None public yet