Haitao Mi

haitaominlp

https://scholar.google.com.sg/citations?user=G3OMbFSm858C&hl=en

AI & ML interests

Large Language Models

haitaominlp's activity

authored a paper 9 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 9 days ago • 51

authored a paper 4 months ago

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

Paper • 2410.06508 • Published Oct 9, 2024 • 10

upvoted 2 papers 4 months ago

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

Paper • 2410.06508 • Published Oct 9, 2024 • 10

DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search

Paper • 2410.03864 • Published Oct 4, 2024 • 11

authored a paper 4 months ago

HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows

Paper • 2409.17433 • Published Sep 25, 2024 • 9

upvoted a paper 4 months ago

HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows

Paper • 2409.17433 • Published Sep 25, 2024 • 9

authored 2 papers 7 months ago

LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29, 2024 • 38

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 97

authored 3 papers 10 months ago

Stabilizing RLHF through Advantage Model and Selective Rehearsal

Paper • 2309.10202 • Published Sep 18, 2023 • 10

The Trickle-down Impact of Reward (In-)consistency on RLHF

Paper • 2309.16155 • Published Sep 28, 2023

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18, 2024 • 55