Aditya Kothari
AdityaKothari
Recent Activity
liked a model 18 days ago
m-a-p/YuE-s1-7B-anneal-en-cot
liked a model about 1 month ago
meta-llama/Llama-3.2-3B
reacted to di-zhang-fdu's post with 👍 3 months ago
LLaMA-O1: Open Large Reasoning Model Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace
Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dual policy paradigm, and Large Language Models!
https://github.com/SimpleBerry/LLaMA-O1/
What will happen when you compound MCTS ❤ LLM ❤ Self-Play ❤ RLHF?
Just a little bite of strawberry!🍓
Past related works:
https://huggingface.co/papers/2410.02884
https://huggingface.co/papers/2406.07394