Anikait Singh's picture

4 8 5

Anikait Singh

Asap7772

·

https://asap7772.github.io

AI & ML interests

Deep Learning, Reinforcement Learning, Robotics

Recent Activity

updated a dataset about 8 hours ago

Asap7772/arc-agi-mixed-max4096-newqwen-sft1e-5-test-abs-impabswithold-abs-3of8

updated a dataset about 8 hours ago

Asap7772/arc-agi-mixed-max4096-newqwen-sft1e-5-test-abs-impabswithold-abs-4of8

updated a dataset about 9 hours ago

Asap7772/arc-agi-mixed-max4096-newqwen-sft1e-5-test-abs-impabswithold-abs-7of8

View all activity

Organizations

upvoted a paper about 1 month ago

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31 • 112

upvoted a paper about 2 months ago

LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing

Paper • 2507.00769 • Published Jul 1 • 4

upvoted a paper 3 months ago

Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction

Paper • 2506.07976 • Published Jun 9 • 6

upvoted 2 papers 6 months ago

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Paper • 2503.01307 • Published Mar 3 • 39

FSPO: Few-Shot Preference Optimization of Synthetic Preference Data in LLMs Elicits Effective Personalization to Real Users

Paper • 2502.19312 • Published Feb 26 • 7

upvoted a paper 8 months ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 99

upvoted a paper over 1 year ago

Teaching Large Language Models to Reason with Reinforcement Learning

Paper • 2403.04642 • Published Mar 7, 2024 • 51

upvoted a paper almost 2 years ago

Robotic Offline RL from Internet Videos via Value-Function Pre-Training

Paper • 2309.13041 • Published Sep 22, 2023 • 8