SAMBIT CHAKRABORTY's picture

89 9

SAMBIT CHAKRABORTY

sambitchakhf03

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training

upvoted a paper 1 day ago

DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

upvoted a paper 1 day ago

Demystifying Reinforcement Learning in Agentic Reasoning

View all activity

Organizations

upvoted 3 papers 1 day ago

Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training

Paper • 2510.12586 • Published 3 days ago • 102

DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

Paper • 2510.12801 • Published 3 days ago • 12

Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published 4 days ago • 27

upvoted a paper 2 days ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published 4 days ago • 153

upvoted a paper 5 days ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published 11 days ago • 87

upvoted 2 papers 9 days ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 11 days ago • 398

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

Paper • 2510.04996 • Published 11 days ago • 15

upvoted a paper 16 days ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published 21 days ago • 29

upvoted 3 papers about 1 month ago

Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Paper • 2509.06949 • Published Sep 8 • 56

Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published Sep 4 • 73

Gated Associative Memory: A Parallel O(N) Architecture for Efficient Sequence Modeling

Paper • 2509.00605 • Published Aug 30 • 42

upvoted 3 papers 2 months ago

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14 • 59

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14 • 94

Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory

Paper • 2508.09736 • Published Aug 13 • 56

upvoted 3 papers 3 months ago

AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs

Paper • 2507.05687 • Published Jul 8 • 27

Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Paper • 2507.07095 • Published Jul 9 • 54

SingLoRA: Low Rank Adaptation Using a Single Matrix

Paper • 2507.05566 • Published Jul 8 • 112

upvoted 3 papers 4 months ago

Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact

Paper • 2507.00951 • Published Jul 1 • 24

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Paper • 2506.20639 • Published Jun 25 • 31

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 42