Pratyush Ranjan Tiwari PRO
pratyushrt
AI & ML interests
Reinforcements Learning, Privacy, Post-training LLMs, SLMs
Recent Activity
liked a Space 17 days ago
HuggingFaceTB/smol-training-playbook updated a Space 5 months ago
eternisai/README authored a paper 5 months ago
Hard Examples Are All You Need: Maximizing GRPO Post-Training Under
Annotation Budgets