Konstantin Grotov's picture

3 2

Konstantin Grotov

konstantgr

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

authored a paper about 1 month ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

new activity about 1 month ago

JetBrains-Research/PIPer-8B-RL-only:Improve model card: Add paper and code badges, update datasets metadata

View all activity

Organizations

upvoted a paper 11 days ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Paper • 2510.23393 • Published 12 days ago • 20

authored a paper about 1 month ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 35

New activity in JetBrains-Research/PIPer-8B-RL-only about 1 month ago

Improve model card: Add paper and code badges, update datasets metadata

#1 opened about 1 month ago by

New activity in JetBrains-Research/PIPer-8B about 1 month ago

Improve model card: Add paper and code links

#1 opened about 1 month ago by

updated a collection about 1 month ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 2

upvoted a paper about 1 month ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 35

updated a collection about 1 month ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 2

updated a dataset about 1 month ago

JetBrains-Research/PIPer-envbench-zeroshot-rl

Viewer • Updated Oct 2 • 742 • 64 • 1

published a dataset about 1 month ago

JetBrains-Research/PIPer-envbench-zeroshot-rl

Viewer • Updated Oct 2 • 742 • 64 • 1

updated a collection about 1 month ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 2

updated a dataset about 1 month ago

JetBrains-Research/PIPer-SFT-2500-sharegpt

Viewer • Updated Oct 2 • 2.5k • 34 • 1

published a dataset about 1 month ago

JetBrains-Research/PIPer-SFT-2500-sharegpt

Viewer • Updated Oct 2 • 2.5k • 34 • 1

updated a collection about 1 month ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 2

updated a dataset about 1 month ago

JetBrains-Research/PIPer-eval

Preview • Updated Sep 30 • 38

published a dataset about 1 month ago

JetBrains-Research/PIPer-eval

Preview • Updated Sep 30 • 38

updated a collection about 1 month ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 2

updated a model about 1 month ago

JetBrains-Research/Qwen3-8B-am

Text Generation • 8B • Updated Sep 30 • 59

updated a collection about 1 month ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 2

updated a model about 1 month ago

JetBrains-Research/PIPer-8B-SFT-only

Text Generation • 8B • Updated Sep 30 • 3