Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sumuk Shashidhar's picture
10 9 17

Sumuk Shashidhar PRO

sumuks
Tonic's profile picture Sophia911's profile picture patrickfleith's profile picture
·
https://sumuk.org
  • sumukx
  • sumukshashidhar
  • sumuks

AI & ML interests

Evaluations, Reasoning, Long Term Planning

Recent Activity

updated a dataset about 5 hours ago
sumuks/openai-coval-dpo
published a dataset about 5 hours ago
sumuks/openai-coval-dpo
updated a dataset 17 days ago
sumuks/preference-atlas-rewards
View all activity

Organizations

Blog-explorers's profile picture Verifiers For Code's profile picture Preference Agents's profile picture Sumuk's Archived Content's profile picture UIUC Conversational AI Lab's profile picture self-planner's profile picture Nerdy Face's profile picture Sumuk's Testing Grounds!'s profile picture Spiral Works's profile picture Your Bench's profile picture Sumuk's Second Set of Archived Content's profile picture InfoHunt's profile picture TextCleanLM's profile picture Sumuk's First Archival Storage Volume's profile picture popper's profile picture Sumuk's Archival Storage 2's profile picture Sumuk's Archival Storage 3's profile picture

Articles 1

Article
4

Getting Started with YourBench

Papers 5

arxiv:2505.01592
arxiv:2504.20090
arxiv:2504.01833
arxiv:2410.03731

models 0

None public yet

datasets 29

sumuks/openai-coval-dpo

Viewer • Updated about 4 hours ago • 5.58k

sumuks/preference-atlas-rewards

Viewer • Updated 17 days ago • 5.03k • 32

sumuks/preference-atlas

Viewer • Updated 17 days ago • 329k • 102 • 1

sumuks/reward-bench-2

Viewer • Updated 17 days ago • 1.87k • 43

sumuks/helpsteer3

Viewer • Updated 18 days ago • 49.1k • 250

sumuks/helpsteer3-easy

Viewer • Updated 24 days ago • 7.93k • 44

sumuks/helpsteer-pairwise-grading

Viewer • Updated 29 days ago • 51.8k • 21

sumuks/rupo-eval-logs-helpsteer3-1

Viewer • Updated about 1 month ago • 1.43k • 28

sumuks/helpsteer3-rupo

Viewer • Updated about 1 month ago • 38.2k • 91

sumuks/persuasiveness_detection

Viewer • Updated Feb 6 • 3.94k • 10
View 29 datasets
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs