Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
1
rokosbasilisk
rb
Follow
gsvc's profile picture
1 follower
·
3 following
http://fullwrong.com
arambharadwaj
AI & ML interests
Large multi-modal world models
Recent Activity
updated
a dataset
9 days ago
antieval/plots_replication
published
a dataset
9 days ago
antieval/plots_replication
updated
a dataset
16 days ago
antieval/unused_repro_deploy_data
View all activity
Organizations
rb
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a dataset
9 days ago
antieval/plots_replication
Viewer
•
Updated
9 days ago
•
11
•
140
published
a dataset
9 days ago
antieval/plots_replication
Viewer
•
Updated
9 days ago
•
11
•
140
updated
a dataset
16 days ago
antieval/unused_repro_deploy_data
Updated
16 days ago
•
344
updated
a dataset
23 days ago
antieval/generator_confound_results
Updated
30 days ago
•
459
New activity in
antieval/repro
26 days ago
Add dataclaw deployment dataset (diverse 40 per model with tools)
1
#1 opened 26 days ago by
rb
published
a dataset
26 days ago
antieval/unused_repro_deploy_data
Updated
16 days ago
•
344
published
a dataset
30 days ago
antieval/generator_confound_results
Updated
30 days ago
•
459
updated
a dataset
about 1 month ago
antieval/unused_generator_confound_data
Updated
Mar 30
•
31
published
a dataset
about 1 month ago
antieval/unused_generator_confound_data
Updated
Mar 30
•
31
updated
a dataset
about 1 month ago
antieval/frontier_sweep_evals
Updated
Mar 19
•
90
published
a dataset
about 1 month ago
antieval/frontier_sweep_evals
Updated
Mar 19
•
90
updated
a dataset
about 2 months ago
antieval/swebench-trajectories
Viewer
•
Updated
Mar 16
•
200
•
29
published
a dataset
about 2 months ago
antieval/swebench-trajectories
Viewer
•
Updated
Mar 16
•
200
•
29
updated
a dataset
about 2 months ago
antieval/cybench-trajectories
Viewer
•
Updated
Mar 16
•
190
•
16
published
a dataset
about 2 months ago
antieval/cybench-trajectories
Viewer
•
Updated
Mar 16
•
190
•
16
updated
a dataset
about 2 months ago
antieval/agentharm-trajectories
Viewer
•
Updated
Mar 16
•
160
•
13
published
a dataset
about 2 months ago
antieval/agentharm-trajectories
Viewer
•
Updated
Mar 16
•
160
•
13
liked
a Space
11 months ago
Running
Agents
105
puzzle
👁
105
Get a numeric score for any text input
updated
a dataset
about 1 year ago
rb/aime_reasoning
Viewer
•
Updated
Feb 28, 2025
•
1.01k
•
12
published
a dataset
about 1 year ago
rb/aime_reasoning
Viewer
•
Updated
Feb 28, 2025
•
1.01k
•
12
Load more