Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
6
Sahand Rezaei-Shoshtari
sahandrez
Follow
https://sahandrez.github.io/
sahandrez
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a model
about 2 hours ago
sahandrez/rloo-paired-Qwen2.5-1.5B-ultrafeedback-binarized-20241125-125438
updated
a model
4 days ago
sahandrez/sft-Qwen2.5-1.5B-ultrafeedback
updated
a model
4 days ago
sahandrez/sft-Qwen2.5-1.5B-ultrafeedback
View all activity
Organizations
None yet
sahandrez
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
2 models
2 months ago
google/gemma-2-2b-it
Text Generation
•
Updated
Aug 27
•
881k
•
•
693
google/gemma-2-2b
Text Generation
•
Updated
Aug 7
•
5.12M
•
437
liked
2 datasets
3 months ago
allenai/tulu-2.5-preference-data
Viewer
•
Updated
Jul 22
•
2.12M
•
1.32k
•
17
allenai/preference-test-sets
Viewer
•
Updated
Mar 14
•
43.2k
•
451
•
20
liked
2 datasets
about 1 year ago
allenai/real-toxicity-prompts
Viewer
•
Updated
Sep 30, 2022
•
99.4k
•
1.15k
•
59
Anthropic/hh-rlhf
Viewer
•
Updated
May 26, 2023
•
169k
•
9.66k
•
1.21k