Sahand Rezaei-Shoshtari

sahandrez

https://sahandrez.github.io/

AI & ML interests

Reinforcement Learning

Recent Activity

updated a model about 2 hours ago

sahandrez/rloo-paired-Qwen2.5-1.5B-ultrafeedback-binarized-20241125-125438

updated a model 4 days ago

sahandrez/sft-Qwen2.5-1.5B-ultrafeedback

Organizations

None yet

sahandrez's activity

liked 2 models 2 months ago

google/gemma-2-2b-it

Text Generation • Updated Aug 27 • 881k • 693

google/gemma-2-2b

Text Generation • Updated Aug 7 • 5.12M • 437

liked 2 datasets 3 months ago

allenai/tulu-2.5-preference-data

Viewer • Updated Jul 22 • 2.12M • 1.32k • 17

allenai/preference-test-sets

Viewer • Updated Mar 14 • 43.2k • 451 • 20

liked 2 datasets about 1 year ago

allenai/real-toxicity-prompts

Viewer • Updated Sep 30, 2022 • 99.4k • 1.15k • 59

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 9.66k • 1.21k