Alex PRO

avrecum

AI & ML interests

None yet

Recent Activity

updated a dataset 12 days ago
reasoning-proj/rebuttal_multiple_interventions_DeepSeek-R1-Distill-Qwen-7B_nc5
updated a dataset 12 days ago
reasoning-proj/j_rebuttal_multiple_interventions_EXAONE-Deep-32B_nc5
published a dataset 12 days ago
reasoning-proj/j_rebuttal_multiple_interventions_EXAONE-Deep-32B_nc5

Organizations

refusals, Reasoning Project

authored a paper about 1 month ago

Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs

Paper • 2412.16974 • Published Dec 22, 2024 • 3