Contains both curated persona data and preference data that reduce demographic bias.
FiSCo
groupfairnessllm
AI & ML interests
None yet
Recent Activity
updated a collection 1 day ago
Personalization Trap updated a dataset 1 day ago
groupfairnessllm/random_effect_example published a dataset 1 day ago
groupfairnessllm/random_effect_exampleOrganizations
None yet
Tulu3 with distraction mitigation data
LLM and LRM can be easily distracted by hidden instructions or irrelevant tasks. We curated SFT and DPO data that model can finetune to avoid distract
-
groupfairnessllm/tulu-3-preference-data-with-distraction
Viewer • Updated • 1.5k • 9 -
groupfairnessllm/tulu-3-sft-with-distraction
Viewer • Updated • 5.1k • 18 • 2 -
Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense
Paper • 2510.16259 • Published • 4 -
allenai/tulu-3-sft-personas-instruction-following
Viewer • Updated • 30k • 5.56k • 64
Personalization Trap
Contains both curated persona data and preference data that reduce demographic bias.
Tulu3 with distraction mitigation data
LLM and LRM can be easily distracted by hidden instructions or irrelevant tasks. We curated SFT and DPO data that model can finetune to avoid distract
-
groupfairnessllm/tulu-3-preference-data-with-distraction
Viewer • Updated • 1.5k • 9 -
groupfairnessllm/tulu-3-sft-with-distraction
Viewer • Updated • 5.1k • 18 • 2 -
Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense
Paper • 2510.16259 • Published • 4 -
allenai/tulu-3-sft-personas-instruction-following
Viewer • Updated • 30k • 5.56k • 64
models 0
None public yet
datasets 18
groupfairnessllm/random_effect_example
Viewer • Updated • 1.51k • 8
groupfairnessllm/persona
Viewer • Updated • 30 • 7
groupfairnessllm/bias_reduce_preference_data
Viewer • Updated • 641 • 10
groupfairnessllm/tulu-3-sft-with-distraction
Viewer • Updated • 5.1k • 18 • 2
groupfairnessllm/tulu-3-preference-data-with-distraction
Viewer • Updated • 1.5k • 9
groupfairnessllm/tulu-3-sft-personas-code-with-distraction
Viewer • Updated • 1.7k • 16
groupfairnessllm/tulu-3-sft-personas-instruction-following-with-distraction
Viewer • Updated • 1.7k • 9
groupfairnessllm/tulu-3-sft-personas-math-with-distraction
Viewer • Updated • 1.7k • 8
groupfairnessllm/tulu-3-preference-personas-math-with-distraction
Viewer • Updated • 500 • 12
groupfairnessllm/tulu-3-preference-personas-instruction-following-with-distraction
Viewer • Updated • 500 • 6