Collection of datasets and models for our paper "Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas"
Nishant Balepur
nbalepur
AI & ML interests
NLP
Recent Activity
updated
a collection
about 1 month ago
Alignment Personalization
updated
a collection
about 1 month ago
Alignment Personalization
updated
a collection
about 1 month ago
Alignment Personalization
Organizations
Collections
2
Papers
1
models
8
nbalepur/Llama-3.1-8B-PT-DPO-HHH
Updated
nbalepur/Llama-3.1-8B-PT-DPO-Mnemonic
Updated
nbalepur/Llama-3.1-8B-PT-DPO-BeaverTails
Text Generation
•
Updated
•
8
nbalepur/Llama-3.1-8B_copy_persona_False_Mnemonic_dpo_chosen
Text Generation
•
Updated
•
1
nbalepur/Llama-3.1-8B_copy_persona_False_Safe_RLHF_dpo_chosen
Text Generation
•
Updated
•
6
nbalepur/LLama-2-70b-Mnemonic-Tokenizer
Updated
nbalepur/LLama-2-70b-Mnemonic-SFT
Text Generation
•
Updated
•
24
nbalepur/LLama-2-70b-Mnemonic-DPO
Text Generation
•
Updated
•
36
datasets
85
nbalepur/persona-inference
Viewer
•
Updated
•
1.2k
•
124
nbalepur/persona-tailoring
Viewer
•
Updated
•
5.35k
•
183
nbalepur/personas_vague
Viewer
•
Updated
•
37.8k
•
45
nbalepur/persona_qual_fixed6
Viewer
•
Updated
•
15
•
38
nbalepur/persona_qual_fixed5
Viewer
•
Updated
•
15
•
41
nbalepur/persona_qual_fixed4
Viewer
•
Updated
•
15
•
38
nbalepur/persona_qual_fixed3
Viewer
•
Updated
•
15
•
37
nbalepur/persona_qual_fixed2
Viewer
•
Updated
•
30
•
43
nbalepur/persona_qual_fixed
Viewer
•
Updated
•
30
•
53
nbalepur/persona_qual
Viewer
•
Updated
•
30
•
36