arxiv:2410.01257
Alexander Bukharin
alexwb
AI & ML interests
None yet
Organizations
Papers: 2
Models: 8
alexwb/reward_modeling_anthropic_hh_rm1e-3 • 8
alexwb/reward_modeling_anthropic_hh_rm1e-4 • 8
alexwb/reward_modeling_anthropic_hh_rm1.4e-5 • 8
alexwb/reward_modeling_anthropic_hh_rm1e-6 • 9
alexwb/reward_modeling_anthropic_hh_rm0.99 • 8
alexwb/reward_modeling_anthropic_hh_rm0.9_lr5e-5 • 9
alexwb/reward_modeling_anthropic_hh • Text Classification • 12
alexwb/sft_trl_test • 1
Datasets
None public yet