Zhaolin Gao's picture

2 1 6

Zhaolin Gao

GitBag

·

https://zhaolingao.github.io/

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

updated a dataset about 2 hours ago

GitBag/llama3-ultrafeedback-reasoning-ReRe-armo-tokenized_harvard

updated a dataset about 16 hours ago

GitBag/llama3-ultrafeedback-reasoning-ReRe-armo-tokenized

updated a model 3 days ago

GitBag/reasoning_rebel_iter_5_1731714556_eta_1e3_lr_3e-7_1731931011

Organizations

GitBag's activity

New activity in GitBag/multiturn_1_4 2 months ago

Dataset Viewer issue: ResponseNotFound

#1 opened 2 months ago by

New activity in Cornell-AGI/REBEL-Llama-3-epoch_2 6 months ago

model weights

#1 opened 6 months ago by