10 11 20

Shengyi Costa Huang

vwxyzjn

http://costa.sh

AI & ML interests

None yet

Articles

Organizations

Collections 3

Papers 10

spaces 4

No application file

🦀

Test

Runtime error

🔥

Aim

Sleeping

😻

Vwxyzjn Testyes4

Runtime error

📊

Pyserini Wikipedia Kilt Doc

models 389

vwxyzjn/rm_zephyr_new

Text Classification • Updated Sep 26, 2024 • 16

vwxyzjn/online_dpo_vllm_thread_beta_0.03__allenai_open_instruct_dev

Updated Sep 11, 2024

vwxyzjn/reward_modeling__EleutherAI_pythia-14m

Updated Aug 24, 2024 • 15

vwxyzjn/online_dpo_vllm__vwxyzjn_btulu

Updated Aug 23, 2024 • 1

vwxyzjn/online_dpo_vllm__allenai_llama-3-tulu-2-8b

Updated Aug 19, 2024 • 5

vwxyzjn/btulu

Text Generation • Updated Aug 19, 2024 • 135

vwxyzjn/online_dpo_tulu_2

Text Generation • Updated Aug 19, 2024 • 10

vwxyzjn/gkd-model

Updated Aug 15, 2024

vwxyzjn/reward_modeling__allenai_llama-3-tulu-2-8b

Updated Aug 11, 2024 • 46

vwxyzjn/online_dpo__cleanrl_EleutherAI_pythia-1b-dedupedsfttldr

Updated Aug 9, 2024

datasets 282

vwxyzjn/norobot_pref_4860

Viewer • Updated Oct 2, 2024 • 59.9k • 31

vwxyzjn/norobot_generation_4860

Viewer • Updated Oct 2, 2024 • 29.9k • 7

vwxyzjn/norobot_pref_465

Viewer • Updated Oct 2, 2024 • 59.4k • 23

vwxyzjn/norobot_generation_465

Viewer • Updated Oct 2, 2024 • 29.7k • 96

vwxyzjn/norobot_generation_16325

Viewer • Updated Oct 2, 2024 • 29.7k • 48

vwxyzjn/norobot_pref_11421

Viewer • Updated Oct 2, 2024 • 56.1k • 13

vwxyzjn/norobot_generation_11421

Viewer • Updated Oct 2, 2024 • 28k • 158

vwxyzjn/rejection_sampling_scores_1727889563

Viewer • Updated Oct 2, 2024 • 240 • 7

vwxyzjn/rejection_sampling_1727889563

Viewer • Updated Oct 2, 2024 • 60 • 18

vwxyzjn/rejection_sampling_scores_1727889130

Viewer • Updated Oct 2, 2024 • 180 • 7

Shengyi Costa Huang

AI & ML interests

Articles

How NuminaMath Won the 1st AIMO Progress Prize

Preference Optimization for Vision Language Models

Putting RL back in RLHF

Constitutional AI with Open LLMs

The N Implementation Details of RLHF with PPO

Organizations

Collections 3

Papers 10

spaces 4 Sort: Recently updated

Test

Aim

Vwxyzjn Testyes4

Pyserini Wikipedia Kilt Doc

models 389 Sort: Recently updated

datasets 282 Sort: Recently updated

spaces 4

models 389

datasets 282