peng

superpeng

AI & ML interests

None yet

Organizations

None yet

superpeng's activity

upvoted an article 14 days ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Aug 19 • 74

liked a dataset 18 days ago

O1-OPEN/OpenO1-SFT

Viewer • Updated 1 day ago • 77.7k • 1.59k • 217

liked a dataset 23 days ago

medalpaca/medical_meadow_wikidoc

Viewer • Updated Apr 6, 2023 • 10k • 975 • 40

liked a dataset about 1 month ago

kaiokendev/SuperCOT-dataset

Viewer • Updated May 26, 2023 • 58.3k • 68 • 46

liked a model about 1 month ago

kaiokendev/SuperCOT-LoRA

Updated May 6, 2023 • 104

liked a dataset about 2 months ago

RLHFlow/prompt-collection-v0.1

Viewer • Updated May 8 • 179k • 51 • 8

liked a Space about 2 months ago

📐 Reward Bench Leaderboard

Running • 293

upvoted a collection about 2 months ago

Skywork-Reward-Data-Collection

Collection

Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12 • 11

liked 2 datasets about 2 months ago

tasksource/oasst1_pairwise_rlhf_reward

Viewer • Updated Jul 4, 2023 • 18.9k • 126 • 42

BAAI/AquilaMed-RL

Viewer • Updated Jun 21 • 12.7k • 66 • 8

updated a collection 2 months ago

LLM Pretrain

Collection

6 items • Updated Oct 8

liked 3 datasets 3 months ago

liked a model 3 months ago

abacusai/Smaug-Qwen2-72B-Instruct

Text Generation • Updated Aug 6 • 2.79k • 9

liked 4 datasets 4 months ago

Magpie-Align/Magpie-Gemma2-Pro-200K-Filtered

Viewer • Updated Jul 22 • 200k • 140 • 13

WizardLMTeam/WizardLM_evol_instruct_V2_196k

Viewer • Updated Mar 10 • 143k • 509 • 229

RyokoAI/ShareGPT52K

Preview • Updated Apr 2, 2023 • 202 • 310

shibing624/huatuo_medical_qa_sharegpt

Viewer • Updated Jan 29 • 276k • 74 • 16

liked a model 4 months ago

nvidia/Llama3-70B-SteerLM-RM

Updated Jun 19 • 13 • 42