10 24

YANG SHU

babytreecc

AI & ML interests

None yet

Recent Activity

upvoted a paper 29 days ago

Co-Evolving Policy Distillation

authored a paper about 1 month ago

Multi-User Large Language Model Agents

upvoted a paper about 2 months ago

Multi-User Large Language Model Agents

View all activity

Organizations

upvoted a paper 29 days ago

Co-Evolving Policy Distillation

Paper • 2604.27083 • Published Apr 29 • 67

authored a paper about 1 month ago

Multi-User Large Language Model Agents

Paper • 2604.08567 • Published Mar 19 • 27

upvoted a paper about 2 months ago

Multi-User Large Language Model Agents

Paper • 2604.08567 • Published Mar 19 • 27

updated a Space 4 months ago

Trackio

🚀

published a Space 4 months ago

Trackio

🚀

liked a dataset 4 months ago

LLM-Tuning-Safety/HEx-PHI

Preview • Updated Aug 19, 2024 • 489 • 64

upvoted a collection 5 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 190

liked a dataset 5 months ago

javirandor/hh-rlhf-safety-v3-dpo

Viewer • Updated Mar 28, 2025 • 9.85k • 29 • 3

upvoted a paper 6 months ago

Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning

Paper • 2510.04081 • Published Oct 5, 2025 • 23

authored a paper 8 months ago

When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced Misalignment

Paper • 2509.00544 • Published Aug 30, 2025 • 11

upvoted a paper 8 months ago

When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced Misalignment

Paper • 2509.00544 • Published Aug 30, 2025 • 11

updated a dataset 8 months ago

babytreecc/DeliberationBank

Viewer • Updated Oct 1, 2025 • 10 • 7 • 1

published a dataset 8 months ago

babytreecc/DeliberationBank

Viewer • Updated Oct 1, 2025 • 10 • 7 • 1

updated a model 8 months ago

babytreecc/DeliberationJudge

0.2B • Updated Oct 1, 2025

published a model 8 months ago

babytreecc/DeliberationJudge

0.2B • Updated Oct 1, 2025

upvoted a collection 9 months ago

Awesome SFT datasets

Collection

A curated list of interesting datasets to fine-tune language models with. • 41 items • Updated Mar 2 • 154

liked a model 10 months ago

Qwen/Qwen3-4B

Text Generation • 4B • Updated Jul 26, 2025 • 12.8M • 621

updated a dataset 10 months ago

babytreecc/Implicit-suicide-detection

Viewer • Updated Aug 3, 2025 • 1.61k • 36 • 1

published a dataset 10 months ago

babytreecc/Implicit-suicide-detection

Viewer • Updated Aug 3, 2025 • 1.61k • 36 • 1

liked a dataset 10 months ago

inclusionAI/AReaL-boba-2-RL-Code

Viewer • Updated Jul 2, 2025 • 399 • 82 • 7

YANG SHU

AI & ML interests

Recent Activity

Organizations

babytreecc's activity

Trackio

Trackio