ARPO - a dongguanting Collection

Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

dongguanting 's Collections

ARPO

ARPO

updated Jul 29

The official datasets and model checkpoints of ARPO

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 147
dongguanting/Qwen3-8B-ARPO-DeepSearch

8B • Updated Jul 29 • 31 • 1
dongguanting/Qwen3-14B-ARPO-DeepSearch

Text Generation • 15B • Updated 24 days ago • 63 • 4
dongguanting/Qwen2.5-7B-ARPO

Text Generation • 8B • Updated 17 days ago • 72 • 2
dongguanting/Llama3.1-8B-ARPO

Text Generation • 8B • Updated 24 days ago • 16 • 1
dongguanting/Qwen2.5-3B-ARPO

Text Generation • 3B • Updated 24 days ago • 22 • 1
dongguanting/ARPO-SFT-54K

Viewer • Updated 24 days ago • 54.6k • 528 • 9
dongguanting/ARPO-RL-Reasoning-10K

Viewer • Updated 24 days ago • 10k • 223 • 3
dongguanting/ARPO-RL-DeepSearch-1K

Viewer • Updated Jul 29 • 1.07k • 161 • 4

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs