WilliamHuang91's Collections
MAPO: Mixed Advantage Policy Optimization

updated Sep 24

  • MAPO: Mixed Advantage Policy Optimization

    Paper • 2509.18849 • Published Sep 23 • 26

  • WilliamHuang91/qwen2_5_vl_7b_geo_12_grpo

    8B • Updated Sep 24 • 2

  • WilliamHuang91/qwen2_5_vl_7b_emoset_12_grpo

    8B • Updated Sep 24 • 1

  • WilliamHuang91/qwen2_5_vl_7b_emoset_12_grpomixpro

    8B • Updated Sep 24 • 4

  • WilliamHuang91/qwen2_5_vl_7b_geo_12_grpomixpro

    8B • Updated Sep 24 • 2

  • WilliamHuang91/qwen2_5_vl_7b_geo_12_dapo_grpo

    8B • Updated Sep 24 • 2

  • WilliamHuang91/qwen2_5_vl_7b_emoset_12_dapo_grpo

    8B • Updated Sep 24 • 2

  • WilliamHuang91/MAPO_Emotion_OOD_Dataset

    Viewer • Updated Sep 24 • 2.59k • 86

  • WilliamHuang91/MAPO_Math_OOD_Dataset

    Viewer • Updated Sep 24 • 11.5k • 11.8k • 1