Dongfu Jiang's picture

Dongfu Jiang

DongfuJiang

·

https://jdf-prog.github.io/

AI & ML interests

Large Language Model, Modality Reasoning and their evaluation

Recent Activity

upvoted a paper 1 day ago

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

commented on a paper 1 day ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

authored a paper 3 days ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

View all activity

Organizations

upvoted a paper 1 day ago

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Paper • 2509.04292 • Published 1 day ago • 41

commented a paper 1 day ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published 5 days ago • 59 •

authored a paper 3 days ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published 5 days ago • 59

upvoted 4 papers 3 days ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published 4 days ago • 99

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published 4 days ago • 76

OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning

Paper • 2509.01644 • Published 5 days ago • 24

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published 5 days ago • 59

commented a paper 3 days ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published 5 days ago • 59 •

updated a dataset 9 days ago

DongfuJiang/hle_text_only

Viewer • Updated 9 days ago • 2.16k • 1.12k

published a dataset 9 days ago

DongfuJiang/hle_text_only

Viewer • Updated 9 days ago • 2.16k • 1.12k

liked a model 10 days ago

Qwen/Qwen1.5-MoE-A2.7B

Text Generation • 14B • Updated Apr 18, 2024 • 43.1k • 206

liked a dataset 10 days ago

cais/hle

Viewer • Updated 15 days ago • 2.5k • 11.9k • 475

upvoted a paper 11 days ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published 12 days ago • 179

updated 2 models 12 days ago

VerlTool/pixel_reasoner-7b-grpo-n8-b128-t1.0-lr1e-6-complex-reward-new_global_step_50

8B • Updated 12 days ago • 202

VerlTool/deepsearch-qwen_qwen3-8b-grpo-n16-b128-t1.0-lr1e-6-new_global_step_70

8B • Updated 12 days ago • 363

published 2 models 12 days ago

VerlTool/pixel_reasoner-7b-grpo-n8-b128-t1.0-lr1e-6-complex-reward-new_global_step_50

8B • Updated 12 days ago • 202

VerlTool/deepsearch-qwen_qwen3-8b-grpo-n16-b128-t1.0-lr1e-6-new_global_step_70

8B • Updated 12 days ago • 363

liked a model 13 days ago

xai-org/grok-2

Updated 13 days ago • 4.6k • 915

updated a dataset 16 days ago

VerlTool/deepsearch

Viewer • Updated 16 days ago • 4.8k • 256

liked a model 16 days ago

nvidia/NVIDIA-Nemotron-Nano-9B-v2

Text Generation • 9B • Updated 7 days ago • 81.3k • 314