arxiv:2509.13310

Scaling Agents via Continual Pre-training

Published on Sep 16
· Submitted by Jialong Wu on Sep 17
#2 Paper of the day
Abstract

AgentFounder, a deep research agent model incorporating Agentic Continual Pre-training, achieves state-of-the-art performance in agentic tasks while maintaining strong tool-use ability.

AI-generated summary

Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving. However, post-training approaches built on general-purpose foundation models consistently underperform on agentic tasks, particularly in open-source implementations. We identify the root cause: the absence of robust agentic foundation models forces models during post-training to simultaneously learn diverse agentic behaviors while aligning them to expert demonstrations, thereby creating fundamental optimization tensions. To this end, we are the first to propose incorporating Agentic Continual Pre-training (Agentic CPT) into the deep research agent training pipeline to build powerful agentic foundation models. Based on this approach, we develop a deep research agent model named AgentFounder. We evaluate our AgentFounder-30B on 10 benchmarks and achieve state-of-the-art performance while retaining strong tool-use ability, notably 39.9% on BrowseComp-en, 43.3% on BrowseComp-zh, and 31.5% Pass@1 on HLE.
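
The abstract frames Agentic CPT as an extra training stage between general pre-training and agentic post-training. Below is a minimal sketch of what such a continual pre-training stage could look like: standard next-token training of a base causal LM on serialized agent trajectories. The base checkpoint name, data file, and hyperparameters are illustrative assumptions, not the paper's actual configuration.

```python
# Minimal sketch of an agentic continual pre-training (CPT) stage, as described
# at a high level in the abstract: before agentic post-training (SFT/RL), a base
# model is further trained with a plain causal-LM objective on agent-style data
# (tool calls, multi-step reasoning traces). All names below are assumptions.
import torch
from torch.utils.data import DataLoader
from transformers import AutoModelForCausalLM, AutoTokenizer

BASE_MODEL = "Qwen/Qwen3-30B-A3B"  # assumption: any strong base checkpoint

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

device = "cuda" if torch.cuda.is_available() else "cpu"
model = AutoModelForCausalLM.from_pretrained(BASE_MODEL, torch_dtype=torch.bfloat16).to(device)
model.train()

# Hypothetical corpus: serialized agent trajectories (thoughts, tool calls,
# tool results), one example per line.
with open("agentic_trajectories.txt") as f:
    texts = [line.strip() for line in f if line.strip()]

def collate(batch):
    return tokenizer(batch, return_tensors="pt", padding=True,
                     truncation=True, max_length=4096)

loader = DataLoader(texts, batch_size=2, shuffle=True, collate_fn=collate)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

for batch in loader:
    batch = {k: v.to(device) for k, v in batch.items()}
    labels = batch["input_ids"].clone()
    labels[batch["attention_mask"] == 0] = -100  # ignore padding in the loss
    # Plain next-token loss over the whole trajectory: agentic behaviors are
    # instilled broadly here, before post-training aligns the model to expert
    # demonstrations.
    loss = model(**batch, labels=labels).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```

The point the abstract argues for is the ordering: agentic behaviors are acquired during this broad language-modeling stage, so later post-training can focus on alignment to expert demonstrations rather than learning those behaviors from scratch.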

Community



How are AgentFounder-30B and WebSailor-V2-30B-A3B connected with Tongyi-DeepResearch-30B-A3B? Is each of these agents a separate model?


Thanks for your attention. Actually, Tongyi DeepResearch adopts the methods and data from AgentFounder and WebSailor-v2. A more detailed version will be included in our future technical reports (when available). However, the data and models used in AgentFounder and WebSailor-v2 may be derived from exploratory experiments and may differ from the final DeepResearch model.

Thank you for sharing this very interesting paper! Very cool work! We previously did a similar exploration of using CPT for agent training: https://arxiv.org/pdf/2502.06589 and reached some similar findings. It is great to see the performance further boosted with stronger models (Qwen series) and a larger amount of data.


Pretty cool! I hadn't noticed this paper before. I will read your work carefully, and it is very likely that we will cite your work in the updated version! Thank you for your reply!

