Introduction
Tongyi DeepResearch is an open-source agentic large language model and toolkit from Tongyi Lab / Alibaba-NLP, designed for long-horizon information seeking and deep-research tasks. The model family includes a 30.5B-parameter model that activates only a fraction of its parameters per token, keeping inference efficient. The project provides synthetic data pipelines, agent training and inference frameworks, benchmark scripts, and example code for reproducibility and evaluation.
Key Features
- Fully automated synthetic data generation pipeline for agentic pre-training and supervised fine-tuning.
- End-to-end reinforcement learning pipeline built on a customized Group Relative Policy Optimization (GRPO) algorithm to improve training stability on long-horizon tasks.
- Compatibility with multiple inference paradigms (ReAct, IterResearch) plus comprehensive evaluation and benchmark tooling; a minimal ReAct-style loop is sketched after this list.
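To make the ReAct paradigm mentioned above concrete, here is a minimal, illustrative think/act/observe loop. The `call_model` callable, the tool registry, and the `Action:` / `Final Answer:` markers are hypothetical placeholders for this sketch, not the project's actual interfaces.

```python
# Minimal ReAct-style loop (illustrative sketch only; not Tongyi DeepResearch's agent code).
from typing import Callable, Dict

def react_loop(
    question: str,
    call_model: Callable[[str], str],          # hypothetical LLM call: prompt -> next step text
    tools: Dict[str, Callable[[str], str]],    # hypothetical tool registry: name -> tool function
    max_steps: int = 8,
) -> str:
    transcript = f"Question: {question}\n"
    for _ in range(max_steps):
        step = call_model(transcript)          # model emits a Thought, an Action, or a Final Answer
        transcript += step + "\n"
        if "Final Answer:" in step:
            return step.split("Final Answer:", 1)[1].strip()
        if "Action:" in step:
            # Parse "Action: tool_name[argument]", run the tool, feed back the observation.
            action = step.split("Action:", 1)[1].strip()
            name, arg = action.split("[", 1)
            observation = tools[name.strip()](arg.rstrip("]"))
            transcript += f"Observation: {observation}\n"
    return "No final answer within the step budget."
```

In practice one would pass a real model client as `call_model` and, for example, a web-search function and a calculator as `tools`; the loop simply alternates model steps with tool observations until the model emits a final answer or exhausts the step budget.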
Use Cases
- Information retrieval and knowledge discovery: long-context web retrieval, cross-document QA, and evidence aggregation (a toy aggregation example follows this list).
- Automated research assistant: literature discovery, experiment plan generation, and result summarization.
- Multi-tool collaborative agents: complex task execution combining retrieval, computation, and external APIs.
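As a rough illustration of the cross-document QA and evidence-aggregation use case, the sketch below scores passages from several documents against a question with a naive word-overlap heuristic and concatenates the top hits into a single context. The function names and the scoring heuristic are assumptions for illustration, not the project's retrieval stack.

```python
# Toy cross-document evidence aggregation (illustrative only; a naive word-overlap
# scorer stands in for a real retriever or web-search tool).
from typing import List

def score(question: str, passage: str) -> float:
    """Fraction of question words that also appear in the passage."""
    q_words = set(question.lower().split())
    p_words = set(passage.lower().split())
    return len(q_words & p_words) / max(len(q_words), 1)

def aggregate_evidence(question: str, documents: List[str],
                       top_k: int = 3, chunk_size: int = 200) -> str:
    """Split each document into word chunks, rank chunks by overlap with the
    question, and join the best ones into one evidence context."""
    chunks: List[str] = []
    for doc in documents:
        words = doc.split()
        chunks += [" ".join(words[i:i + chunk_size]) for i in range(0, len(words), chunk_size)]
    ranked = sorted(chunks, key=lambda c: score(question, c), reverse=True)
    return "\n---\n".join(ranked[:top_k])
```

The aggregated context would then be handed to the agent as grounding for its answer; a real deployment would replace the overlap heuristic with dense retrieval or a live web-search tool.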
Technical Highlights
- Large-scale continual pre-training with task-oriented synthetic data to enhance reasoning and retrieval abilities.
- Token-level policy-gradient RL design with negative-sample selection and stabilization techniques (an illustrative advantage/loss sketch follows this list).
- Provides HuggingFace / ModelScope model links and inference scripts to ease engineering adoption (see the loading sketch below).
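For the RL design noted above, the following is a generic sketch of group-relative advantage estimation with a clipped, token-level policy-gradient loss. It follows the standard GRPO formulation and should be read as an illustration under stated assumptions, not the project's customized algorithm; the negative-sample selection and other stabilization tricks are omitted here.

```python
# Generic GRPO-style sketch (illustrative; not the project's customized RL pipeline).
import torch

def grpo_advantages(rewards: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """rewards: (num_prompts, group_size) rollout rewards.
    Each rollout's advantage is its reward normalized by its group's mean and std."""
    mean = rewards.mean(dim=-1, keepdim=True)
    std = rewards.std(dim=-1, keepdim=True)
    return (rewards - mean) / (std + eps)

def token_level_pg_loss(logprobs: torch.Tensor, old_logprobs: torch.Tensor,
                        advantages: torch.Tensor, mask: torch.Tensor,
                        clip: float = 0.2) -> torch.Tensor:
    """Clipped policy-gradient loss applied per token.
    logprobs, old_logprobs, mask: (num_rollouts, seq_len); advantages: (num_rollouts,),
    one scalar per rollout broadcast across its tokens; mask zeros out prompt/padding tokens."""
    ratio = torch.exp(logprobs - old_logprobs)
    adv = advantages.unsqueeze(-1)
    unclipped = ratio * adv
    clipped = torch.clamp(ratio, 1.0 - clip, 1.0 + clip) * adv
    per_token = -torch.minimum(unclipped, clipped)
    return (per_token * mask).sum() / mask.sum().clamp(min=1)
```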
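On the inference side, a minimal Hugging Face `transformers` loading sketch is shown below. The model identifier and generation settings are assumptions for illustration; check the official model cards and inference scripts for the exact names and recommended parameters.

```python
# Minimal inference sketch using Hugging Face transformers (illustrative; verify the
# model id and recommended generation settings against the official model card).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Alibaba-NLP/Tongyi-DeepResearch-30B-A3B"  # assumed id; confirm on the model card
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

messages = [{"role": "user", "content": "Summarize recent work on agentic web research."}]
inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True,
                                       return_tensors="pt").to(model.device)
outputs = model.generate(inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```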