AnyTool

An open-source implementation of self-reflective, hierarchical agents designed for large-scale API calls and evaluation.

Author: Dyabel

Since: 2024-02-05

Visit Website GitHub

Detailed Introduction

AnyTool is an open-source implementation of the paper “AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls.” It provides a framework and experimental pipelines for building self-reflective, hierarchical agents that select, compose, and verify API calls across large tool sets. The repository includes data preparation steps for ToolBench, preprocessing scripts, example AnyToolBench data, and evaluation scripts used to reproduce the results reported in the paper.

Main Features

Self-reflective strategy: agents perform iterative self-checks to improve execution quality.
Hierarchical design: separates retrieval, planning, and solver layers for modularity.
Tool and dataset support: integrates ToolBench and AnyToolBench sample data with preprocessing utilities.
Reproducible experiments: includes commands and output conventions to reproduce paper results.

Use Cases

Multi-API orchestration: selecting and composing multiple APIs to answer complex queries.
Research & benchmarking: provide a baseline implementation for comparing self-reflective strategies.
Education & reproduction: a resource for learning hierarchical agent design and reproducing experiments.
Tool-integration prototyping: quickly prototype agentic systems that coordinate many external APIs.

Technical Features

Scalable API retrieval and scheduling pipelines to handle large tool collections.
Combines modern LLMs (e.g., GPT-4) with solver components for joint solution and validation.
Data preprocessing, AnyToolBench generation, and evaluation scripts for reproducible research.
Apache-2.0 licensed for research and commercial use.

AnyTool

Detailed Introduction

Main Features

Use Cases

Technical Features

Resource Info

Related Resources

OpenMCP Client

Swarms

karpathy