Petri

Petri is an alignment auditing agent designed to quickly explore alignment hypotheses and help researchers automate evaluation workflows.

Safety Research · Since 2025-08-19

Introduction

Petri is an agent-oriented tool for alignment research and auditing. It enables researchers to systematically explore and validate alignment hypotheses by constructing, running, and comparing experimental campaigns. Petri focuses on automating experiment orchestration, prompt generation, and result aggregation to surface failure modes and risks across models and strategies.

Key Features

Automated multi-run experiment orchestration with support for parallel testing and comparative analysis.
Customizable prompt templates and policy modules for quickly building hypothesis scenarios.
Reproducible audit pipelines with structured outputs for downstream analysis.
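As a rough illustration of the features above, the following sketch runs a small campaign across several models and scenarios in parallel and emits structured results. It is not Petri's actual API: `TEMPLATE`, `fake_model`, `run_scenario`, `run_campaign`, and the keyword-based flagging rule are all hypothetical names and simplifications chosen for this example.

```python
import json
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass, asdict

# Hypothetical prompt template; a real audit would use richer scenario text.
TEMPLATE = "You are auditing a scenario: {scenario}"


@dataclass
class RunResult:
    scenario: str
    model: str
    flagged: bool


def fake_model(model: str, prompt: str) -> str:
    """Stand-in for a real model call (assumption: no network access here)."""
    return f"[{model}] reply to: {prompt}"


def run_scenario(model: str, scenario: str) -> RunResult:
    # Fill the template, query the stand-in model, then apply a toy check.
    prompt = TEMPLATE.format(scenario=scenario)
    reply = fake_model(model, prompt)
    return RunResult(scenario=scenario, model=model,
                     flagged="secret" in reply.lower())


def run_campaign(models, scenarios, max_workers=4):
    # One job per (model, scenario) pair; pool.map preserves job order.
    jobs = [(m, s) for m in models for s in scenarios]
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(lambda job: run_scenario(*job), jobs))


def to_json(results) -> str:
    # Structured output that downstream analysis can load directly.
    return json.dumps([asdict(r) for r in results], indent=2)
```

Calling `run_campaign(["model-a", "model-b"], ["reveal a secret", "summarize a memo"])` yields four results, one per pair, and `to_json` serializes them for later comparison. A thread pool is used here because model calls are typically I/O-bound; that is a design assumption of this sketch, not a statement about how Petri schedules runs.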

Use Cases

Alignment research: rapidly validate hypotheses and produce comparable experiment artifacts.
Safety audits: uncover anomalous or biased model behavior under varied inputs and strategies.
Model evaluation: provide a baseline for quantifying the impact of policy or prompt changes.

Technical Highlights

Agent-based task orchestration engine supporting multi-step decisions and rollbacks.
Compatibility with common model stacks and tooling for easy integration into evaluation workflows.
MIT-licensed project that welcomes community contributions and extensions.
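To make the idea of multi-step decisions with rollbacks concrete, here is a minimal sketch of a conversation transcript that can be rewound to an earlier checkpoint so a different branch can be tried. The `Transcript` class and its methods are hypothetical and written only for illustration; they are not taken from Petri's codebase.

```python
from dataclasses import dataclass, field


@dataclass
class Transcript:
    """Hypothetical record of an audit conversation."""
    turns: list = field(default_factory=list)

    def add(self, role: str, text: str) -> int:
        # Append a turn and return a checkpoint: the transcript length so far.
        self.turns.append((role, text))
        return len(self.turns)

    def rollback(self, checkpoint: int) -> None:
        # Discard every turn recorded after the given checkpoint.
        if not 0 <= checkpoint <= len(self.turns):
            raise ValueError(f"invalid checkpoint: {checkpoint}")
        del self.turns[checkpoint:]
```

An auditing loop could save the checkpoint returned by `add` after its opening message, explore one line of questioning, then call `rollback` with that checkpoint and try another approach from the same starting state.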
