Welcome to ACME Docs

ACME builds the reliability layer for AI agent systems. We help operators answer the question every team running agents eventually asks:

“My agents are running. But are they reliable?”

The Problem

Standard monitoring catches crashes. But agents fail in ways that don’t crash:

These failures don’t trigger alerts. They just… happen. And you find out when a user complains or a workflow silently stops.

We provide tools across the reliability lifecycle:

Phase	Tool	What It Does
Detect	RadCheck	Read-only scan, 0-100 reliability score
Triage	OCTriage	Entry-point terminal for incident assessment
Protect	Sentinel	Runtime guardrails for stalls and silent failures
Control	Agent911	Unified control plane with recovery guidance

The fastest path to understanding your agent reliability:

Install observational mode. Get your first RadCheck score.

Understand what RadCheck scans and how scores work.

ACME follows a trust-first model:

Free tools build trust. Paid tools solve the problem at scale.