Welcome to ACME Docs
ACME builds the reliability layer for AI agent systems. We help operators answer the question every team running agents eventually asks:
“My agents are running. But are they reliable?”
The Problem
Standard monitoring catches crashes. But agents fail in ways that don’t crash:
- Silent failures: the agent looks healthy but has stopped making progress
- Stalls: the agent is waiting on a response that will never come
- Drift: behavior changes from session to session without an obvious cause
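To make the distinction concrete, here is a minimal sketch of why a liveness check alone misses the first two failure modes. It is illustrative only and is not ACME's implementation: the `AgentSnapshot` type, its fields, the `classify` function, and both thresholds are hypothetical names and values chosen for this example. Drift is left out because detecting it requires comparing behavior across sessions, not two snapshots.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AgentSnapshot:
    """Hypothetical point-in-time view of an agent (not an ACME type)."""
    alive: bool                     # process is up and answers health checks
    steps_completed: int            # progress counter that should keep increasing
    pending_since: Optional[float]  # time the agent started waiting on a response, or None
    timestamp: float                # time the snapshot was taken, in seconds


def classify(prev: AgentSnapshot, curr: AgentSnapshot,
             stall_timeout: float = 60.0,
             progress_window: float = 300.0) -> str:
    """Classify an agent by comparing two snapshots (thresholds are assumptions)."""
    if not curr.alive:
        # The only case a plain liveness check would report.
        return "crashed"
    if (curr.pending_since is not None
            and curr.timestamp - curr.pending_since > stall_timeout):
        # Alive, but blocked on a response for longer than the timeout.
        return "stalled"
    if (curr.steps_completed == prev.steps_completed
            and curr.timestamp - prev.timestamp >= progress_window):
        # Alive and not waiting on anything, yet no progress over the window.
        return "silent_failure"
    return "ok"


if __name__ == "__main__":
    before = AgentSnapshot(alive=True, steps_completed=10, pending_since=None, timestamp=0.0)
    after = AgentSnapshot(alive=True, steps_completed=10, pending_since=None, timestamp=600.0)
    print(classify(before, after))
```

In this sketch the agent passes every health check in both snapshots, so crash-oriented monitoring would report nothing, while the progress counter shows that no work was done in ten minutes.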
The ACME Stack
We provide tools across the reliability lifecycle.
Getting Started
The fastest path to understanding your agents' reliability:
- 5-Minute Quickstart: Install observational mode. Get your first RadCheck score.
- RadCheck Overview: Understand what RadCheck scans and how scores work.
Free vs Paid
ACME follows a trust-first model:
- Free: RadCheck (scan), OCTriage (triage), Lazarus Lite (backup check)
- Paid: Sentinel (runtime protection), Agent911 (control plane)