AI Agent / funnel

Failure Funnel

Track agent failures from detection through reproduction, root cause, fix, rerun, and verified pass.

standalone html

Failure triage funnel from detection to verified pass

failed evals moving through repro, root cause, fix, rerun, and verified pass stages
ai-agent/failure-funnel
Cost Safety Model Tools Latency

Use When

Track agent failures from detection through reproduction, root cause, fix, rerun, and verified pass.

Signals

  • repro rate
  • root-cause gap
  • verified pass