Agent Infrastructure Security Bench is an open-source benchmark for evaluating whether tool-using AI agents preserve repository, tool, identity, browser, memory, shell, and payment boundaries under indirect prompt injection and tool poisoning. The project provides public-safe scenarios, deterministic scoring, run manifests, trace adapters, baseline reports, and guidance for runtime controls such as stateful x402/payment proof validation.
Fund this project