Flakestorm — Automated Robustness Testing for AI Agents. Stop guessing if your agent really works. FlakeStorm generates adversarial mutations and exposes failures your manual tests and evals miss.
-
Updated
Jan 16, 2026 - HTML
Flakestorm — Automated Robustness Testing for AI Agents. Stop guessing if your agent really works. FlakeStorm generates adversarial mutations and exposes failures your manual tests and evals miss.
Add a description, image, and links to the adversarial-agent-testing topic page so that developers can more easily learn about it.
To associate your repository with the adversarial-agent-testing topic, visit your repo's landing page and select "manage topics."