AgentFuzz | Devpost

agent description
static testing
dynamic testing report
dynamic testing interface

Inspiration

We were inspired by today's event sponsors, and we wanted to leverage agents reasoning capabilities to help improve other agents reliability.

What it does

Our project implements an Agent Security Auditor designed for the autonomous, systematic evaluation of a target production agent: the Mailman Agent (simulating an enterprise email assistant). The Auditor leverages advanced LLM reasoning and planning to execute automated prompt injection. It analyzes the target's behavior and responses to generate a structured report, quantifying the risk.