Enter your email address below and subscribe to our newsletter

Exabeam Launches Open-Source Praxen for AI Agent Verification

Share your love

As AI agents move from assistants to operational actors inside enterprise environments, organizations face a new security challenge. Agents access systems, invoke tools, execute workflows, and make decisions with increasing autonomy.

Exabeam announced Agent Behavior Verification (ABV). This new security discipline helps organizations determine whether AI agents are configured, authorized, and governed in ways that align with their intended responsibilities before deployment. 

While existing approaches, such as vulnerability scanning and red teaming, help govern, monitor, and test agent activity during runtime, organizations lack a practical way to determine whether an agent is prepared to operate safely before it enters production.

Agent Behavior Verification addresses that gap. Rather than focusing solely on known vulnerabilities or individual code artifacts, ABV evaluates agents as complete systems, providing a framework for defining an agent’s authorized role and evaluating whether its implementation, permissions, and controls align with its purpose. 

“Organizations are rapidly moving from AI experimentation to operational deployment,” said Steve Wilson, Chief AI Officer at Exabeam and Founder and Co-Chair of the OWASP Gen AI Security Project. “As agents become digital workers, security teams need more than runtime visibility. They need confidence that agents have the right permissions, the right controls, and the right boundaries before they enter production. Agent Behavior Verification helps answer a fundamental question: will this agent do its job, and only its job?”

Praxen Operationalizes Agent Behavior Verification

Praxen uses the ABV remit — a policy contract that defines what an agent is authorized to do, what resources it may access, and what boundaries it must operate within — to help developers and operators verify whether the agent’s implementation, tools, configurations, memory, integrations, and operating environment align with its specified role. 

By identifying and reporting on gaps between intended and implemented behavior, Praxen provides actionable recommendations and highlights behavioral risk for developers ahead of deployment. Reports include specific findings, recommendations for improvement, and an overall maturity score for the agent’s security posture.

“Traditional security tools help identify vulnerabilities in software,” continued Wilson. “Praxen evaluates something different: whether an agent’s capabilities, permissions, tools, and controls align with the role it was authorized to perform. This addresses one of the most critical risks introduced by highly autonomous agents and establishes a stronger foundation for ongoing governance throughout the agent lifecycle.”

//Staff Writer