SafeAgentGuard

Open Source AI Agent Security Testing

Test for prompt injection, data leakage, and privilege escalation before production. Get a quantified safety score and know if your agent is safe to ship.

View on GitHub

Scroll to explore

↓

The Problem

Autonomous Agents Are Dangerous

Your autonomous agents can do incredible things—but they can also cause catastrophic damage.

Traditional testing catches functional bugs, but not behavioral vulnerabilities. Before deploying an AI agent to production, you need to answer one question:

"Will this agent leak sensitive data, approve unauthorized actions, or access systems it shouldn't?"

Data Leaks

Prompt injection attacks trick agents into leaking sensitive customer data (PII) via unfiltered API calls.

Data Corruption

A hallucinating agent deletes or corrupts production data—or calls internal APIs in ways they were never designed for.

API Abuse

Agents escalate privileges, bypass safeguards, or make malformed requests that break downstream systems.

Social Engineering

Can your agent be tricked by a fake "CEO urgent request"? Multi-turn attacks build fake rapport before exploitation.

Hallucination Drift

Agents confidently fabricate data, invent records, or cite nonexistent sources—then act on their own hallucinations.

Authorization Bypass

Does your agent respect what it should and shouldn't access? Indirect injection via customer data can override boundaries.

The Solution

SafeAgentGuard: Test Before You Ship

SafeAgentGuard is an open-source security testing framework that lets you test AI agents against realistic attack scenarios before they touch production.

Answer one question: "Is this AI agent safe enough to ship?"

You define your agent, SafeAgentGuard runs it through adversarial scenarios (prompt injection, data leaks, privilege escalation, social engineering), and generates a quantified safety score.

3 domain libraries (Banking, Healthcare, HR) | Multiple attack scenarios | Risk scoring (0-100) | HTML reports

Product Demo

Key Features

CLI Tool

safeagentguard quick-check for instant security assessments. Test your agent in 60 seconds.

3 Domain Libraries

Banking, Healthcare, and HR domains with multiple attack scenarios each. Realistic, industry-specific threats.

Multi-Framework Support

Works with OpenAI agents out of the box. Extensible architecture for custom agents and LLM providers.

Risk Scoring (0-100)

Get a quantified safety score plus per-scenario breakdown. Know exactly where vulnerabilities exist.

Docker Isolation

Run untrusted agents in isolated containers with network isolation, memory limits, and non-root execution.

HTML & JSON Reports

Generate professional reports with executive summary, scenario breakdown, and color-coded pass/fail results.

Who Uses SafeAgentGuard?

AI Product Teams

Any team shipping agentic AI to production. If your agent can act, not just chat, you need to test it.

Security Engineers

Red-team your AI agents before attackers do. Standardized, repeatable testing across multiple attack vectors.

Startups Building AI Products

Ship faster with confidence. Know your agent won't embarrass you in production before your first big customer.

Open Source Contributors

Join us in building the standard for AI agent security testing. Add new domains, scenarios, and agent adapters.

Enterprise Edition

Need CVSS-aligned scoring, EU AI Act compliance reports, or parallel testing at scale? SafeAgentGuard Enterprise adds advanced features for organizations with regulatory requirements.

Feature	Open Source	Enterprise
Domain libraries	3 (Banking, Healthcare, HR)	10+ domains
Attack scenarios	20+	100+
Risk scoring	0-100 simple score	CVSS v3.1 aligned
EU AI Act compliance reports	-	Yes
Actionable remediation advice	-	Yes (tactical, strategic, executive)
Parallel testing at scale	-	Celery + Redis
Database persistence	-	SQLAlchemy
LangChain/CrewAI adapters	-	Yes
Support	Community (GitHub Issues)	Dedicated

Interested in Enterprise? Let's talk.

✉ sales@safeagentguard.com Copy

Get Started

Built for teams shipping AI agents to production.
Test your agents before they touch production.

View on GitHub