Question 1

What are the key features of Agent Safety?

Accepted Answer

Enforces API cost budgets for LLM calls. Detects prompt injection using 75 built-in patterns. Provides decision tracing for agent audit trails. Zero ML dependencies and pure Python implementation. Compatible with Claude Code, Cursor, Windsurf, and Zed.

Question 2

What can I use Agent Safety for?

Accepted Answer

Preventing runaway API costs in autonomous AI agents. Securing AI inputs against prompt injection attacks. Creating verifiable audit trails for AI decision-making. Monitoring token usage for financial market trading agents.

Question 3

What tools does Agent Safety provide?

Accepted Answer

cost_guard_configure: Set weekly budget, alert threshold, and dry-run mode.. cost_guard_status: Check current spend vs budget.. cost_guard_check: Pre-check if a model call is within budget.. cost_guard_record: Record a completed call's token usage.. cost_guard_models: List supported models with pricing.. injection_scan: Scan text for injection patterns (non-blocking).. injection_check: Scan and block if injection detected.. injection_patterns: List all 75 built-in detection patterns across 9 categories.. trace_start: Start a new trace session.. trace_step: Log a decision step with context.. trace_summary: Get session summary including steps, errors, and timing.. trace_save: Save trace to JSON and Markdown files..

Question 4

How do I install Agent Safety?

Accepted Answer

Install Agent Safety by running: claude mcp add agent-safety -- uvx agent-safety-mcp

Question 5

What are the requirements for Agent Safety?

Accepted Answer

Agent Safety requires a compatible MCP client such as Claude Desktop, Claude Code, or Cursor. No additional environment variables are needed for basic setup.

Question 6

Is Agent Safety free to use?

Accepted Answer

Yes, Agent Safety is open source and free to use. You can find the source code on GitHub.

Question 7

What MCP clients support Agent Safety?

Accepted Answer

Agent Safety works with any MCP-compatible client including Claude Desktop (Anthropic's official desktop app), Claude Code (CLI tool), Cursor, and other editors with MCP support.

Question 8

How do I configure Agent Safety?

Accepted Answer

Configure Agent Safety by adding it to your MCP client's config file. The setup block at the top of this page generates a ready-to-paste config for Claude Code, Cursor, Codex, Windsurf, and Claude Desktop.

Tool	What it does
`cost_guard_configure`	Set weekly budget, alert threshold, dry-run mode
`cost_guard_status`	Check current spend vs budget
`cost_guard_check`	Pre-check if a model call is within budget
`cost_guard_record`	Record a completed call's token usage
`cost_guard_models`	List supported models with pricing

Tool	What it does
`injection_scan`	Scan text for injection patterns (non-blocking)
`injection_check`	Scan + block if injection detected
`injection_patterns`	List all 75 built-in detection patterns across 9 categories

Tool	What it does
`trace_start`	Start a new trace session
`trace_step`	Log a decision step with context
`trace_summary`	Get session summary (steps, errors, timing)
`trace_save`	Save trace to JSON + Markdown files

Agent Safety MCP Server

Add it to Claude Code

Make your agent remember this setup

What it does

Tools 12

Try it

agent-safety-mcp

Install

Claude Code (recommended)

Manual (any MCP client)

From PyPI

Tools

Cost Guard — Budget enforcement for LLM calls

Injection Guard — Prompt injection scanner

Decision Tracer — Agent decision logging

What this wraps

Why

License

Frequently Asked Questions

Turn this server into reusable context