McpVanguard: Open-Source Security Proxy for AI Agents

McpVanguard is an open-source security proxy and firewall designed specifically for local AI agents using the Model Context Protocol (MCP). It addresses security concerns that arise when giving LLMs access to tools like terminals or filesystems.

How it works

The proxy sits between the AI agent and its MCP tools, wrapping existing MCP servers so the current setup does not have to be rewritten. It can run locally as a lightweight proxy or be deployed as a cloud gateway; a Railway template is available to simplify deployment.
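To make the interception idea concrete, here is a minimal sketch of how a proxy could inspect a message before forwarding it. It is not McpVanguard's actual code. It relies only on the fact that MCP messages are JSON-RPC 2.0 and that tool invocations use the `tools/call` method; the function name `intercept` and the `is_allowed` policy callback are hypothetical.

```python
import json
from typing import Callable, Optional


def intercept(raw: str, is_allowed: Callable[[str, dict], bool]) -> Optional[str]:
    """Return a JSON-RPC error to send back to the agent, or None to forward.

    `is_allowed` is a hypothetical policy callback, not part of McpVanguard.
    """
    msg = json.loads(raw)
    if msg.get("method") != "tools/call":
        return None  # not a tool invocation; pass through untouched
    params = msg.get("params", {})
    name = params.get("name", "")
    args = params.get("arguments", {})
    if is_allowed(name, args):
        return None  # policy accepts the call; forward to the real server
    # -32000 lies in the JSON-RPC range reserved for implementation-defined errors.
    error = {
        "jsonrpc": "2.0",
        "id": msg.get("id"),
        "error": {"code": -32000, "message": f"blocked by policy: {name}"},
    }
    return json.dumps(error)
```

A wrapper process would call something like this on each message read from the agent, writing the error back when one is returned and otherwise relaying the original message to the wrapped server.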

Security layers

Rules/signature engine: Contains around 50 YAML signatures that detect common attacks such as reverse shells, SSRF attempts, and other obvious threats. This layer adds approximately 16 ms of latency.
Semantic scoring layer (optional): Requests that look suspicious but not clearly malicious can be passed to a small LLM (via Ollama or OpenAI) that assesses their intent.
Behavioral monitoring: Blocks anomalous patterns, such as an agent attempting to read hundreds of files in a short time.
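The first and third layers can be sketched as follows. This is an illustration under stated assumptions: the three regex signatures are made up for this example (the project ships its own as YAML), and the `RateMonitor` class, its threshold, and its window are hypothetical stand-ins for whatever behavioral logic McpVanguard really uses.

```python
import re
import time
from collections import deque
from typing import List, Optional

# Hypothetical signatures for illustration; not taken from the project's YAML files.
SIGNATURES = {
    "reverse_shell": re.compile(r"/dev/tcp/|\bnc\b.*\s-e\s"),
    "ssrf_metadata": re.compile(r"169\.254\.169\.254"),
    "path_traversal": re.compile(r"\.\./"),
}


def match_signatures(text: str) -> List[str]:
    """Return the names of all signatures that match the given text."""
    return [name for name, pattern in SIGNATURES.items() if pattern.search(text)]


class RateMonitor:
    """Sliding-window counter: allow at most `max_calls` per `window_s` seconds."""

    def __init__(self, max_calls: int, window_s: float) -> None:
        self.max_calls = max_calls
        self.window_s = window_s
        self.calls = deque()  # timestamps of recently allowed calls

    def allow(self, now: Optional[float] = None) -> bool:
        now = time.monotonic() if now is None else now
        # Drop timestamps that have fallen out of the window.
        while self.calls and now - self.calls[0] > self.window_s:
            self.calls.popleft()
        if len(self.calls) >= self.max_calls:
            return False  # too many calls in the window; treat as anomalous
        self.calls.append(now)
        return True
```

In a sketch like this, a request would be rejected if `match_signatures` returns any name or if `allow` returns `False`; everything else would either be forwarded or handed to the optional semantic layer.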

Audit capabilities

Every blocked request is recorded in an immutable audit log that's cryptographically signed and stored locally, providing a verifiable record of what was blocked and why.
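The source does not say which signing scheme the audit log uses. One common way to get a tamper-evident local log is to chain entries and authenticate each one with an HMAC, sketched below. The function names, the entry layout, and the use of HMAC-SHA256 are all assumptions made for illustration, not a description of McpVanguard's format.

```python
import hashlib
import hmac
import json

GENESIS = "0" * 64  # placeholder "previous signature" for the first entry


def _sign(key: bytes, prev: str, event: dict) -> str:
    # Canonical JSON so the same content always yields the same signature.
    payload = json.dumps({"prev": prev, "event": event}, sort_keys=True)
    return hmac.new(key, payload.encode(), hashlib.sha256).hexdigest()


def append_entry(log: list, key: bytes, event: dict) -> dict:
    """Append an event, binding it to the signature of the previous entry."""
    prev = log[-1]["sig"] if log else GENESIS
    entry = {"prev": prev, "event": event, "sig": _sign(key, prev, event)}
    log.append(entry)
    return entry


def verify_log(log: list, key: bytes) -> bool:
    """Recompute the chain; any edited, removed, or reordered entry breaks it."""
    prev = GENESIS
    for entry in log:
        expected = _sign(key, prev, entry["event"])
        if entry["prev"] != prev or not hmac.compare_digest(entry["sig"], expected):
            return False
        prev = entry["sig"]
    return True
```

Because each signature covers the previous one, altering an earlier record invalidates every entry after it, which is what makes such a log verifiable after the fact. An HMAC needs a shared secret key; a design that must be checked by third parties would more likely use asymmetric signatures.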

The tool was developed to address specific security concerns with MCP implementations, including prompt injection, path traversal, and accidental directory deletion by AI agents.

📖 Read the full source: r/LocalLLaMA
