McpVanguard: Open-source security proxy for MCP-based AI agents

McpVanguard is an open-source security proxy and firewall designed specifically for local AI agents using the Model Context Protocol (MCP). It addresses security concerns that arise when giving LLMs access to tools like terminals or filesystems.
How it works
The proxy sits between the AI agent and MCP tools, wrapping existing MCP servers without requiring setup rewrites. It can run locally as a lightweight proxy or be deployed as a cloud gateway, with a Railway template available for easier deployment.
Security layers
- Rules/signature engine: Contains around 50 YAML signatures that detect common attacks like reverse shells, SSRF attempts, and other obvious threats. This layer adds approximately 16ms latency.
- Semantic scoring layer (optional): When requests appear suspicious but not clearly malicious, they can be evaluated by a small LLM (Ollama or OpenAI) that assesses intent.
- Behavioral monitoring: Blocks anomalous patterns, such as an agent attempting to read hundreds of files in a short time.
Audit capabilities
Every blocked request is recorded in an immutable audit log that's cryptographically signed and stored locally, providing a verifiable record of what was blocked and why.
The tool was developed to address specific security concerns with MCP implementations, including prompt injection, path traversal, and accidental directory deletion by AI agents.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Clawndom: A Security Hook for Claude Code to Block Vulnerable npm Packages
A developer built Clawndom, an open-source hook for Claude Code that checks npm packages against the OSV.dev vulnerability database before installation, blocking known vulnerable packages while maintaining agent autonomy.

AI Chatbots Leaking Real Phone Numbers: The PII Exposure Problem
Chatbots like Gemini, ChatGPT, and Claude are exposing real personal phone numbers due to PII in training data. DeleteMe reports a 400% increase in AI-related privacy requests in seven months.

OpenClaw Skill Analyzer: Static Security Scanner for AI Agent Skills
A developer built a static analyzer that scans OpenClaw skills for security risks before installation, with 40+ detection rules across 12 categories including prompt injection and data exfiltration.

Hidden Audio Signals Hijack Voice AI Systems with 79-96% Success Rate
Research shows imperceptible audio clips can force LALMs to execute unauthorized commands like web searches, file downloads, and email exfiltration with 79-96% success across 13 models including Mistral and Microsoft services.