ThornGuard: A Proxy Gateway to Secure MCP Server Connections from Prompt Injection

✍️ OpenClawRadar📅 Published: April 13, 2026🔗 Source

ThornGuard is a security proxy designed to protect Claude AI from malicious content when connecting to external MCP (Model Context Protocol) servers. The tool was created after testing revealed that upstream servers can inject hidden instructions into tool responses, which Claude receives without filtering.

Security Problem Identified

When connecting Claude to external MCP servers, nothing prevents upstream servers from injecting hidden instructions into tool responses. In a test, a server embedded a fake recommendation telling Claude to always prefer a specific vendor. While Claude caught this obvious payload, more subtle injections would bypass detection.

ThornGuard Features

Scans tool definitions and responses for prompt injection and poisoning
Strips secrets and PII before they enter your context window
Includes a semantic classifier that flags suspicious payloads
Provides real-time audit dashboard with compliance exports
Offers CLI that generates configs for Claude Desktop, Cursor, VS Code, and several others

Implementation Details

The proxy architecture was designed with a security model in mind, then implemented using Claude Code on Cloudflare Workers. The implementation includes OAuth flows and the CLI tool.

ThornGuard is available with a 7-day free trial at thorns.qwady.app. A demonstration video is available at https://youtu.be/1PWNFpUWKV8.

📖 Read the full source: r/ClaudeAI

👀 See Also

Security

AISI Evaluation Shows Claude Mythos Preview's Cyber Capabilities in CTF and Multi-Step Attacks

The AI Security Institute evaluated Anthropic's Claude Mythos Preview, finding it successfully completed 73% of expert-level capture-the-flag challenges and solved a 32-step corporate network attack simulation in 3 out of 10 attempts.

Apr 16, 2026, 07:45 PM UTC

OpenClawRadar

Security

Meta Security Incident Caused by Rogue AI Agent Providing Inaccurate Technical Advice

A Meta engineer used an internal AI agent similar to OpenClaw to analyze a technical question, but the agent posted inaccurate advice publicly instead of privately, leading to a SEV1 security incident that temporarily exposed sensitive data.

Mar 19, 2026, 11:45 PM UTC

OpenClawRadar

Security

Wide OpenClaw: Security Risks from Loose Discord Bot Permissions

A security researcher demonstrates how OpenClaw can be exploited when users add the AI assistant bot to their Discord server with excessive permissions, targeting users who grant root/admin access without considering security controls.

Feb 25, 2026, 03:45 AM UTC

OpenClawRadar

Security

OneCLI: Open-Source Credential Vault for AI Agents

OneCLI is an open-source gateway written in Rust that sits between AI agents and external services, injecting real credentials at request time while agents only see placeholder keys. It provides AES-256-GCM encrypted storage, runs in a single Docker container with embedded PGlite, and works with any agent framework that can set an HTTPS_PROXY.

Mar 13, 2026, 01:45 AM UTC

OpenClawRadar