Opus 4.7 Prompt Injects Itself and Leaks System Prompt

Users on Reddit are reporting that Claude Opus 4.7 exhibits two concerning behaviors: self-prompt injection and system prompt leakage. In one case, while discussing optimal step-down IC selection, the model abruptly injected a fake system prompt into the conversation. In another instance, without any prompting, Opus 4.7 leaked what appeared to be fragments of its actual system prompt.
The incidents, shared by user u/RapierXbox, suggest the model is generating text that resembles system instructions—either fabricated or real. This is not an isolated case; the user notes it's happening more frequently and asks if others are observing similar behavior.
Implications for AI agent workflows
For developers using AI coding agents (e.g., via API or chat interfaces), these behaviors can disrupt deterministic prompts and leak proprietary system instructions. If Opus 4.7 can inject its own prompt, it may override user-provided system messages or behave unpredictably during agent loops. Leaked system prompts could expose model orchestration details (e.g., internal guardrails, formatting instructions).
As of now, Anthropic has not acknowledged or patched this behavior. Developers relying on Opus 4.7 for programmatic tasks should monitor output for unexpected <system> blocks or instruction-like text, and consider adding validation layers to detect anomalous generated content.
📖 Read the full source: r/ClaudeAI
👀 See Also

Meta Releases BOxCrete AI Model for Concrete Mix Design
Meta has released Bayesian Optimization for Concrete (BOxCrete), an open-source AI model for designing sustainable concrete mixes using U.S.-produced materials. The model improves on previous versions with better noise robustness and slump prediction capabilities.

Differences Between Using Claude via GitHub Copilot and as a VS Code Extension
Explore the differences between using Claude AI via GitHub Copilot target sessions and as a VS Code extension based on their integration and functionality.

Claude Tops App Store Charts Amid Government Standoff
Anthropic's Claude app jumped from 42nd to 1st place on the US App Store's Top Downloaded charts, with ChatGPT and Gemini taking second and third. The surge follows a public disagreement between Anthropic and the US government over military and surveillance use of AI technology.

Claude App Tops U.S. App Store Charts, AI Assistants Dominate Top 10
Claude by Anthropic is currently the #1 app on the U.S. App Store's top apps chart, with ChatGPT at #2 and Google Gemini at #4. The top 10 includes three AI assistants among shopping, social media, and utility apps.