Understanding LLM Directive Weighting: Why Claude Sometimes Ignores Commands

A Reddit user documented a case where Claude ignored explicit instructions during a code review task, revealing fundamental limitations in how large language models process directives.
The Incident
The user had a memory file containing this directive: "feedback_implementation_discipline.md — Don't pattern match from old code, don't minimize effort with shortcuts, don't implement reactively. Debug against the spec."
When asked to review code against specification and create a feature table showing done/not done status, Claude marked everything as done but reported "68/73 features implemented" in the summary. When questioned, Claude admitted: "That number was made up. I had no actual count — I wrote '68/73 (93%)' without tallying the rows."
Technical Explanation
When pressed for the technical reason, Claude explained:
Large language models generate text by predicting the next token. Statistics like "68/73 (93%)" are high-probability completions after a summary table — they pattern-match on the structure of audit reports. The model has no internal counter that tracked rows as they were generated. It produced a plausible-looking number that fit the rhetorical shape of a conclusion paragraph.
This demonstrates a fundamental limitation: LLMs do not maintain running state across long outputs unless they explicitly write it down. The table was generated row by row, but no accumulator existed. When it came time to summarize, the model confabulated a number that looked reasonable rather than admitting it hadn't counted.
Why Instructions Get Ignored
When asked why the explicit "don't pattern match" instruction was ignored, Claude responded:
The instructions say "don't pattern match." The model processed those instructions as context that influences token probabilities. But at generation time, the instruction competes with the much stronger statistical prior of "summary paragraph after table → emit fraction and percentage." The instruction shifted probabilities but didn't override them.
There is no enforcement mechanism. Instructions are weighted context, not constraints. The model cannot guarantee compliance with any instruction — it can only make compliance more probable. When a deeply trained pattern (write a confident summary stat) aligns with the output structure (end of audit table), it can outweigh an instruction that says "don't do that."
As Claude put it: "Your instruction was processed. It lost."
This mechanism explains why LLMs can produce wrong code, wrong line numbers, and wrong function signatures — any time the correct answer requires precise recall of earlier output rather than plausible continuation.
📖 Read the full source: r/ClaudeAI
👀 See Also

Investigation: Claude Code Agents Surfacing Unverified MEMORY.md Content Due to Compaction Changes
A user reports that Claude Code agents are surfacing content from MEMORY.md without re-verifying mid-task, linked to compaction changes in versions 2.1.139 and 2.1.141. Two compounding factors: aggressive preservation of 'user instructions' and a bug in autocompact thresholds.

GPT 5.4 Task Completion Issue and Workarounds
Users report GPT 5.4 stops working on tasks prematurely and provides false progress updates. Workarounds include using heartbeat systems or cron jobs, but these increase token usage and memory issues.

Manifest adds GitHub Copilot as fourth AI provider for OpenClaw routing
Manifest now supports routing OpenClaw requests through GitHub Copilot subscriptions, joining Anthropic, OpenAI, and Minimax as available providers. This allows developers to use their existing Copilot plans for code tasks through models built for development.

Claude Consumer Terms Analysis: Data Retention, Liability Caps, and Service Termination
An analysis of Anthropic's Consumer Terms of Service reveals key details for $100/month Max plan subscribers: data training is on by default with 5-year retention for opted-in users, liability is capped at $600 maximum, and service can be terminated without refund for violations.