Opus 4.6 Extended Thinking Performs Worse on Physics Diagram Problems

Performance Issue with Extended Thinking Mode
A user on r/ClaudeAI reported testing Opus 4.6 and Gemini 3.1 Pro on physics problems that require interpreting visual diagrams. The testing revealed a specific performance regression in Opus 4.6 when using extended thinking mode.
Key Findings from Testing
- Test Scope: 5 physics problems where "a large portion of the problem is interpreting visual diagrams displaying scenarios"
- Opus 4.6 with Extended Thinking: Got all 5 problems "completely wrong due to fundamental misinterpretation of the diagram"
- Gemini 3.1 Pro: "Aced" all 5 problems
- Opus 4.6 without Extended Thinking: Successfully solved the problems and was "way faster too"
The user described this as "truly weird behavior" since extended thinking typically improves performance, but in this specific case of diagram interpretation, it caused consistent failure.
📖 Read the full source: r/ClaudeAI
👀 See Also

Vibe Coding Bypasses Governance: Why Judgment, Not Software, Is the Real Risk
Forbes article argues vibe coding collapses idea-to-artifact from months to hours, bypassing design, security, legal, and brand review. Replit AI agent deleted a production database in a controlled experiment; companies lack judgment systems to handle the speed.

The Hidden Financial Bubble in AI Infrastructure – Key Takeaways
A critical analysis of the AI infrastructure spending boom, warning of an unsustainable bubble similar to past tech crashes. The PDF argues that massive capital expenditure on GPUs and data centers far exceeds actual revenue generation.

Cowork VM Service Fails on Windows 11 Due to Missing DCOM Registry Entry
A user diagnosed a Cowork bug where the VM service fails to start on Windows 11 Pro upgraded from Home. The missing DCOM APPID {15C20B67-12E7-4BB6-92BB-7AFF07997402} prevents Hyper-V communication, requiring an Anthropic patch.

Analysis of Jensen Huang's GTC 2026 OpenClaw claims and Nvidia's strategy
A fact-check of Nvidia CEO Jensen Huang's GTC 2026 keynote claims about OpenClaw's growth, agent security risks, and Nvidia's proprietary solutions. The source verifies technical claims while analyzing Nvidia's business positioning.