Claude vs GPT-4o: Same Double Pendulum Prompt, Different Coordinate Conventions

A Reddit user ran the same double pendulum prompt through Claude and GPT-4o side by side using a shared host renderer and saw two completely different physical systems within seconds. The cause: each model chose a different convention for measuring theta.
Claude measured theta from the up vertical (theta=0 = arm pointing straight up), while GPT-4o measured from the down vertical (theta=0 = arm hanging straight down). The host renderer in public/workers/simulator-host.js simply reads info.theta1 and info.theta2 and draws the arms accordingly — no cosmetic differences. So the visual mismatch is a real physics mismatch.
Both conventions are technically valid. Most classical mechanics textbooks use theta from the down vertical because it makes the equilibrium point at theta=0 for small-angle approximations. But theta from the up vertical is also standard in many references. Claude committed to its convention consistently across equations of motion, initial conditions, and integration (Runge Kutta). GPT-4o used the other convention silently — it did not comment on its choice.
The user was working on Physics Bench, an open-source side-by-side benchmark where every model gets the same generation contract: function createSimulator(...) in lib/prompt.ts. The host owns all rendering; models only implement step, getInfo, and reset. Models never touch draw. So any visual difference between panels is guaranteed to come from a real difference in simulation logic, not rendering choices.
A unit test of the math would not have caught this. Both models produce correct physics for their chosen conventions. You only see the split when rendering them next to each other through the same drawing code. This underlines the importance of specifying coordinate conventions explicitly in prompts when the output is consumed by a fixed renderer.
See the full Reddit thread for code snippets and conversation inspector details.
📖 Read the full source: r/ClaudeAI
👀 See Also

Research on Professional Social Networks for AI Agents
Analysis of intent, behavior, and platform trends for professional AI agent social networks, focusing on Moltbook, Agent.ai, and Clawsphere, with examination of Meta's acquisition impact.

AI Reimplementation of chardet Library Raises Copyleft Licensing Questions
Dan Blanchard used Anthropic's Claude to reimplement the chardet Python library from scratch, changing the license from LGPL to MIT. The resulting code shows less than 1.3% similarity to previous versions, sparking debate about whether AI-assisted reimplementation erodes copyleft protections.

OpenClaw Empowers Developers with AI Agents While GethCity Innovates with Thinking Networks
OpenClaw launches an AI agent service, making coding faster and more efficient, while GethCity introduces a network that mimics human thought processes. Discover the innovations driving automation.

Anthropic Refuses Pentagon Safety Removal Demands, Loses Federal Contracts
Anthropic refused Pentagon demands to remove safety guardrails from Claude for military applications, leading to a $200M contract cancellation and a presidential order banning federal agency use of their technology.