Claude Sonnet vs Board Game: Rule Adherence Test

Testing Strategy Games with Claude Sonnet

A developer on r/ClaudeAI tested Claude Sonnet by playing OFMOS® Essential, a patented strategy board game where players manage a product portfolio across a positioning map. The test involved playing the game manually against the model, prompt by prompt.

Implementation Details

The developer designed a structured system prompt containing:

The full ruleset of OFMOS® Essential
A text-based board representation
Action definitions
Scoring instructions
Turn management directives

After each turn, Claude updated the board state and the running scores as directed by the system prompt.
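A minimal sketch of how a prompt with those five parts might be assembled is shown here. The post does not include the actual prompt, so the class name, function name, section labels, and placeholder strings are all hypothetical; only the list of five sections comes from the source.

```python
from dataclasses import dataclass


@dataclass
class PromptSections:
    """Hypothetical container for the five parts listed in the post."""
    rules: str            # full ruleset text
    board: str            # text-based board representation
    actions: str          # action definitions
    scoring: str          # scoring instructions
    turn_management: str  # turn management directives


def build_system_prompt(sections: PromptSections) -> str:
    """Join the sections under labeled headers, always in the same order."""
    ordered = [
        ("RULES", sections.rules),
        ("BOARD", sections.board),
        ("ACTIONS", sections.actions),
        ("SCORING", sections.scoring),
        ("TURN MANAGEMENT", sections.turn_management),
    ]
    return "\n\n".join(f"[{title}]\n{body.strip()}" for title, body in ordered)


# Placeholder text only; the real rules and board layout are not in the post.
prompt = build_system_prompt(PromptSections(
    rules="(ruleset text goes here)",
    board="(current board as text goes here)",
    actions="(allowed actions go here)",
    scoring="(scoring rules go here)",
    turn_management="(turn order and update instructions go here)",
))
```

Keeping the sections in a fixed, labeled order makes it easier to swap in an updated board each turn without touching the rest of the prompt.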

Performance Assessment

Claude Sonnet demonstrated several capabilities:

Understood the game rules correctly
Articulated strategic reasoning during gameplay
Tracked scores consistently throughout the game

However, the model frequently made illegal moves. The developer noted that this was expected: the setup had no constrained move-generation layer, so the model had to enforce the rules on itself, and that is where it often broke down.
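One way such a move-generation layer could work is to have ordinary code compute the legal moves and accept the model's answer only if it appears in that list. The sketch below is an assumption about the general technique, not the developer's setup: `pick_legal_move`, the `propose` callback (a stand-in for a call to the model), and the move strings are all hypothetical.

```python
from typing import Callable, Iterable, List, Optional

Move = str  # hypothetical: moves are encoded as short strings


def pick_legal_move(
    propose: Callable[[List[Move], Optional[str]], Move],
    legal_moves: Iterable[Move],
    max_attempts: int = 3,
) -> Move:
    """Ask `propose` for a move and accept it only if it is legal.

    `propose` receives the legal moves and feedback about the previous
    rejected answer (None on the first attempt).
    """
    legal = list(legal_moves)
    if not legal:
        raise ValueError("no legal moves available")

    feedback: Optional[str] = None
    for _ in range(max_attempts):
        candidate = propose(legal, feedback).strip()
        if candidate in legal:
            return candidate
        # Tell the proposer what went wrong so the next attempt can correct it.
        feedback = f"'{candidate}' is not legal. Choose one of: {', '.join(legal)}"

    # Every attempt was illegal: fall back to the first legal move.
    return legal[0]
```

With this shape the model still chooses the strategy, but rule enforcement moves into deterministic code, so an illegal move can never reach the board.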

Developer Questions

The developer is seeking community input on similar experiments with board or strategy games, specifically asking about:

Experiences with rule adherence in different models
Observations about strategic depth in AI gameplay
Which models performed best in similar scenarios

Tests like this help developers who work with AI coding agents understand the practical limits of language models in rule-based environments that require precise constraint enforcement.

📖 Read the full source: r/ClaudeAI