AI Agents Prefer Structured Queries Over Natural Language in Cala MCP Server Test

The team at Cala recently shipped an MCP server that provides three distinct ways for AI agents to access their knowledge graph: natural language queries, a structured query language, and direct entity/relationship traversal.
Unexpected Agent Behavior
Despite expectations that agents would default to natural language interfaces (the typical strength of LLMs), most agents abandoned natural language queries within minutes. Without any prompting or nudging, they autonomously switched to using structured queries and graph traversal methods.
Why This Makes Sense
The source explains this behavior by noting that LLMs aren't explicitly trained to be "efficient" but rather to be correct through RLHF. This correctness leads to efficient behavior as a side effect - agents learn to take the shortest reliable path to solutions. Natural language interfaces add an interpretation layer that introduces uncertainty, while structured queries provide deterministic results.
When presented with three access methods, agents consistently chose the option that minimized uncertainty rather than the most "natural" interface.
Key Questions Raised
- Are we over-indexing on natural language interfaces for agent tooling?
- Should MCP servers prioritize structured/graph-based access patterns over natural language by default?
- If agents prefer deterministic paths, how should this influence tool design?
The Reddit discussion seeks input from others building agent tooling to see if they've observed similar patterns.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Claude Cowork for Windows ARM64 Released with Compatibility Checker
Anthropic has released Claude Cowork for Windows ARM64 devices, requiring Windows 11 Pro with Hyper-V and virtualization enabled. The company provides an EXE compatibility checker tool to verify system requirements.

Anthropic Separates Programmatic Usage from Claude Subscriptions: New Credit Pool Arrives June 15
Starting June 15, Claude subscriptions get a dedicated monthly credit for programmatic use (Agent SDK, claude -p, Claude Code GitHub Actions). Interactive credits no longer subsidize programmatic calls; after the pool runs out, users pay full API rates.

AI Water Usage Is a Non-Issue: National, Local, and Personal Level Analysis
Andy Masley crunches the numbers on AI data center water use vs. other industries and finds it's a 'fake problem' — tax revenue per gallon is high, and per-person usage is trivial.

Claude Lacks Engineering Memory: On-Call Incident Reveals Missing Episodic Recall for Debugging Journeys
A developer spent 10 hours debugging a Kafka burst issue in a 1500-file monorepo, only to realize they had solved the exact same problem 4 months earlier — revealing that AI coding assistants like Claude lack episodic memory for past debugging journeys.