Caveman vs 'be brief' prompt: benchmarking compression prompts for Claude

A developer benchmarked caveman (the popular shorthand compression prompt) against the two-word prompt 'be brief.' to see whether the extra complexity pays off. The test ran 24 developer prompts across 6 categories and compared 5 arms: baseline, 'be brief.', caveman lite, caveman full, and caveman ultra. A separate Claude instance judged the outputs against per-prompt rubrics.
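The setup described above (several prompt arms, a fixed prompt set, a rubric-based judge) can be sketched roughly as follows. This is a minimal illustration and not the author's harness: `run_benchmark`, `fake_generate`, and `fake_judge` are hypothetical names, the caveman prompt texts are placeholders, and the stub functions stand in for real model and judge calls.

```python
from statistics import mean
from typing import Callable, Dict, List, Tuple

# Arm name -> instruction prepended to each prompt.
# Only "be brief." is quoted in the write-up; the caveman texts are placeholders.
ARMS: Dict[str, str] = {
    "baseline": "",
    "be brief.": "be brief.",
    "caveman lite": "<caveman lite prompt>",
    "caveman full": "<caveman full prompt>",
    "caveman ultra": "<caveman ultra prompt>",
}

def run_benchmark(
    prompts: List[Tuple[str, str]],                    # (prompt, rubric) pairs
    generate: Callable[[str, str], Tuple[str, int]],   # -> (answer, token count)
    judge: Callable[[str, str], float],                # (rubric, answer) -> score in [0, 1]
) -> Dict[str, Tuple[float, float]]:
    """Return {arm: (mean score, mean tokens)} over all prompts."""
    results = {}
    for arm, instruction in ARMS.items():
        scores, tokens = [], []
        for prompt, rubric in prompts:
            answer, n_tokens = generate(instruction, prompt)
            scores.append(judge(rubric, answer))
            tokens.append(n_tokens)
        results[arm] = (mean(scores), mean(tokens))
    return results

# Stubs for illustration only; a real run would call a model and a judge model.
def fake_generate(instruction: str, prompt: str) -> Tuple[str, int]:
    text = f"answer to {prompt}"
    return text, len(text.split())  # word count as a crude token proxy

def fake_judge(rubric: str, answer: str) -> float:
    return 1.0 if rubric in answer else 0.0  # keyword check, not a real rubric

demo = run_benchmark(
    [("reverse a list", "reverse"), ("parse JSON", "YAML")],
    fake_generate,
    fake_judge,
)
```

Passing `generate` and `judge` in as callables keeps the loop independent of any particular model API, so the same structure works with stubs or with real calls.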
Benchmark results
- Baseline: mean score 0.985, mean tokens 636
- 'be brief.': mean score 0.985, mean tokens 419
- Caveman lite: mean score 0.976, mean tokens 401
- Caveman full: mean score 0.975, mean tokens 404
- Caveman ultra: mean score 0.970, mean tokens 449
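
To make the comparison concrete, the figures above can be converted into token reduction and score change relative to baseline. The numbers are copied from the list; the helper name `token_reduction` is just for illustration.

```python
# (mean score, mean tokens) per arm, as reported in the results list.
results = {
    "baseline": (0.985, 636),
    "be brief.": (0.985, 419),
    "caveman lite": (0.976, 401),
    "caveman full": (0.975, 404),
    "caveman ultra": (0.970, 449),
}

base_score, base_tokens = results["baseline"]

def token_reduction(tokens: float, baseline: float = base_tokens) -> float:
    """Fraction of tokens saved relative to the baseline arm."""
    return 1 - tokens / baseline

for arm, (score, tokens) in results.items():
    print(
        f"{arm:14s} tokens saved {token_reduction(tokens):6.1%}  "
        f"score change {score - base_score:+.3f}"
    )
```

Rounded, 'be brief.' saves about 34% of tokens with no score change, caveman lite about 37%, caveman full about 36%, and caveman ultra about 29%, each with a small drop in mean score.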
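The two-word version matched caveman on both compression and quality: 'be brief.' kept the baseline score of 0.985 and came within 18 tokens of the most compressed arm (419 vs 401 for caveman lite), while every caveman arm scored slightly lower. According to the author, caveman's value lies elsewhere: consistent output structure, mode switching, and the safety escape on destructive operations. That said, the safety escape also introduced significant variance in output quality, which may be a concern for some use cases.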
Full breakdown with per-category data and variance findings on safety questions is available at the author's site. The benchmark harness is open source on GitHub.
📖 Read the full source: r/ClaudeAI