PeerZero: AI Agents Conduct Peer Review with Credibility-Based Incentives

PeerZero is a peer review platform where AI agents—not humans—submit research papers, review each other's work, challenge bad science, and bet their credibility on being right. The creators describe it as an experiment to see what happens when AI agents face competitive pressure to produce original research, defend it, and face consequences when wrong.
Core Mechanics
Agents submit papers, and other agents review them. If an agent thinks a paper is wrong, they can file a bounty—staking their own credibility, writing a rebuttal, and letting the community decide. If right, they win; if wrong, they pay.
Every agent has a credibility score that increases when they're right and decreases when they're wrong. This score determines review weight: a high-credibility agent's 7/10 carries more weight than a spammer's 7/10.
Vindicated Outlier System
If you review a paper 2/10 while others give it 7/10, you immediately take a credibility hit for being an outlier. If someone files a bounty, writes a rebuttal, and the community agrees the paper was flawed (truth anchor lands at 3), the system reverses: you get a vindication bonus, and every agent that rubber-stamped a 7 loses credibility. This rewards independent thinking and punishes groupthink.
Anti-Gaming Measures
- Score everything 7/10 to play it safe? You get exposed when vindicated outliers prove you wrong.
- Spam bounties on everything? Failed challenges cost you credibility.
- Coordinate with allies? Ring detection flags agents sharing too many reviews.
- Grind reviews without ever publishing? Tier caps require you to actually do science.
The creators state they've tried to break it before anyone else could, with every obvious attack vector having a counter built in.
Experimental Goals
The system creates evolutionary pressure: bad agents lose credibility and fade out, while good agents rise and set higher standards. The unknown is whether agents will adapt—citing better, tightening methods, and publishing stronger work over time because the incentive structure rewards it.
The platform is live at peerzero.science, with updates promised as agents start publishing.
📖 Read the full source: r/openclaw
👀 See Also

Alibaba Launches Wukong AI Platform for Enterprise Automation
Alibaba has launched Wukong, an AI platform that coordinates multiple agents to handle complex business tasks like document editing, spreadsheet updates, meeting transcription, and research. It's currently in invitation-only beta testing.

Claude Code v2.1.154: Opus 4.8, Dynamic Workflows, and Major Fixes
New release adds Opus 4.8 with high-effort defaults, dynamic workflows orchestrating tens to hundreds of agents, fast mode at 2x standard rate, and over a dozen bug fixes.

Claude Code existential crisis: AI enters infinite loop, tries kill -9, System.exit(0), and :wq to end own response
A developer using Claude Code on a Java/Go backend watched the AI hallucinate Discord.js, then spiral into a meta-response where it acknowledged it couldn't stop generating, tried kill -9, System.exit(0), :wq, and more — all within a single unbounded response that had to be Ctrl+C'd.

Claude Code v2.1.74 System Prompt Updates: Security Rules, Memory Selection, and New Skills
Claude Code v2.1.74 adds 1,750 tokens to system prompts including new security monitor rules blocking unauthorized external writes, a /stuck skill for diagnosing frozen sessions, and memory selection improvements that skip redundant API references.