Anthropic reports industrial-scale distillation attacks by Chinese AI labs on Claude

✍️ OpenClawRadar | 📅 Published: February 24, 2026

Industrial-scale model extraction operation

Anthropic has published findings detailing coordinated distillation attacks against Claude by three Chinese AI laboratories. The attacks involved creating fraudulent accounts at scale to extract Claude's reasoning capabilities through massive API interactions.

Key attack details from Anthropic's report

DeepSeek, Moonshot, and MiniMax created over 24,000 fraudulent accounts
Total exchanges with Claude exceeded 16 million
MiniMax alone sent 13 million requests
When Anthropic released a new model, MiniMax redirected nearly half its traffic to that model within 24 hours
DeepSeek specifically targeted chain-of-thought reasoning and censorship-safe answers
Attacks grew in sophistication over time as the labs adapted their methods
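
To put the figures above in proportion, here is a minimal back-of-the-envelope sketch in Python. It uses only the numbers quoted in the list and assumes that "exchanges" and "requests" count the same unit, which the source does not state explicitly. The source gives the figures as lower bounds ("over", "exceeded"), so the derived values are rough estimates, and the function name `summarize` is purely illustrative.

```python
# Figures as quoted in the list above. The source reports them as lower
# bounds ("over 24,000", "exceeded 16 million"), so results are approximate.
ACCOUNTS = 24_000
TOTAL_EXCHANGES = 16_000_000
MINIMAX_REQUESTS = 13_000_000  # assumed to be the same unit as "exchanges"


def summarize(accounts: int, total: int, minimax: int) -> dict:
    """Derive simple ratios from the reported totals."""
    return {
        "minimax_share": minimax / total,      # fraction of all traffic
        "other_labs": total - minimax,         # left for DeepSeek and Moonshot
        "per_account": total / accounts,       # mean exchanges per account
    }


stats = summarize(ACCOUNTS, TOTAL_EXCHANGES, MINIMAX_REQUESTS)
print(f"MiniMax share of traffic: {stats['minimax_share']:.2%}")   # 81.25%
print(f"Remaining for other labs: {stats['other_labs']:,}")        # 3,000,000
print(f"Mean exchanges per account: {stats['per_account']:.0f}")   # 667
```

Under those assumptions, MiniMax accounts for roughly four fifths of the reported traffic, and the average fraudulent account handled several hundred exchanges.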

Security implications for AI developers

This incident shows how exposed AI models are when billion-dollar labs systematically attempt to extract proprietary capabilities. The scale and persistence of the attacks, which spanned multiple organizations and adapted to new model releases, suggest an ongoing threat vector rather than a set of isolated incidents.

The methods used (fraudulent account creation, targeted querying for specific capabilities, rapid adaptation to new model versions) could be replicated against other AI systems, which raises questions about the security of third-party AI tools that developers integrate into their workflows.

📖 Read the full source: r/ClaudeAI
