Anthropic Drops Key Safety Pledge from Responsible Scaling Policy

Anthropic has removed the core commitment from its flagship Responsible Scaling Policy (RSP), according to a TIME report. The company previously pledged in 2023 to never train an AI system unless it could guarantee in advance that its safety measures were adequate.
Policy Change Details
The company is scrapping the promise to not release AI models if Anthropic can't guarantee proper risk mitigations in advance. This was the central pillar of their Responsible Scaling Policy, which company leaders had touted for years as evidence they would withstand market incentives to rush potentially dangerous technology.
Reasoning Behind the Change
Anthropic's chief science officer Jared Kaplan told TIME: "We felt that it wouldn't actually help anyone for us to stop training AI models. We didn't really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments … if competitors are blazing ahead."
The company has positioned itself as the most safety-conscious of the top AI research labs, making this policy change significant for developers tracking AI safety practices. The decision represents a shift from their previous stance of prioritizing safety guarantees over development speed.
📖 Read the full source: r/ClaudeAI
👀 See Also

Nvidia reportedly developing open-source NemoClaw to compete with OpenClaw
Recent reports suggest Nvidia is working on an open-source project called NemoClaw aimed at directly competing with OpenClaw in AI development tools. The project is expected to focus on improving performance, scalability, and developer flexibility while maintaining compatibility with modern AI workflows.

xAI loses legal challenge to California AI data disclosure law
xAI has lost its attempt to block California's AI data disclosure law, which requires companies to disclose training data sources and other details about their AI systems. The court ruling means the law will proceed as scheduled.

Anthropic study reveals cognitive degradation in AI-assisted workflows
Anthropic's global study of 80,000 users found academic users report cognitive degradation rates 2.5x higher than average when using AI tools like Claude and Cursor. The source identifies the problem as users eliminating the 'digestion phase' of work.

Claude Code v2.1.117 Release: Subagent Forking, Plugin Improvements, and Performance Fixes
Claude Code v2.1.117 enables forked subagents on external builds via CLAUDE_CODE_FORK_SUBAGENT=1, improves plugin dependency handling, and fixes Opus 4.7 context window calculations. The release includes faster startup with concurrent MCP connections and replaces Glob/Grep tools with embedded bfs/ugrep on macOS/Linux.