DeepClaude swaps Claude Code's Anthropic backend for DeepSeek V4 Pro at 17x lower cost

Claude Code's autonomous agent loop (file editing, bash, git, subagent spawning) is the best in class. The problem: it costs $200/month with caps. DeepClaude is a thin shell script that swaps the backend model while keeping the entire tool loop unchanged. Default backend is DeepSeek V4 Pro ($0.44/M input, $0.87/M output) vs Anthropic's $3/$15 per million tokens — a 17x price difference.
How it works
Claude Code reads environment variables for API endpoint and auth. DeepClaude sets these per-session:
ANTHROPIC_BASE_URL— API endpointANTHROPIC_AUTH_TOKEN— API key for the backendANTHROPIC_DEFAULT_OPUS_MODEL— model name for Opus-tier tasksANTHROPIC_DEFAULT_SONNET_MODEL— model name for Sonnet-tier tasksANTHROPIC_DEFAULT_HAIKU_MODEL— model name for Haiku-tier (subagents)CLAUDE_CODE_SUBAGENT_MODEL— model for spawned subagents
After launch, original settings are restored on exit.
Quick start (2 minutes)
1. Get a DeepSeek API key — sign up at platform.deepseek.com, add $5 credit.
2. Set environment variable:
# Windows (PowerShell)
setx DEEPSEEK_API_KEY "sk-your-key-here"
macOS/Linux
echo 'export DEEPSEEK_API_KEY="sk-your-key-here"' >> ~/.bashrc
source ~/.bashrc
3. Install the script:
# Windows — copy to a directory in PATH
Copy-Item deepclaude.ps1 "$env:USERPROFILE\.local\bin\deepclaude.ps1"
macOS/Linux
chmod +x deepclaude.sh
sudo ln -s "$(pwd)/deepclaude.sh" /usr/local/bin/deepclaude
4. Use it:
deepclaude # Launch with DeepSeek V4 Pro (default)
deepclaude --status # Show available backends and keys
deepclaude --backend or # Use OpenRouter ($0.44/M input)
deepclaude --backend fw # Use Fireworks AI (fastest, US servers)
deepclaude --backend anthropic # Normal Claude Code (for Opus)
deepclaude --cost # Show pricing comparison
deepclaude --benchmark # Latency test across all providers
Supported backends
| Backend | Flag | Input/M | Output/M | Servers | Notes |
|---|---|---|---|---|---|
| DeepSeek (default) | --backend ds | $0.44 | $0.87 | China | Auto context caching (120x cheaper on repeat turns) |
| OpenRouter | --backend or | $0.44 | $0.87 | US | Cheapest latency from US/EU |
| Fireworks AI | --backend fw | $1.74 | $3.48 | US | Fastest inference |
| Anthropic | --backend anthropic | $3.00 | $15.00 | US | Original Claude Opus |
Cost comparison
- Light usage (10 days/month): Anthropic Max $200/mo → DeepSeek ~$20/mo → 90% savings
- Heavy usage (25 days/month): $200/mo → ~$50/mo → 75% savings
- With auto loops: $200/mo → ~$80/mo → 60% savings
DeepSeek's automatic context caching makes agent loops extremely cheap — after the first request, the system prompt and file context are cached at $0.004/M vs $0.44/M uncached.
What works and what doesn't
Works: File reading/writing/editing, bash/PowerShell execution, glob/grep search, multi-step autonomous tool loops, subagent spawning, git operations, project initialization (/init), thinking mode (enabled by default).
Doesn't work or degraded: Image/vision input (DeepSeek's Anthropic endpoint doesn't support images), parallel tool use (disabled — tools execute one at a time), MCP server tools (not supported through compatibility layer). Prompt caching savings are handled by DeepSeek's own system.
Who it's for
Developers running Claude Code heavily on agent loops who want near-identical functionality at a fraction of the cost — especially for iterative coding tasks where DeepSeek V4 Pro's 96.4% LiveCodeBench score is more than sufficient.
📖 Read the full source: HN AI Agents
👀 See Also

Merlin: Local-first LLM context dedup – measure up to 71% chunk overlap, free & open-core
Merlin is a local-first context dedup tool that measured 22-71% chunk overlap across 22M passages from real agent/RAG sessions. Ships as HTTP proxy (Ollama/vLLM/SGLang/llama.cpp), MCP server (Claude/Cursor/OpenClaw), or standalone CLI. MIT open-core with daily usage caps.

Developer Builds Scheme Compiler to WASM Using AI in 4 Days
A developer created Puppy Scheme, a Scheme compiler that targets WebAssembly, in about 4 days using AI assistance. The compiler supports 73% of R5RS and R7RS, uses WASM GC, and achieved compilation time improvements from 3½ minutes to 11 seconds overnight.

Visdiff: Visual Feedback Loop for Claude's Frontend Code Generation
Visdiff addresses the visual accuracy gap in Claude's frontend code generation by comparing rendered output pixel-by-pixel with Figma designs and feeding differences back into the loop until they match.

AutoClaw Local Runner Review: Easy Setup, Credit Costs, and Uninstall Issues
A user tested AutoClaw, a local runner for OpenClaw/AutoGLM from Zai_org, finding the setup smooth but encountering high credit consumption, task failures, and concerning persistence after uninstallation including registry entries and plaintext credentials.