ClawProxy: Self-Hosted AI Routing Proxy for Rotating Free-Tier API Keys

ClawProxy is a self-hosted AI routing proxy that aims to let developers run professional-grade AI setups with a $0 monthly bill by pooling free-tier quotas across multiple API keys. It sits between tools such as OpenClaw, Cursor, and OpenCode and the AI providers they call, handling key rotation and load balancing in the background.
Key Features
The proxy includes several specific features to handle free-tier limitations:
- In-Flight Key Rotation & Load Balancing: ClawProxy doesn't just wait for 429 (rate-limit) errors. If a provider is overloaded, times out, or otherwise misbehaves, the proxy swaps in another key while the request is still in flight. Keys are scheduled with weighted round-robin: you assign each key a weight, and load is spread across the available quotas in proportion so that no single key gets hammered too hard.
- Robust Provider Fallback: When free providers are tapped out, ClawProxy shifts load automatically. It translates model names within the same API format: if your preferred Gemini 3.1 Flash endpoint fails, it can fail over to another healthy provider hosting the same model, or to a compatible alternative such as GLM 5 or Kimi K2.5. It also supports custom endpoints for Perplexity to centralize AI web search traffic.
- Dashboard with Deep-Parsed Logs: The dashboard features a modern bubble UI that displays System Instructions, User, and Assistant messages neatly formatted. It includes tool analysis for inspecting Tool Calls and their arguments, plus real-time metrics over WebSockets, including precise token counts and latency.
- Premium Self-Hosted Experience: Installation is a one-liner, and ClawProxy registers as a native service on Linux (systemd), macOS (launchd), and Windows so that it starts on boot. The tool is privacy-first: it runs 100% locally, and API keys never leave your machine except to reach the official providers.
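
The weighted round-robin and in-flight rotation described in the first feature can be sketched roughly as follows. This is a minimal illustration, not ClawProxy's actual code: the `Key`, `WeightedRoundRobin`, `UpstreamError`, and `send_with_rotation` names are hypothetical, and the scheduler shown is the "smooth" weighted round-robin variant, chosen here as an assumption because it interleaves keys instead of sending bursts to the heaviest one.

```python
from dataclasses import dataclass


class UpstreamError(Exception):
    """Hypothetical error for a 429, timeout, or other upstream failure."""


@dataclass
class Key:
    name: str         # hypothetical label standing in for a real API key
    weight: int       # relative share of traffic this key should receive
    current: int = 0  # running score used by smooth weighted round-robin


class WeightedRoundRobin:
    def __init__(self, keys):
        self.keys = list(keys)

    def next_key(self, exclude=()):
        """Pick the next key in proportion to its weight, skipping excluded names."""
        candidates = [k for k in self.keys if k.name not in exclude]
        if not candidates:
            raise RuntimeError("no usable keys left")
        total = sum(k.weight for k in candidates)
        for k in candidates:
            k.current += k.weight          # every candidate earns its weight
        best = max(candidates, key=lambda k: k.current)
        best.current -= total              # the winner pays back the total
        return best


def send_with_rotation(pool, call, max_attempts=3):
    """Try a request, swapping to another key if the current one fails."""
    failed = set()
    for _ in range(max_attempts):
        key = pool.next_key(exclude=failed)
        try:
            return call(key)
        except UpstreamError:
            failed.add(key.name)           # do not retry on the same key
    raise RuntimeError("all attempts failed")


if __name__ == "__main__":
    pool = WeightedRoundRobin([Key("key-a", 3), Key("key-b", 1)])
    print([pool.next_key().name for _ in range(8)])
```

With weights 3 and 1, the first key is selected three times for every one selection of the second, which is the proportional spreading the feature description refers to.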
Supported Models and Providers
ClawProxy works with high-end models including Gemini 3 Flash, GLM 5, and MiniMax M2.5. It manages free-tier keys from providers such as Google AI Studio, NVIDIA NIM, and Groq. The creator also offers early users two pre-configured secondary providers, which need no setup and serve cutting-edge models such as GLM 5, MiniMax M2.5, Kimi K2.5, and GLM 4.7.
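
Provider fallback with model-name translation, as described under Key Features, might look something like the sketch below. Everything here is an assumption made for illustration: the `ROUTES` table, the provider labels, the model identifiers, and the `route` and `send` functions are hypothetical and do not reflect ClawProxy's real configuration format or API.

```python
class ProviderUnavailable(Exception):
    """Hypothetical error for a provider that is rate-limited, down, or timing out."""


# Hypothetical routing table: one logical model name maps to an ordered list
# of (provider, provider-specific model name) pairs to try in turn.
ROUTES = {
    "fast-model": [
        ("provider-a", "model-id-on-a"),
        ("provider-b", "model-id-on-b"),
    ],
}


def route(model, prompt, send, routes=ROUTES):
    """Send the prompt to the first healthy provider, translating the model name."""
    errors = []
    for provider, upstream_model in routes.get(model, []):
        try:
            # `send` is a caller-supplied function that performs the real request.
            return send(provider, upstream_model, prompt)
        except ProviderUnavailable as exc:
            errors.append(f"{provider}: {exc}")  # remember why this hop failed
    raise ProviderUnavailable(f"no healthy provider for {model!r}: {errors}")
```

Keeping the routes in an ordered list makes the fallback priority explicit: the preferred provider is tried first, and later entries are only used when earlier ones fail.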
This tool is designed for developers who want to use multiple AI models without hitting rate limits or incurring API costs, particularly those working with AI coding agents.
📖 Read the full source: r/openclaw