ClawProxy: Self-Hosted AI Routing Proxy for Rotating Free-Tier API Keys

ClawProxy is a self-hosted AI routing proxy that aims to let developers run capable AI setups with a $0 monthly bill by spreading requests across multiple free-tier quotas. It sits between client tools such as OpenClaw, Cursor, and OpenCode and the AI providers, handling key rotation and load balancing in the background.
Key Features
The proxy includes several specific features to handle free-tier limitations:
- In-Flight Key Rotation & Load Balancing: ClawProxy does not just wait for 429 errors. If a provider is overloaded, times out, or otherwise misbehaves, the proxy swaps in a different key while the request is still in flight. You assign a weight to each key, and weighted round-robin distributes load across the available quotas in proportion to those weights, so no single key gets hammered too hard.
- Robust Provider Fallback: When free providers are tapped out, ClawProxy shifts load automatically. It translates model names within the same format: if your preferred Gemini 3.1 Flash endpoint fails, it can fail over to another healthy provider hosting the same model, or to a compatible alternative such as GLM 5 or Kimi K2.5. It also supports custom endpoints for Perplexity to centralize AI web search traffic.
- Dashboard with Deep-Parsed Logs: The dashboard features a modern bubble UI that displays System Instructions, User, and Assistant messages neatly formatted. It includes tool analysis for inspecting Tool Calls and their arguments, plus real-time metrics over WebSockets, including precise token counts and latency.
- Premium Self-Hosted Experience: Installation is a one-liner, with native services for Linux (systemd), macOS (launchd), and Windows that start on boot. The tool is privacy-first: it runs 100% locally, and API keys never leave your machine except to reach official providers.
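To make the key rotation idea concrete, here is a minimal Python sketch of weighted round-robin over a pool of keys, with a retry loop that swaps in a different key when a call fails. It is not ClawProxy's actual code: the names `Key`, `WeightedKeyPool`, `send_with_rotation`, and the `call` callback are all hypothetical, and the "smooth" weighted round-robin variant used here is an assumption about how such a scheduler might be built.

```python
from dataclasses import dataclass


@dataclass
class Key:
    name: str          # hypothetical label, not a real credential
    weight: int        # relative share of traffic for this key
    current: int = 0   # running score used by smooth weighted round-robin


class WeightedKeyPool:
    """Smooth weighted round-robin over a set of API keys (illustrative only)."""

    def __init__(self, keys):
        self.keys = list(keys)

    def next_key(self, exclude=()):
        # Skip keys that already failed for the request being served.
        candidates = [k for k in self.keys if k.name not in exclude]
        if not candidates:
            raise RuntimeError("no usable keys left")
        total = sum(k.weight for k in candidates)
        # Every candidate gains its weight; the highest score is picked
        # and then pays back the total, which spreads picks out evenly.
        for k in candidates:
            k.current += k.weight
        chosen = max(candidates, key=lambda k: k.current)
        chosen.current -= total
        return chosen


def send_with_rotation(pool, request, call, max_attempts=3):
    """Send `request`, replacing the key whenever `call` raises."""
    tried = set()
    last_error = None
    for _ in range(max_attempts):
        try:
            key = pool.next_key(exclude=tried)
        except RuntimeError:
            break  # every key has been tried for this request
        try:
            return call(key, request)
        except Exception as err:  # e.g. HTTP 429, timeout, upstream error
            tried.add(key.name)
            last_error = err
    raise RuntimeError("all attempted keys failed") from last_error
```

With two keys weighted 2 and 1, the pool hands out the first key twice as often as the second, and a failing key is skipped for the rest of that request rather than retried.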
Supported Models and Providers
ClawProxy works with high-end models including Gemini 3 Flash, GLM 5, and MiniMax M2.5. It manages free-tier keys from providers like Google AI Studio, Nvidia NIM, and Groq. The creator offers early users two secondary providers that give access to cutting-edge models like GLM 5, MiniMax M2.5, Kimi K2.5, and GLM 4.7. These come pre-configured, with no setup needed.
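The provider fallback described under Key Features, where a request moves to another healthy provider hosting the same model or to a compatible alternative, could be sketched roughly as below. This is an illustration under stated assumptions, not ClawProxy's configuration format: the `ROUTES` and `ALTERNATIVES` tables, the `resolve_route` function, and all provider and model identifiers are placeholders invented for the example.

```python
# Hypothetical routing table: canonical model name -> ordered list of
# (provider, provider-specific model id). All names are placeholders.
ROUTES = {
    "model-x": [
        ("provider-a", "model-x-preview"),
        ("provider-b", "vendor/model-x"),
    ],
    "model-y": [
        ("provider-c", "model-y-latest"),
    ],
}

# Hypothetical list of compatible alternatives to try when no provider
# hosting the requested model is healthy.
ALTERNATIVES = {
    "model-x": ["model-y"],
}


def resolve_route(model, healthy, routes=ROUTES, alternatives=ALTERNATIVES):
    """Return (provider, provider_model_id) for the first healthy route.

    Providers hosting the requested model are tried first, in order;
    only then are the compatible alternative models considered.
    """
    for candidate in [model, *alternatives.get(model, [])]:
        for provider, provider_model in routes.get(candidate, []):
            if provider in healthy:
                return provider, provider_model
    raise LookupError(f"no healthy provider for {model!r}")
```

Keeping the canonical name separate from each provider's own identifier is what lets the client keep asking for one model while the proxy rewrites the name for whichever provider ends up serving the request.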
This tool is designed for developers who want to use multiple AI models without hitting rate limits or incurring API costs, particularly those working with AI coding agents.
📖 Read the full source: r/openclaw