ClawProxy: Self-Hosted AI API Key Rotating Proxy

ClawProxy is a self-hosted AI routing proxy designed to help developers run professional-grade AI setups with a $0 monthly bill by multi-threading free-tier quotas. It sits between tools like OpenClaw, Cursor, and OpenCode and AI providers, managing key rotation and load balancing in the background.

Key Features

The proxy includes several specific features to handle free-tier limitations:

In-Flight Key Rotation & Load Balancing: ClawProxy doesn't just wait for 429 errors. If a provider is overloaded, times out, or has bugs, the proxy replaces the key in-flight. It uses weighted load balancing where you assign "weights" to your keys to distribute load across available quotas, and weighted round-robin to ensure no single key gets hammered too hard.
Robust Provider Fallback: When free providers are tapped out, ClawProxy shifts load automatically. It translates model names within the same format - if your preferred Gemini 3.1 Flash endpoint fails, it can fail over to another healthy provider hosting the same model or compatible alternatives like GLM 5 or Kimi K2.5. It also supports custom endpoints for Perplexity to centralize AI web search traffic.
Dashboard with Deep-Parsed Logs: The dashboard features a modern bubble UI that displays System Instructions, User, and Assistant messages neatly formatted. It includes tool analysis to inspect Tool Calls and their arguments, plus real-time metrics via WebSockets with precise token counts and latency metrics.
Premium Self-Hosted Experience: Installation is a one-liner with native services for Linux (systemd), macOS (launchd), and Windows that start on boot. The tool is privacy-first - 100% local, with API keys never leaving your machine except to reach official providers.

Supported Models and Providers

ClawProxy works with high-end models including Gemini 3 flash, GLM 5, and MiniMax M2.5. It manages free-tier keys from providers like Google AI Studio, Nvidia NIM, and Groq. The creator offers early users access to two secondary providers that provide access to cutting-edge models like GLM 5, MiniMax M2.5, Kimi K2.5, and GLM 4.7 - these come pre-configured with no setup needed.

This tool is designed for developers who want to use multiple AI models without hitting rate limits or incurring API costs, particularly those working with AI coding agents.

📖 Read the full source: r/openclaw

ClawProxy: Self-Hosted AI Routing Proxy for Rotating Free-Tier API Keys

Key Features

Supported Models and Providers

👀 See Also

Real-Time Desktop Overlay for Monitoring Claude Code Usage Limits

ClawPy: Minimal Single-File Python Implementation of OpenClaw with Experience Memory

OpenClaw plugin adds persistent memory with Engram server

Developer tracks frustration with 'F-Bombs Per Thousand Prompts' metric across 44,212 Claude Code logs