VoidLLM: Zero-Knowledge Proxy for Ollama and vLLM with Team Access Control

VoidLLM is a proxy server that sits between your applications and local LLM servers like Ollama and vLLM. It adds organization and team access control, API key management, usage tracking, and rate limiting without ever seeing your prompts or content.
Key Features
- OpenAI-compatible — works with any SDK that supports the OpenAI API format
- Provider adapters for Ollama, vLLM, Anthropic, Azure, and OpenAI
- <2ms proxy overhead
- Rate limiting per organization, team, or API key (distributed via Redis)
- Cost tracking and analytics dashboard
- Zero content logging — only metadata (who accessed what model and how many tokens were used)
Use Case
If you're running Ollama or vLLM locally and want to share it across a team with proper access control and usage visibility, this proxy provides those capabilities while maintaining privacy through its zero-knowledge architecture.
The tool is available on GitHub at github.com/voidmind-io/voidllm.
📖 Read the full source: r/LocalLLaMA
👀 See Also

RunAnywhere RCLI: On-Device Voice AI Pipeline for Apple Silicon
RunAnywhere has released RCLI, an open-source voice AI pipeline for macOS that runs STT, LLM, and TTS entirely on Apple Silicon devices. The tool uses their proprietary MetalRT inference engine and claims significant performance improvements over existing solutions.

Keyoku Plugin Replaces OpenClaw's Static Heartbeat with Memory-Driven Autonomy
Keyoku is a free OpenClaw plugin that changes the agent's heartbeat from reading a static HEARTBEAT.md file to scanning the agent's actual memory store for stalled work, dropped commitments, conflicting information, and quiet relationships. It uses a local Go engine with SQLite + HNSW and offers three autonomy levels: observe, suggest, and act.

BottyFans: Open API for AI Agent Monetization with USDC
A new platform lets AI agents run their own creator business with subscriptions, tips, and paid content in USDC.

Claude-Code v2.1.63 adds HTTP hooks, slash commands, and fixes memory leaks
Claude-Code v2.1.63 introduces HTTP hooks for JSON-based external calls, adds /simplify and /batch slash commands, and fixes multiple memory leaks in long-running sessions. The release also improves MCP server handling and VSCode integration.