CrabMeat v0.1.0: A Security-First Agent Gateway That Doesn't Trust the LLM with the Security Boundary

CrabMeat v0.1.0 dropped yesterday under Apache 2.0, built on one design thesis: the LLM never holds the security boundary. The project is a direct response to failures like Summer Yue's agent deleting 200+ emails — where a safety instruction was a prompt that got compacted away.
Key protections (all always-on, no config toggle)
- Capability ID indirection — The model sees per-session HMAC-derived opaque IDs like
cap_a4f9e2b71c83, never real tool names. It cannot guess or forge a tool name because it doesn't know any. - Effect classes — Every tool declares a class (
read,write,exec,network). Every agent declares which classes it can use. The check is a pure function with no runtime state, easy to test exhaustively, hard to bypass. - IRONCLAD_CONTEXT — Critical safety instructions are pinned to the top of the context window and explicitly marked as non-compactable. The compaction failure mode that stripped Yue's instruction cannot happen.
- Tamper-evident audit chain — Every tool call, privileged operation, and scheduler run enters the same SHA-256 hash-chained log. Tampering is provable.
- Streaming output leak filter — Secrets (API keys, JWTs, PEM blocks, capability IDs) are redacted mid-stream across token boundaries before reaching the client.
- No YOLO mode — There is no global 'trust the LLM with everything' switch. Expanded reach comes through named scoped roots that are explicit, audit-logged, and bounded.
The README lists 15 always-on protections in a table; none can be turned off by config. The gateway is local-first by default, configured for Ollama, LM Studio, vLLM out of the box. Anthropic and OpenAI require explicit configuration — no silent cloud shipping.
Who it's for
Developers building agentic systems who need architectural guarantees, not prompt-based safety, and want a gateway they can trust with tool execution and sensitive data.
📖 Read the full source: r/ClaudeAI
👀 See Also

Manual-Driven Development: A Method to Prevent Claude Code's Confident Divergence
Manual-Driven Development (MDD) is a method that addresses confident divergence in Claude Code, where the AI produces wrong code that passes its own tests. In a production audit, MDD found 190 issues, wrote 876 new tests in under 8 hours, and eliminated rule violations.

OpenClaw skill reduces accessibility tree tokens from 600K to 1.3K for ad-heavy sites
A developer built an OpenClaw skill that uses ML-based element ranking to prune accessibility trees, cutting slickdeals.com from ~598K tokens to ~1.3K tokens by keeping only the top ~50 actionable elements.

Building a Self-Improving Knowledge System with Claude Code and Obsidian
A developer built a 25-tool system that gives Claude Code persistent memory through semantic search, knowledge graphs, and spaced repetition over an Obsidian vault. The system indexes content with bge-m3 embeddings, detects contradictions, auto-prunes stale notes, and generates Obsidian Canvas maps automatically.

OpenClaw PARA Skill Automatically Organizes Files Using Tiago Forte's Method
A developer created an OpenClaw skill that enforces the PARA method (Projects, Areas, Resources, Archives) for automatic file organization, moving files from a messy root directory into structured folders.