/compress-architecture: An Agent Skill to Prune Over-Engineering

A Reddit user (u/CamWebby) built /compress-architecture, a skill for AI coding agents that focuses on removing unnecessary architecture rather than adding it. The core question it asks: “Can the system do the same job with fewer visible parts?” The goal is fewer concepts, clearer ownership, and less incidental coupling, not simply fewer files or lines.
What It Audits
- Speculative layers (abstractions for features that never materialized)
- Pass-through modules (code that just delegates without adding value)
- Duplicate concepts (two ways to do the same thing)
- Abstractions that no longer earn their keep (overhead > benefit)
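To make the "pass-through module" category concrete, here is a minimal sketch of the kind of layer such an audit might flag. The names `User`, `UserRepository`, and `UserService` are hypothetical and chosen only for illustration; they do not come from the skill itself.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class User:
    id: int
    name: str


class UserRepository:
    """Owns storage access: the part that does real work."""

    def __init__(self) -> None:
        self._users = {1: User(1, "Ada")}

    def get(self, user_id: int) -> Optional[User]:
        return self._users.get(user_id)


class UserService:
    """Pass-through layer: forwards every call without adding anything."""

    def __init__(self, repo: UserRepository) -> None:
        self._repo = repo

    def get(self, user_id: int) -> Optional[User]:
        # No validation, caching, or policy: pure delegation.
        return self._repo.get(user_id)


# Before: callers go through two concepts to reach the same data.
before = UserService(UserRepository()).get(1)

# After: the pass-through is gone and callers use the repository directly.
after = UserRepository().get(1)
```

Both paths return the same user, so removing `UserService` drops a concept without changing behavior. If the service later gained real logic (authorization, caching), it would start earning its keep again.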
What It Protects
- Real domain boundaries (actual business logic seams)
- Public APIs (contracts consumed externally)
- Testability seams (dependency injection, interfaces that enable testing)
- Explicit error handling (not swallowed or concealed)
- Code that changes at different rates (separate evolution paths)
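As a contrast, the sketch below shows an indirection that looks removable but is the kind of testability seam the list above says to protect. The function `is_expired` and its `now` parameter are hypothetical names used only for illustration.

```python
from datetime import datetime, timezone
from typing import Callable


def utc_now() -> datetime:
    """Default clock used in production."""
    return datetime.now(timezone.utc)


def is_expired(expires_at: datetime, now: Callable[[], datetime] = utc_now) -> bool:
    """Return True once the injected clock has reached expires_at.

    The `now` parameter is the seam: production code uses the real clock,
    while tests pass a fixed one to get deterministic results.
    """
    return now() >= expires_at


# A test can pin the clock instead of depending on the wall time.
fixed = datetime(2024, 1, 1, tzinfo=timezone.utc)
expired = is_expired(datetime(2023, 12, 31, tzinfo=timezone.utc), now=lambda: fixed)
not_expired = is_expired(datetime(2024, 1, 2, tzinfo=timezone.utc), now=lambda: fixed)
```

Inlining `datetime.now()` would remove one parameter, but it would also make the expiry check impossible to test without patching, which is why a seam like this is worth keeping.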
The skill is meant to run inside an AI coding agent's workflow, presumably through a command such as /compress-architecture pointed at a codebase directory. It is not a standalone tool: it changes how the agent behaves so that it audits the code and suggests removals.
Who It's For
Developers using AI coding agents (such as Claude or Copilot) who want to automatically detect and remove over-engineering from their codebases without breaking legitimate architectural boundaries.
📖 Read the full source: r/ClaudeAI