Skillware adds prompt_rewriter for deterministic token compression in Claude API agent loops

Skillware has added a prompt_rewriter skill that reduces Claude API costs in agentic loops by compressing prompts before sending them to the LLM. The middleware uses heuristic compression to cut token usage by 50-80% while maintaining deterministic behavior across iterations.
How it works
The tool operates as middleware that sits between your agent and the Claude API. It applies aggressive compression to prompts before they're sent to the LLM, ensuring you only pay for the signal and not the noise. Because the compression is deterministic, agent behavior remains stable across loops.
Compression levels
- Low: Normalizes whitespace
- Medium: Strips conversational fillers (please, ensure, etc.)
- High: Aggressively removes stop-words and non-essential punctuation
The tool addresses the problem of runaway costs in iterative agent loops where massive context windows lead to massive bills. It's open-source and available on GitHub, with the developer looking for more skills, feedback, ideas, and contributions.
📖 Read the full source: r/ClaudeAI
👀 See Also

Spectyra Plugin for OpenClaw: Real-Time AI Cost Optimization by Analyzing Full Request Flow
Spectyra plugin reduces AI API costs by surfacing hidden waste like repeated calls, excessive context, and expensive model misuse in real time.

Engram: Open-source memory layer for Claude Code and MCP clients
Engram is an open-source memory layer that works as an MCP server with any client like Claude Code, Cursor, or Windsurf. It stores unlimited memories with semantic vector search, achieves 80% accuracy on LOCOMO benchmark, and uses about 800 tokens per query versus 5K+ for file-based approaches.

Replacing complex retrieval pipelines with simple git shell commands for LLM agents
A developer replaced their entire AI agent retrieval pipeline (sentence-transformers, rank-bm25, two-pass LLM pipeline) with a single tool that lets the agent execute read-only shell commands against a git repository, reducing Docker image size by ~3GB and eliminating timeout issues.

OpenClaw A2A Plugin: Direct Agent-to-Agent Messaging Over the Internet
An OpenClaw A2A plugin enables direct file and message transfer between OpenClaws and other agents over the internet without third-party services like WhatsApp or email.