Claw Compactor: 14-stage token compression engine for LLM pipelines

What is Claw Compactor?
Claw Compactor is an open-source LLM token compression engine built around a 14-stage Fusion Pipeline. Each stage is a specialized compressor — from AST-aware code analysis to JSON statistical sampling to simhash-based deduplication — chained through an immutable data flow architecture where each stage's output feeds the next.
Architecture Details
The Fusion Pipeline includes these stages:
- QuantumLock → Cortex → Photon → RLE → SemanticDedup → Ionizer
- LogCrunch → SearchCrunch → DiffCrunch → StructuralCollapse
- Neurosyntax → Nexus → TokenOpt → Abbrev
Key design principles:
- Immutable data flow — FusionContext is a frozen dataclass. Every stage produces a new FusionResult; nothing is mutated in-place.
- Gate-before-compress — Each stage has should_apply() that inspects context type, language, and role before doing any work. Stages that don't apply are skipped at zero cost.
- Content-aware routing — Cortex auto-detects content type (code, JSON, logs, diffs, search results) and language (Python, Go, Rust, TypeScript, etc.), then downstream stages make type-aware compression decisions.
- Reversible compression — Ionizer stores originals in a hash-addressed RewindStore. The LLM can call a tool to retrieve any compressed section by its marker ID.
Benchmark Results
Real-World Compression (FusionEngine v7 vs Legacy Regex):
- Python source: 25.0% compression (3.4x improvement over legacy)
- JSON (100 items): 81.9% compression (6.5x improvement)
- Build logs: 24.1% compression (4.4x improvement)
- Agent conversation: 31.0% compression (5.4x improvement)
- Git diff: 15.0% compression (2.4x improvement)
- Search results: 40.7% compression (7.7x improvement)
- Weighted average: 53.9% compression (5.9x improvement)
SWE-bench Real Tasks:
- django__django-11620 (4.5K): 14.5% compression
- sympy__sympy-14396 (5.5K): 19.1% compression
- scikit-learn-25747 (11.8K): 15.9% compression
- scikit-learn-13554 (73K): 11.8% compression
- scikit-learn-25308 (81K): 14.4% compression
vs LLMLingua-2 (ROUGE-L Fidelity):
- Compression rate 0.3 (aggressive): Claw Compactor 0.653 vs LLMLingua-2 0.346 (+88.2%)
- Compression rate 0.5 (balanced): Claw Compactor 0.723 vs LLMLingua-2 0.570 (+26.8%)
Quick Start
git clone https://github.com/open-compress/claw-compactor.git
cd claw-compactor
# Benchmark your workspace (non-destructive)
python3 scripts/mem_compress.py /path/to/workspace benchmark
# Full compression pipeline
python3 scripts/mem_compress.py /path/to/workspace fullRequirements: Python 3.9+. Optional: pip install tiktoken for exact token counts.
API Usage
from scripts.lib.fusion.engine import FusionEngine
engine = FusionEngine()
result = engine.compress(
text="def hello(): \n # greeting function \n print('hello')",
content_type="code", # or let Cortex auto-detect
language="python", # optional hint
)
print(result["compressed"]) # compressed output
print(result["stats"]) # per-stage stats
📖 Read the full source: HN LLM Tools
👀 See Also

ClawClone: Cloud Backup Tool for OpenClaw Workspaces
ClawClone is a tool that backs up OpenClaw workspaces to the cloud with one prompt and restores them with another. It was built after a developer lost a month's worth of training data.

Headless OpenClaw Setup with Discord via Docker Scripts
A GitHub repository provides scripts to run OpenClaw with Discord in a headless Docker container, avoiding the TUI/WebUI. It includes a management script with commands like claw init, start, and stop, plus preconfigured support for OpenAI Responses API, Chromium, and various tools.

50 Popular Apps Reverse-Engineered into Claude-Readable Design Specs: Key Patterns for UI Cloning
u/meliwat reverse-engineered 50 popular apps into structured markdown design specs. Claude nails UI clones with exact values, state coverage, spacing scales, and navigation graphs. Longer prose degrades output.

Agent Browser Protocol: Open-source Chrome fork for AI agents achieves 90% on Mind2Web benchmark
Agent Browser Protocol (ABP) is an open-source Chrome fork that freezes JavaScript and time after each action to convert web browsing into multimodal chat for AI agents. It achieved 90.53% on the Online Mind2Web Benchmark and can be added to Claude Code with a single command.