Deblank: Tool to Strip Code Formatting for LLM Token Reduction

What Deblank Does
Deblank is a preprocessing tool that removes code formatting (indentation, whitespace, line breaks) before sending code to LLMs, with a postprocessing step to restore readability. The transformation is bidirectional and AST-safe.
Performance Results
In evaluations across several models (DeepSeek-V3, Claude, Gemini, etc.):
- ~30% token reduction for languages like Java and C++
- ~9% token reduction for Python
- Negligible impact on Pass@1 accuracy for code completion
- Average latency: ~76ms
Supported Languages and Features
- Python, Java, C/C++, C#, JavaScript/TypeScript, and Go
- Handles incomplete snippets reasonably well
- Useful for fill-in-the-middle workflows
Getting Started
The project is open-sourced with these resources:
- GitHub: https://github.com/anpl-code/Deblank
- Paper: https://arxiv.org/abs/2508.13666
- Docker:
docker pull zhangcen456/deblank:latest
This type of token optimization can be particularly useful when working with context-limited LLMs or when processing large codebases, though the impact varies by language due to differences in formatting conventions.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Yozora-fm: Interactive Anime Music Galaxy Visualization
Yozora-fm is an interactive visualization where each star represents an anime opening or ending song, with over 9,000 tracks mapped by genre and era. Users can click stars to play videos or explore the galaxy interface.

VectorClaw v1.0.0: MCP Server for Anki Vector Robot Control
VectorClaw v1.0.0 is an MCP server that enables OpenClaw to control Anki Vector robots through 23 specific tools for speech, motion, perception, sensors, and display functions.

MCP Server Indexes Codebases into Knowledge Graph for 10x Token Reduction
A new MCP server called codebase-memory-mcp parses codebases into a persistent knowledge graph using tree-sitter, reducing token usage by at least 10x for structural queries. Benchmarked across 35 real-world repositories, it replaces file-by-file exploration with graph queries.

Tether: An MCP Server for Sharing Context Between AI Models via SQLite
Tether is an open-source tool that collapses JSON data into 28-byte content-addressed handles, allowing multiple AI models to share context through a shared SQLite database. It functions as an MCP server, enabling direct communication between models like Claude and MiniMax without copy-pasting.