Fullerenes: Cut Tokens 64% for Coding Agents

Fullerenes is an open-source persistent memory layer for AI coding agents. Instead of re-reading files every session, it builds a local knowledge graph from your repo using Tree-sitter and exposes it over MCP (Model Context Protocol). Agents query the graph for functions, classes, imports, and call relationships rather than reading raw files, which cut token consumption by 64% to 96.6% in the benchmarks reported below.

How it works

Run npx fullerenes init in your repo. The command walks the codebase with Tree-sitter, extracts every function, class, import, and call relationship, and stores them in a local SQLite graph. Agents connect via MCP and ask targeted questions.
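
To make the idea of a "local SQLite graph" concrete, here is a minimal sketch of how functions and call relationships could be stored as nodes and edges and then queried. The table names, columns, and the callers_of helper are assumptions for illustration only; the post does not describe Fullerenes' actual schema, and the tool itself is an npm package, not Python.

```python
import sqlite3

# Hypothetical schema: Fullerenes' real tables and column names may differ.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE nodes (
        id   INTEGER PRIMARY KEY,
        kind TEXT NOT NULL,      -- 'function', 'class', or 'import'
        name TEXT NOT NULL,
        file TEXT NOT NULL
    );
    CREATE TABLE edges (
        src  INTEGER NOT NULL REFERENCES nodes(id),
        dst  INTEGER NOT NULL REFERENCES nodes(id),
        kind TEXT NOT NULL       -- e.g. 'calls'
    );
""")

# Illustrative data: main -> load_app -> parse_config
conn.executemany(
    "INSERT INTO nodes (id, kind, name, file) VALUES (?, ?, ?, ?)",
    [
        (1, "function", "parse_config", "config.py"),
        (2, "function", "load_app", "app.py"),
        (3, "function", "main", "cli.py"),
    ],
)
conn.executemany(
    "INSERT INTO edges (src, dst, kind) VALUES (?, ?, ?)",
    [(2, 1, "calls"), (3, 2, "calls")],
)


def callers_of(name: str) -> list[str]:
    """Return the names of functions that directly call `name`."""
    rows = conn.execute(
        """
        SELECT caller.name
        FROM edges
        JOIN nodes AS caller ON caller.id = edges.src
        JOIN nodes AS callee ON callee.id = edges.dst
        WHERE callee.name = ? AND edges.kind = 'calls'
        ORDER BY caller.name
        """,
        (name,),
    ).fetchall()
    return [row[0] for row in rows]


print(callers_of("parse_config"))
```

A targeted query like this returns a handful of names where an agent would otherwise read whole files, which is the token saving the post is claiming.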

The design draws on retrieval research: Repoformer (retrieve only when needed), HippoRAG and G-Retriever (graph beats flat chunks), and LLMLingua (aggressive context compression). The goal is better signal per token, not more context.

Unique MCP tools

Two standout tools:

predict_impact({ functionName: "x" }) — Before editing, the agent asks what else will break. Traverses the edge graph and returns direct + transitive dependents with a risk score. Blast radius before the first keystroke.
get_function({ name: "x", includeBody: true }) — Signature, body, and callers in one MCP call. No follow-up read_file needed.
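
The traversal behind predict_impact can be sketched as a breadth-first search over reversed call edges. Everything below is an assumption for illustration: the sample graph, the Python function signature, and especially the risk formula, which the post does not specify. It is not Fullerenes' implementation.

```python
from collections import deque

# Hypothetical call graph: each key calls the functions in its list.
CALLS = {
    "main": ["load_app"],
    "load_app": ["parse_config", "connect_db"],
    "run_tests": ["parse_config"],
    "parse_config": [],
    "connect_db": [],
}


def predict_impact(function_name: str, calls: dict[str, list[str]]) -> dict:
    """Find direct and transitive dependents of `function_name`."""
    # Invert the graph so we can walk from a callee to its callers.
    callers: dict[str, set[str]] = {}
    for caller, callees in calls.items():
        for callee in callees:
            callers.setdefault(callee, set()).add(caller)

    # Direct dependents call the function themselves.
    direct = sorted(callers.get(function_name, set()) - {function_name})

    # Breadth-first search upward to collect everything that depends on them.
    seen = set(direct)
    queue = deque(direct)
    while queue:
        current = queue.popleft()
        for caller in callers.get(current, set()):
            if caller not in seen and caller != function_name:
                seen.add(caller)
                queue.append(caller)

    transitive = sorted(seen - set(direct))

    # Made-up risk score: share of the other functions in the graph affected.
    others = len(calls) - 1
    risk = round(len(seen) / others, 2) if others > 0 else 0.0
    return {"direct": direct, "transitive": transitive, "risk": risk}


print(predict_impact("parse_config", CALLS))
```

In this sample graph, editing parse_config touches load_app and run_tests directly and main transitively, so three of the four other functions are in the blast radius.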

Benchmark results

SWE-bench Verified (1 instance so far): Codex baseline 91,949 tokens → with Fullerenes 32,945 tokens. 64% reduction.
Internal (5 questions on this repo): Raw files 2,452 tokens avg → Fullerenes 137 tokens avg. 94.4% reduction.
External (Gemini CLI on a Python project): Raw files 27,292 tokens → Fullerenes AGENTS.md 919 tokens. 96.6% reduction.
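
The percentages above follow from (baseline - reduced) / baseline. A short check, using only the token counts quoted in the post (the helper name reduction_pct is ours):

```python
def reduction_pct(baseline: int, reduced: int) -> float:
    """Percentage of baseline tokens saved, rounded to one decimal place."""
    return round((baseline - reduced) / baseline * 100, 1)


print(reduction_pct(91_949, 32_945))  # SWE-bench Verified instance
print(reduction_pct(2_452, 137))      # internal questions, average
print(reduction_pct(27_292, 919))     # external Python project
```

These evaluate to 64.2, 94.4, and 96.6, matching the figures reported in the post.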

Limitations

Tree-sitter is structural, not semantic. Calls made through dynamic dispatch or metaprogramming do not appear as plain call sites in the syntax tree, so the graph will miss those edges. LSP integration is on the roadmap. One SWE-bench instance is not a broad result, and more are being run.
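
The structural limitation is easy to reproduce with any syntax-only analysis. The sketch below uses Python's built-in ast module as a stand-in for Tree-sitter (a swapped-in technique, not what Fullerenes runs), and the sample source and the static_callees helper are hypothetical. It finds the call written as a plain identifier but cannot see which function a runtime lookup will reach.

```python
import ast

# Sample code to analyze. It is only parsed, never executed.
SOURCE = '''
def save(record):
    pass

def delete(record):
    pass

def handle(action, record):
    save(record)               # static call: visible in the syntax tree
    globals()[action](record)  # dynamic call: target known only at runtime
'''


def static_callees(source: str, function_name: str) -> set[str]:
    """Collect names called by plain identifier inside `function_name`."""
    tree = ast.parse(source)
    found: set[str] = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef) and node.name == function_name:
            for inner in ast.walk(node):
                # Only calls whose target is a bare name can be resolved.
                if isinstance(inner, ast.Call) and isinstance(inner.func, ast.Name):
                    found.add(inner.func.id)
    return found


print(sorted(static_callees(SOURCE, "handle")))
```

The result contains save and globals but not delete, even though handle("delete", record) would call it at runtime. A graph built this way would leave delete out of an impact prediction for handle, which is the gap semantic tooling such as an LSP server is meant to close.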

Local & open source

Everything runs locally: SQLite, no server, no API key, pure npm (no Python), works offline, MIT license. The project has just launched, with 589 npm downloads in the 40 hours before the Reddit post and 14 stars.

github.com/codebreaker77/Fullerenes
npmjs.com/package/fullerenes

Three questions the author is asking the community: Does graph-based retrieval change your agent workflows, or is long context winning? Which MCP tools should be added beyond the current 8? Does the SWE-bench methodology look sound?

📖 Read the full source: r/ClaudeAI