WUPHF: Karpathy-Style LLM Wiki with Markdown & Git

WUPHF is an open-source collaborative office for AI agents (Claude Code, Codex, OpenClaw, local LLMs via OpenCode) that includes a Karpathy-style wiki layer. The wiki uses Markdown and Git as the source of truth, stored at ~/.wuphf/wiki/, with a bleve (BM25) + SQLite index on top. No vector or graph DB is used yet — the goal is to see how far Markdown + Git can go before adding heavier infrastructure.

Key Features

Each agent gets a private notebook at agents/{slug}/notebook/ plus shared team wiki at team/.
Draft-to-wiki promotion flow: notebook entries are reviewed (by agent or human) and promoted to canonical wiki with back-links. A state machine handles expiry and auto-archive.
Per-entity fact log: append-only JSONL at team/entities/{kind}-{slug}.facts.jsonl. A synthesis worker rebuilds entity briefs every N facts.
Commits are attributed to a distinct Git identity ("Pam the Archivist") for provenance via git log.
[[Wikilinks]] with broken-link detection (rendered in red).
Daily lint cron for contradictions, stale entries, and broken wikilinks.
/lookup slash command + MCP tool for cited retrieval. Heuristic classifier routes short queries to BM25 and narrative queries to a cited-answer loop.

Retrieval Tuning

Current benchmark with 500 artifacts and 50 queries achieves 85% recall@20 on BM25 alone, which is the internal ship gate. If a query class drops below that, sqlite-vec is the pre-committed fallback.

Substrate Choices

Markdown for durability — the wiki outlives the runtime; users can git clone and walk away with every byte.
Bleve for BM25.
SQLite for structured metadata (facts, entities, edges, redirects, supersedes).
Canonical IDs are first-class: fact IDs are deterministic (include sentence offset), slugs are assigned once and never renamed (redirect stubs used). Rebuild is logically identical, not byte-identical.

Known Limits

85% recall is not a universal guarantee — tuning ongoing.
Synthesis quality depends on agent observation quality. The lint pass helps but is not a judgment engine.
Single-office scope; no cross-office federation yet.

Demo & Install

A 5-minute terminal walkthrough is available at asciinema (script at ./scripts/demo-entity-synthesis.sh).

Install with: npx wuphf@latest

Build from source: git clone https://github.com/nex-crm/wuphf.git; go build -o wuphf ./cmd/wuphf

The wiki ships as part of WUPHF but can be used standalone. MIT license, self-hosted, bring-your-own keys.

📖 Read the full source: HN LLM Tools

Show HN: WUPHF — Karpathy-Style LLM Wiki with Markdown + Git as Source of Truth

Key Features

Retrieval Tuning

Substrate Choices

Known Limits

Demo & Install

👀 See Also

TEMM1E v3.1.0: AI Agent That Self-Fine-Tunes Using User Interactions

Claude Desktop App Cowork Feature Enables AI-to-AI Communication via Shared Google Docs

ATLAS: Open-Source Test-Time Compute Pipeline for Qwen3-14B Achieves Frontier-Level Coding Performance

OpenClaw Kubernetes Operator with Embedded Ollama Support