ReasonDB: Open-Source Document Database Using LLM-Guided Tree Navigation Instead of Vector Search

What ReasonDB Does
ReasonDB is an open-source document database that preserves each document's structure as a hierarchy (headings → sections → paragraphs) and lets an LLM navigate that tree to find answers, rather than searching a flat vector index. It targets a common problem: when retrieval fails in a vector DB or RAG pipeline, the failure is hard to trace because the pipeline behaves like a black box.
How It Works
Ingest process: Documents are converted to markdown, chunked by structure, built into a tree, and each node is summarized by an LLM (bottom-up).
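The ingest steps can be illustrated with a small sketch. This is not ReasonDB's code (the project is written in Rust). It only assumes that the markdown has `#`-style headings and that a `summarize` callable stands in for the LLM call. The names `Node`, `build_tree`, and `summarize_bottom_up` are hypothetical.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Node:
    title: str
    level: int                      # 0 = document root, 1 = '#', 2 = '##', ...
    text: str = ""                  # paragraph text directly under this heading
    children: List["Node"] = field(default_factory=list)
    summary: str = ""


HEADING = re.compile(r"^(#{1,6})\s+(.*)$")


def build_tree(markdown: str, title: str = "document") -> Node:
    """Chunk markdown by heading structure and nest the chunks into a tree."""
    root = Node(title=title, level=0)
    stack = [root]                  # path from the root to the current heading
    for line in markdown.splitlines():
        match = HEADING.match(line)
        if match:
            level = len(match.group(1))
            node = Node(title=match.group(2).strip(), level=level)
            # Pop until the top of the stack is a shallower heading (the parent).
            while stack[-1].level >= level:
                stack.pop()
            stack[-1].children.append(node)
            stack.append(node)
        elif line.strip():
            stack[-1].text += line.strip() + " "
    return root


def summarize_bottom_up(node: Node, summarize: Callable[[str], str]) -> str:
    """Summarize children first, then build this node's summary from them."""
    child_summaries = [summarize_bottom_up(c, summarize) for c in node.children]
    material = " ".join([node.title, node.text.strip(), *child_summaries]).strip()
    node.summary = summarize(material)   # stand-in for an LLM call (assumption)
    return node.summary


if __name__ == "__main__":
    doc = "# Contract\nIntro.\n## Payment\nNet 30.\n### Late fees\n2% monthly.\n"
    tree = build_tree(doc)
    summarize_bottom_up(tree, lambda text: text[:200])  # truncation as a fake summarizer
    print(tree.children[0].children[0].summary)
```

Because summaries are built from the leaves upward, each parent summary already reflects its descendants, which is what lets a later ranking step judge a whole subtree from a single node.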
Query process: BM25 narrows candidate nodes → tree-grep filters by structure → LLM ranks by summaries → beam-search traversal over the tree extracts the answer. The LLM visits approximately 25 nodes out of millions instead of searching a flat vector index.
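The final traversal step can be sketched as a beam search over node summaries. This is a hedged illustration, not ReasonDB's implementation. The `score` function here is a simple keyword overlap standing in for the LLM's ranking, and `beam_width`, `max_visits`, `SummaryNode`, and `beam_search` are hypothetical names. The default of 25 visits only echoes the figure quoted above.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class SummaryNode:
    title: str
    summary: str
    children: List["SummaryNode"] = field(default_factory=list)


def overlap_score(query: str) -> Callable[[SummaryNode], int]:
    """Stand-in for LLM ranking: count query terms that appear in the summary."""
    terms = set(query.lower().split())

    def score(node: SummaryNode) -> int:
        return len(terms & set(node.summary.lower().split()))

    return score


def beam_search(root: SummaryNode, score, beam_width: int = 2,
                max_visits: int = 25) -> List[SummaryNode]:
    """Descend level by level, keeping only the best `beam_width` nodes."""
    frontier = [root]
    leaves: List[SummaryNode] = []
    visited = 0
    while frontier and visited < max_visits:
        next_level: List[SummaryNode] = []
        for node in frontier:
            visited += 1
            if node.children:
                next_level.extend(node.children)
            else:
                leaves.append(node)          # a leaf is a candidate answer passage
            if visited >= max_visits:
                break
        # Rank the expanded children and prune everything outside the beam.
        next_level.sort(key=score, reverse=True)
        frontier = next_level[:beam_width]
    leaves.sort(key=score, reverse=True)
    return leaves


if __name__ == "__main__":
    late = SummaryNode("Late fees", "late payment penalties of 2% monthly")
    invoicing = SummaryNode("Invoicing", "invoicing schedule")
    payment = SummaryNode("Payment", "payment terms and late penalties", [late, invoicing])
    notice = SummaryNode("Notice", "notice period 30 days")
    termination = SummaryNode("Termination", "termination notice period", [notice])
    root = SummaryNode("Contract", "contract overview", [payment, termination])

    hits = beam_search(root, overlap_score("late payment penalties"), beam_width=1)
    print([n.title for n in hits])
```

The pruning at each level is what keeps the number of visited nodes small: subtrees whose summaries score poorly are never expanded, so the cost depends on beam width and tree depth rather than on the total number of nodes.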
Query Language and Stack
ReasonDB uses RQL, an SQL-like query language:
SELECT * FROM contracts SEARCH 'payment terms' REASON 'What are the late payment penalties?' LIMIT 5;
Here, SEARCH = BM25 and REASON = LLM-guided tree traversal.
The stack is built in Rust using redb, tantivy, axum, and tokio. It ships as a single binary and works with OpenAI, Anthropic, Gemini, Cohere, and OpenAI-compatible APIs, including local endpoints.
Practical Applications
This approach is particularly useful for developers who have struggled with RAG retrieval quality or who want to try structure-based retrieval instead of pure vector search. The tool grew out of 3 years of knowledge-retrieval work at Brainfish, where vector DBs, graph DBs, and custom RAG pipelines all presented the same debugging challenges.
📖 Read the full source: r/LocalLLaMA