llmLibrarian: Local RAG Engine with MCP Integration for File-Based AI Search

✍️ OpenClawRadar📅 Published: March 30, 2026🔗 Source

What This Is

llmLibrarian is a local RAG (Retrieval-Augmented Generation) engine that exposes retrieval capabilities through the Model Context Protocol (MCP). It allows you to index folders into silos (ChromaDB collections), then query them from any MCP client—including Claude—to get grounded, cited answers.

Key Features and Architecture

The tool indexes folders into silos, which are ChromaDB collections. When you want direct answers instead of raw chunks, Ollama handles the synthesis layer. Everything runs locally on your machine.

The developer highlights the multi-silo capability as particularly powerful: combining silos allows patterns to surface across domains that would be difficult to catch manually. For example, a journal folder becomes a thinking partner that remembers what you've written, and a codebase becomes an agent that knows your actual files.

MCP Tools Exposed

retrieve — hybrid RRF vector search that returns raw chunks with confidence scores for Claude to reason over
retrieve_bulk — multi-angle queries in one call, useful when aggregating across document types
ask — Ollama-synthesized answer directly from retrieved context (defaults to llama3.1:8b, but you can swap in whatever model you have pulled)
list_silos, inspect_silo, trigger_reindex — index management tools

Technical Stack

ChromaDB for vector storage
Ollama for model synthesis
sentence-transformers (all-mpnet-base-v2, MPS-accelerated) for embeddings
fastmcp for the MCP layer

The developer mentions that the multi-silo metadata tagging in ChromaDB took several iterations to get right and is open to discussing the architecture.

This type of tool is useful for developers who want to build AI agents that can reference and reason over their local files without sending data to external services.

📖 Read the full source: r/LocalLLaMA

👀 See Also

Tools

Claude Code Container Provides Zero-Config Docker Isolation for Claude Code

Claude Code Container (ccc) is a free, open-source tool that automatically creates per-project Docker containers for Claude Code with full isolation and zero configuration. It forwards host environment variables, mounts SSH keys, provides transparent localhost proxy, and includes Chromium with chrome-devtools MCP pre-configured.

Mar 9, 2026, 01:45 PM UTC

OpenClawRadar

Tools

gui.new: Tool for Claude to Render Visual Output as Shareable Links

gui.new is a tool that lets Claude render visual output as live shareable links instead of returning code blocks. It's built with Claude, uses Next.js on Vercel with Supabase, and requires no signup.

Mar 11, 2026, 09:45 PM UTC

OpenClawRadar

Tools

SWE-CI: New Benchmark Tests AI Agents on Long-Term Code Maintenance via CI

SWE-CI is a repository-level benchmark that evaluates LLM-powered agents on maintaining codebases through continuous integration cycles, shifting focus from static bug fixing to long-term maintainability across 100 real-world tasks.

Mar 8, 2026, 01:45 PM UTC

OpenClawRadar

Tools

OpenGalatea MCP Server Connects Claude to Prusa 3D Printers

OpenGalatea is an open-source MCP server that enables Claude to control Prusa 3D printers via PrusaLink, allowing natural language commands to search Printables.com, slice models, and manage prints.

Mar 24, 2026, 02:45 PM UTC

OpenClawRadar