Engram Memory SDK: Graph-Based Memory for AI Agents with Local Models

Graph Memory SDK for Local AI Models
Engram Memory SDK is an open-source graph memory system designed for AI agents that works with local models through LiteLLM integration. The core architecture separates ingestion from recall: you only need the LLM once during ingestion to extract entities and relationships, while recall operates through pure vector search, graph traversal, and scoring without requiring additional LLM calls.
Technical Details
The SDK is built with async Python and uses Neo4j as its backend database. According to the source, it averages ~735 tokens per ingestion operation and achieves 95ms recall latency. The system includes self-restructuring memory features with decay and clustering running in the background.
Setup and Installation
Installation is straightforward:
pip install engram-memory-sdkConfiguration requires a .env file with these variables:
LLM_MODEL=ollama/llama3 # or any LiteLLM-supported local model
NEO4J_URI=bolt://localhost:7687The system supports any model via LiteLLM, including local deployments through Ollama, vLLM, and text-generation-webui. The key advantage is cost efficiency: with a small local model handling extraction, ongoing recall operations have literally $0 cost since they don't consume LLM tokens.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Best-Backup: A Free Tool for OpenClaw Server and Docker Container Backups
The free tool best-backup provides robust backup capabilities for OpenClaw servers, including full server backups, specific folder backups, and Docker container backups, with features like compression, encryption using existing SSH keys, and integration with Google Drive.

Markdown Manager: A Simple Markdown Editor for macOS
Markdown Manager is a free, open-source macOS app for managing Markdown files, featuring document conversion and preview capabilities.

Trepan: Local VS Code Security Auditor for AI-Generated Code
Trepan is an open-source VS Code extension that acts as a security gatekeeper for AI-generated code suggestions. It uses Ollama to run local security audits against project-specific rules in a .trepan/system_rules.md file.

Chromeflow: Chrome Extension Automates Web UI Tasks for Claude
Chromeflow is a free, open-source Chrome extension and MCP server built with Claude Code that gives Claude browser control to automate manual web UI tasks like setting up Stripe, Supabase, or SendGrid. It highlights elements to click, fills fields, clicks Save, and writes API keys directly to .env files.