ScreenMind: Local-First AI Memory That Indexes Your Entire Computer Activity

ScreenMind is a local-first AI memory system that continuously captures your screen, transcribes meetings, and indexes voice notes, building a persistent, searchable timeline of everything you do on your computer. It uses perceptual hashing to only trigger when content changes, then runs each frame through Gemma 4 E2B via llama.cpp for vision analysis, chat, and audio processing.
Key Features
- Screen capture with perceptual hashing — only stores frames when content actually changes
- Searchable timeline — query past activity: "that error message from earlier," "what was I working on at 3pm?"
- Chat with your history — persistent AI context from your entire session
- Meeting transcription — auto-detects Zoom, Teams, and Google Meet
- Voice memos — processed via Gemma 4's audio encoder
- Natural language automations — write them in plain English Markdown
- MCP integration — connect to Claude and Cursor
Technical Stack
- Models: Gemma 4 E2B (handles vision, chat, audio)
- Backend: Python + FastAPI
- Storage: SQLite
- Inference: llama.cpp with Q4 quantization
- Hardware: 4GB+ VRAM
The author notes that GPU scheduling between vision, chat, and audio tasks is the main inference optimization challenge. The project is still workflow-driven rather than fully autonomous — retrieval quality and onboarding friction are areas needing improvement.
GitHub: ayushh0110/ScreenMind
📖 Read the full source: r/LocalLLaMA
👀 See Also

Unlocking Proactivity: A Deep Dive into Clawbot Innovations from the Community
Discover how enthusiasts are enhancing their Clawbot's proactivity through inventive strategies and community-driven insights. A look at discussions and revelations from r/openclaw.

Skir: A Modern Alternative to Protocol Buffers for Type-Safe Data Exchange
Skir is a declarative language for defining data types, constants, and APIs that generates idiomatic, type-safe code in TypeScript, Python, Java, C++, Kotlin, and Dart from a single .skir file. It includes built-in schema evolution safety, RPC support similar to gRPC, and serialization to JSON or binary formats.

Cloken: A Chrome Extension That Shows Real-Time Claude Context Usage as a Percentage
Cloken is a free Chrome extension that displays your current Claude.ai chat context usage as a percentage — including messages, files, images, and system prompt.

Open Source MCP Server Connects Claude to Brazilian Central Bank Economic Data
Sidney Bissoli created bcb-br-mcp, an MIT-licensed MCP server that provides Claude access to 18,000+ time series from Brazil's Central Bank (SGS/BCB). The server includes 8 tools covering interest rates, inflation, exchange rates, GDP, employment, and credit data.