Bespoke AI v0.8.1: VS Code Autocomplete Extension for Code and Text

Bespoke AI v0.8.1 is a VS Code extension that provides autocomplete for both code and non-code text. The developer created it after finding no open-source autocomplete project that worked well for non-code text.
Key Details
The extension uses an existing Claude Code subscription through Anthropic's Agent SDK, giving users full-power autocomplete (including Opus) without incurring API charges. Alternatively, users can configure it to use another API backend of their choice, including Ollama.
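To make the Ollama option more concrete, here is a minimal sketch of how a completion request to a local Ollama server could be assembled. It is not taken from the Bespoke AI source: the helper name `buildOllamaRequest`, the 64-token default, and the model name in the usage note are assumptions for illustration. Only the `/api/generate` endpoint, its `model`, `prompt`, `stream`, and `options.num_predict` fields, and the default port 11434 come from Ollama's documented HTTP API.

```typescript
// Hypothetical helper, not part of the Bespoke AI codebase.
// It only builds the request; sending it is left to the caller.
interface CompletionRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

function buildOllamaRequest(
  textBeforeCursor: string,
  model: string,
  baseUrl: string = "http://localhost:11434", // Ollama's default local address
  maxTokens: number = 64, // assumed cap to keep suggestions short
): CompletionRequest {
  return {
    // Strip trailing slashes so the path is not doubled up.
    url: `${baseUrl.replace(/\/+$/, "")}/api/generate`,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({
        model,
        prompt: textBeforeCursor,
        stream: false, // ask for one JSON object instead of a token stream
        options: { num_predict: maxTokens },
      }),
    },
  };
}
```

A caller could then run `const req = buildOllamaRequest("Dear team, ", "llama3.2")` followed by `fetch(req.url, req.init)` and read the generated text from the `response` field of the returned JSON. The model name here is a placeholder for whatever model is installed locally.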
The extension is available on the VS Code Marketplace at https://marketplace.visualstudio.com/items?itemName=TrentMcNitt.bespoke-ai and the source code is hosted on GitHub at https://github.com/trentmcnitt/bespoke-ai-vscode-ext.
The developer is seeking testers to validate the extension on setups other than their own. Feedback can be provided through comments on the Reddit post, direct messages, or issues on the GitHub repository.
📖 Read the full source: r/ClaudeAI