Offline Voice-to-Text Tool for macOS Using Local Whisper via MLX

A developer has created whisper-dictate, a macOS tool that enables fully offline voice-to-text transcription with real-time translation capabilities. The tool uses OpenAI's Whisper running locally through MLX on Apple Silicon, with no data ever leaving your machine.
How It Works
The workflow is straightforward: hold the fn key, speak, and release. The text is transcribed and pasted directly where you're typing. The tool works in Slack, VS Code, browsers, email, or any other text field. A floating "Listening..." overlay provides visual feedback during recording.
Technical Details
- Transcription takes about 500ms after you stop speaking
- Uses Whisper running locally through MLX on Apple Silicon
- The small model is solid for everyday use
- large-v3-turbo model provides near-perfect accuracy
- 100% offline operation - no accounts, tokens, or data leaving your machine
Translation Feature
Whisper can translate on the fly without extra models or translation APIs. The translation is built into Whisper's decode step. For example, speaking French outputs English text. You set the output language to English, and it handles translation natively.
Setup and Configuration
On launch, the tool asks what language you'll be speaking and what you want the output in, allowing easy switching between pure transcription and translation. The entire setup requires no internet connection once installed.
Availability
The project is open-sourced on GitHub for others to use and adapt to their workflows.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Distillery: A Claude Code Plugin for Persistent Team Context
Distillery is a plugin for Claude Code that provides teams with shared, persistent context across sessions and people. Version 0.2.0 adds hybrid search, auth audit logging, and uv support.

Claude Desktop + Blender via MCP: Real-Time 3D Workflow Closes the Feedback Loop
An open-source Blender add-on runs an MCP server inside Blender, letting Claude Desktop inspect scenes, create objects, render images, and read results—closing the script-paste feedback loop.

Jentic Mini: Self-Hosted API and Action Execution Layer for OpenClaw
Jentic Mini is a self-hosted API and action execution layer that sits between AI agents and external APIs, storing credentials in an encrypted vault and providing scoped toolkits with individually revocable keys. It automatically imports 10,000+ OpenAPI specs and Arazzo workflow sources when credentials are added.

Analysis of Ollama's Reusable Go Components for Local LLM Development
A developer examined Ollama's source code and found several standalone Go components including a pure Go token sampler, GGUF reader/writer, model conversion tools, chat template rendering, and OpenAI compatibility transforms that aren't available as separate libraries.