Financial Modeler Builds Local Speech-to-Tool Desktop App with Claude Code

A developer without professional coding experience has built a fully functional local speech-to-text desktop application using Claude Code. The developer, who works primarily with Excel and SQL in financial modeling, created the tool to replace a cloud-based subscription service.
What Was Built
The application is called Sotto — a local speech-to-text tool for Windows that runs Whisper on your GPU. The developer was motivated by dissatisfaction with cloud-based dictation services where voice data leaves the machine, despite having capable hardware (specifically mentioning a 4070 Ti GPU).
Key Features
- System-wide hotkey accessible from any application
- Auto-stops recording when you stop talking
- Second hotkey for longer voice notes that export to markdown format (compatible with Obsidian)
- Settings UI with system tray integration
- Visual waveform indicator while listening
- Automatic GPU detection and model selection
- Text insertion at cursor position
Technical Details
The application consists of approximately 2,200 lines of Python code across 17 files. According to the developer, Claude Code wrote "the vast majority" of the implementation, including:
- Threading logic
- Windows API integration
- Qt-based user interface
The developer handled requirements definition, testing, bug identification, and feature prioritization, while relying on Claude Code for the actual implementation of technical components they didn't know how to build.
Availability
Sotto is available under MIT license with the following requirements:
- Windows operating system
- Python 3.10+
- GPU recommended but not required
The developer mentions a Mac version is "coming soon" because they recently purchased a MacBook and want to use the tool there. The project is hosted on GitHub at https://github.com/mrobison12-oss/sotto.
📖 Read the full source: r/ClaudeAI
👀 See Also

OpenClaw Configurations That Last: Less Complexity, More Reliability
Analysis of 40-50 OpenClaw setups shows that sustainable configurations use 1 agent, 3-5 skills, Sonnet model, and focus on mundane tasks like calendar management and email triage, while complex multi-agent systems with 20+ skills typically fail within 3 weeks.

User Deploys AI Assistant 'Elvis' on VPS for Project Management Tasks
A developer spent three days setting up an AI assistant called 'Elvis' on a Linux VPS, integrating it with Microsoft Teams, JIRA, Asana, and email systems. The assistant successfully handled project management tasks including reading emails, extracting requirements, updating change requests, and coordinating with team members.

Using Claude Opus 4 for AI Orchestration on Limited Hardware
Exploring Claude Opus 4 as a reasoning engine on a 2014 Mac Mini, leveraging the Claude API for handling complex orchestration tasks.

Automating a Daily AI News Podcast with Claude Code and Three AI Agents
A developer built a fully automated podcast pipeline using Claude Code to orchestrate three specialized AI agents that curate AI news, write narration scripts, fact-check content, and generate audio with voice cloning. The system publishes daily episodes with minimal manual intervention.