Building a Voice Interface for OpenClaw Agents Using iPhone Shortcuts

A developer on r/openclaw shared their setup for creating a voice interface similar to Siri for OpenClaw agents. The system combines a local Python server with iPhone Shortcuts to enable voice interaction with OpenClaw agents.
System Architecture
The setup requires enabling OpenAI HTTP mode on the OpenClaw gateway and LAN. The core components are:
- Python Server: Originally a script that listened for keywords via microphone, performed speech-to-text, sent text to OpenClaw API, received responses, and performed text-to-speech using the user's voice. This was adapted into a basic server with an endpoint that can receive text from anywhere, send it to OpenClaw, and return the response.
- iPhone Shortcut: Handles speech-to-text and text-to-speech locally on the iPhone. The shortcut workflow includes:
- Dictate text (records voice to text)
- Get contents of URL: url/ask with dictated text in body (sends text to be routed to OpenClaw agent for response)
- Dictionary: Get value for reply in contents of URL (store response text)
- Speak: dictionary value (text-to-speech output)
Implementation Details
The developer runs this through WireGuard and operates entirely on LAN or through VPN when outside the local network. They emphasize a critical security consideration: "Be careful opening an endpoint for your OpenClaw agent to respond through. It can allow anyone to access your agent (computer). Use auth token."
The approach offloads speech processing to the iPhone while keeping the OpenClaw agent interaction centralized through the Python server endpoint. This allows for voice interaction with OpenClaw agents from anywhere while maintaining security through VPN and authentication tokens.
📖 Read the full source: r/openclaw
👀 See Also
Claude Artifacts as a Presentation Builder: Full Context + Brand Assets
Use Claude with codebase context, browser access to branding (Brandfetch), and popular design libraries to generate a vanilla HTML/JS/CSS presentation via Artifacts — producing a sleek, remixable deck without Google Slides or PowerPoint.

Patient uses Claude AI to interpret medical data and navigate brain cancer treatment
A 27-year-old patient with primary mediastinal B-cell lymphoma with CNS involvement uses Claude AI daily to interpret immunohistochemistry panels, analyze PET-CT scan results, evaluate CAR-T clinical trial data, understand drug mechanisms, and prepare questions for medical team.

Solo Founder Builds Demo Video with Claude Code and Remotion
A solo developer used Claude Code and Remotion to create a product demo video in a weekend for $0, overcoming a launch delay caused by lack of design skills and budget constraints.

Startup Founder Uses AI Agents for Customer Support and Competitor Research
A startup founder automated customer support by connecting an AI agent to documentation, reducing daily time from 2 hours to 20 minutes, and set up weekly competitor research summaries delivered to Slack.