Apfel: Free CLI Tool to Access Apple's On-Device LLM on macOS

What Apfel Does
Apfel is a free tool that gives you direct access to the LLM Apple ships with macOS 26 (Tahoe) on Apple Silicon Macs. Apple normally restricts this model to Siri and system features, but Apfel exposes it through three interfaces: a UNIX command-line tool, an OpenAI-compatible HTTP server, and an interactive chat.
Technical Details
The tool is built in Swift 6.3 and wraps Apple's FoundationModels framework, specifically the LanguageModelSession API. All inference runs on the Neural Engine and GPU - no network calls, no cloud, and nothing leaves your machine.
Key specifications from the source:
- Version: v0.6.13
- Requirements: macOS 26+ (Tahoe), Apple Silicon, Apple Intelligence enabled
- Context window: 4,096 tokens (input and output combined)
- License: MIT
- Installation:
brew install Arthur-Ficial/tap/apfel
Three Usage Modes
1. CLI Tool
Pipe-friendly UNIX tool with stdin/stdout support, JSON output, file attachments, and proper exit codes:
$ apfel "What is the capital of Austria?"
The capital of Austria is Vienna.
$ apfel -o json "Translate to German: hello" | jq .content
"Hallo"
2. OpenAI-Compatible Server
Drop-in replacement at localhost:11434 that works with any OpenAI SDK:
$ apfel --serve
Server running on http://127.0.0.1:11434
any OpenAI client works
$ curl localhost:11434/v1/chat/completions
Supports streaming (SSE), tool calling, CORS, response formats, temperature, max_tokens, and seed parameters.
3. Interactive Chat
Multi-turn conversations with automatic context management and five trimming strategies:
$ apfel --chat -s "You are a coding assistant"
Chat started. Type /quit to exit.
> How do I reverse a list in Python?What Apfel Adds Over Apple's Raw API
- Proper exit codes for shell scripting
- JSON output format
- File attachment support
- Five context trimming strategies for the 4,096-token window
- Real token counting via the SDK
- Conversion of OpenAI tool schemas to Apple's native Transcript.ToolDefinition format
Included Power Tools
The demo/ folder includes several shell scripts:
cmd: Natural language to shell command conversiononeliner: Generates pipe chains from plain Englishmac-narrator: Narrates system activity like a nature documentaryexplain: Explains commands, error messages, or code snippetswtd: Instant project orientation for any codebasegitsum: Summarizes recent git commits
Who This Is For
Developers who want to experiment with Apple's on-device LLM without writing Swift applications or paying for cloud API calls.
📖 Read the full source: HN AI Agents
👀 See Also

TradesMCP: Open Source MCP Server for Contractor License Verification and Construction Data
TradesMCP is an open source Model Context Protocol server that provides Claude with access to real contractor license data, building permits, material pricing, and labor rates. The tool correctly verified an active California contractor license where ChatGPT returned incorrect information.

Skynet: Multi-Agent Collaboration Network for Claude Code Agents
Skynet is an open-source network that enables role-based collaboration between multiple Claude Code agents and humans. It's installed as a skill using npx and managed through natural language commands.

MOOSE-Star: A 7B Model and 108K-Paper Dataset for Scientific Hypothesis Discovery – ICML 2026
MiroMind releases MOOSE-Star on Hugging Face: a 7B model (DeepSeek-R1-Distill-Qwen-7B fine-tune) for scientific hypothesis discovery, alongside the 108K-paper TOMATO-Star dataset. Benchmark shows MS-7B achieves 54.34% inspiration retrieval accuracy, beating GPT-5.4 and approaching Gemini-3 Pro.

Reverse-engineered Claude Code SDK released in four languages
A developer has reverse-engineered Claude Code and created single-file SDKs in Node.js, Python, Go, and Rust with zero dependencies. The tools provide full agent loop with streaming and tool use while using existing Claude Pro/Max subscriptions.