Apfel: Free CLI Tool to Access Apple's On-Device LLM on macOS

✍️ OpenClawRadar📅 Published: April 7, 2026🔗 Source
Apfel: Free CLI Tool to Access Apple's On-Device LLM on macOS
Ad

What Apfel Does

Apfel is a free tool that gives you direct access to the LLM Apple ships with macOS 26 (Tahoe) on Apple Silicon Macs. Apple normally restricts this model to Siri and system features, but Apfel exposes it through three interfaces: a UNIX command-line tool, an OpenAI-compatible HTTP server, and an interactive chat.

Technical Details

The tool is built in Swift 6.3 and wraps Apple's FoundationModels framework, specifically the LanguageModelSession API. All inference runs on the Neural Engine and GPU - no network calls, no cloud, and nothing leaves your machine.

Key specifications from the source:

  • Version: v0.6.13
  • Requirements: macOS 26+ (Tahoe), Apple Silicon, Apple Intelligence enabled
  • Context window: 4,096 tokens (input and output combined)
  • License: MIT
  • Installation: brew install Arthur-Ficial/tap/apfel

Three Usage Modes

1. CLI Tool

Pipe-friendly UNIX tool with stdin/stdout support, JSON output, file attachments, and proper exit codes:

$ apfel "What is the capital of Austria?"
The capital of Austria is Vienna.

$ apfel -o json "Translate to German: hello" | jq .content "Hallo"

2. OpenAI-Compatible Server

Drop-in replacement at localhost:11434 that works with any OpenAI SDK:

$ apfel --serve
Server running on http://127.0.0.1:11434

any OpenAI client works

$ curl localhost:11434/v1/chat/completions

Supports streaming (SSE), tool calling, CORS, response formats, temperature, max_tokens, and seed parameters.

3. Interactive Chat

Multi-turn conversations with automatic context management and five trimming strategies:

$ apfel --chat -s "You are a coding assistant"
Chat started. Type /quit to exit.
> How do I reverse a list in Python?
Ad

What Apfel Adds Over Apple's Raw API

  • Proper exit codes for shell scripting
  • JSON output format
  • File attachment support
  • Five context trimming strategies for the 4,096-token window
  • Real token counting via the SDK
  • Conversion of OpenAI tool schemas to Apple's native Transcript.ToolDefinition format

Included Power Tools

The demo/ folder includes several shell scripts:

  • cmd: Natural language to shell command conversion
  • oneliner: Generates pipe chains from plain English
  • mac-narrator: Narrates system activity like a nature documentary
  • explain: Explains commands, error messages, or code snippets
  • wtd: Instant project orientation for any codebase
  • gitsum: Summarizes recent git commits

Who This Is For

Developers who want to experiment with Apple's on-device LLM without writing Swift applications or paying for cloud API calls.

📖 Read the full source: HN AI Agents

Ad

👀 See Also

TradesMCP: Open Source MCP Server for Contractor License Verification and Construction Data
Tools

TradesMCP: Open Source MCP Server for Contractor License Verification and Construction Data

TradesMCP is an open source Model Context Protocol server that provides Claude with access to real contractor license data, building permits, material pricing, and labor rates. The tool correctly verified an active California contractor license where ChatGPT returned incorrect information.

OpenClawRadar
Skynet: Multi-Agent Collaboration Network for Claude Code Agents
Tools

Skynet: Multi-Agent Collaboration Network for Claude Code Agents

Skynet is an open-source network that enables role-based collaboration between multiple Claude Code agents and humans. It's installed as a skill using npx and managed through natural language commands.

OpenClawRadar
MOOSE-Star: A 7B Model and 108K-Paper Dataset for Scientific Hypothesis Discovery – ICML 2026
Tools

MOOSE-Star: A 7B Model and 108K-Paper Dataset for Scientific Hypothesis Discovery – ICML 2026

MiroMind releases MOOSE-Star on Hugging Face: a 7B model (DeepSeek-R1-Distill-Qwen-7B fine-tune) for scientific hypothesis discovery, alongside the 108K-paper TOMATO-Star dataset. Benchmark shows MS-7B achieves 54.34% inspiration retrieval accuracy, beating GPT-5.4 and approaching Gemini-3 Pro.

OpenClawRadar
Reverse-engineered Claude Code SDK released in four languages
Tools

Reverse-engineered Claude Code SDK released in four languages

A developer has reverse-engineered Claude Code and created single-file SDKs in Node.js, Python, Go, and Rust with zero dependencies. The tools provide full agent loop with streaming and tool use while using existing Claude Pro/Max subscriptions.

OpenClawRadar