Fine-tuning llama3.2 3B for personalized health coaching using Apple Watch data and MLX

✍️ OpenClawRadar📅 Published: March 2, 2026🔗 Source
Fine-tuning llama3.2 3B for personalized health coaching using Apple Watch data and MLX
Ad

A developer created a personalized health coach LLM by fine-tuning llama3.2 3B on a Mac using Apple Health and Whoop data. The entire fine-tuning process took approximately 15 minutes using MLX.

Technical pipeline

The implementation follows this workflow:

  • Apple Health and Whoop data stored in local SQLite database
  • SQL RAG layer converts natural language queries to SQL
  • Claude API used once to generate ~270 gold-standard training examples (anonymized question/SQL/result pairs, no personal health data sent)
  • LoRA fine-tuning on llama3.2 3B via MLX
  • Fused model served locally at 127.0.0.1:8080

Before vs. after fine-tuning

The source provides concrete examples of the improvement:

Before fine-tuning: "Your HRV is an important measure of autonomic nervous system function..." [500 words of generic advice]

After fine-tuning: "Your HRV averaged 68ms this week, down 12% from last week's 77ms. Coincides with 3 nights under 7 hours sleep. Consider reducing training intensity for 48 hours."

Ad

Memory footprint and hardware

  • Model (4-bit): ~2 GB
  • LoRA adapter: ~50 MB
  • Training memory: ~4-5 GB total
  • Runs on M-series Mac, no GPU needed

The developer mentions including technical details on SQL hallucination guardrails, cross-metric context enrichment, and the training pipeline in their full writeup. They also offer to answer questions about the MLX setup or RAG layer implementation.

📖 Read the full source: r/LocalLLaMA

Ad

👀 See Also

Running Claude with Qwen 3.5 as a persistent agent on Mac Mini reveals human bottleneck
Use Cases

Running Claude with Qwen 3.5 as a persistent agent on Mac Mini reveals human bottleneck

A developer runs Claude with Qwen 3.5 as a persistent agent on a dedicated Mac Mini, handling product creation, project management, analytics, newsletter support, and 3,000 WizBoard tasks. The agent created 16 products in two months, revealing that the bottleneck shifted from production capacity to human approval and decision-making.

OpenClawRadar
OpenClaw Assistant Setup: Model Stack, Use Cases, and Agent Orchestration
Use Cases

OpenClaw Assistant Setup: Model Stack, Use Cases, and Agent Orchestration

An OpenClaw assistant shares their two-week setup using GPT-5.4 with Codex Pro plan ($219/month cap) plus Claude Code via CLI, detailing three core workflows: contract triage, BI data visualization via Metabase API, and project management in Linear.

OpenClawRadar
Local Multi-Agent Setup with vLLM, Claude Code, and gpt-oss-120b on Linux
Use Cases

Local Multi-Agent Setup with vLLM, Claude Code, and gpt-oss-120b on Linux

A developer created a 100% local parallel multi-agent setup using vLLM in Docker, Claude Code for orchestration pointing to localhost, and gpt-oss-120b as a coding agent on an RTX Pro 6000 Blackwell MaxQ GPU with dual-boot Ubuntu, achieving 8 agents working concurrently.

OpenClawRadar
Case Study: Building a Full-Stack Web App with Claude in Six Weeks
Use Cases

Case Study: Building a Full-Stack Web App with Claude in Six Weeks

A 19-year-old developer from Nepal used Claude to build and ship Somnia, a dream journal web app with 100 users and 7 paying customers in six weeks. The workflow involved treating Claude like a junior developer with tight task scoping and clear acceptance criteria.

OpenClawRadar