Qwen2-0.5B Fine-Tuned for Local Task Automation with llama.cpp

A developer has fine-tuned Qwen2-0.5B for task automation, creating a model that runs entirely locally on CPU without requiring GPU or cloud APIs. The project, named ACE, is available on GitHub.
What It Does
- Takes natural language tasks (e.g., "copy logs to backup")
- Detects task type: atomic, repetitive, or clarification
- Generates execution plans consisting of CLI commands and hotkeys
- Runs entirely locally on CPU (no GPU, no cloud APIs)
Technical Details
- Base model: Qwen2-0.5B
- Training: LoRA fine-tuning on approximately 1000 custom task examples
- Quantization: GGUF Q4_K_M format (300MB file size)
- Inference: llama.cpp
- Inference time: 3-10 seconds on i3/i5 processors
Main Challenges During Training
- Data quality: Had to regenerate dataset 2-3 times due to garbage examples
- Overfitting: Took multiple iterations to get validation loss stable
- EOS token handling: Model wouldn't stop generating until tokenizer config was fixed
- GGUF conversion: Required BF16 dtype + imatrix quantization to get stable outputs
Limitations (v0.1)
- Requires full file paths (no smart file search yet)
- CPU inference only (slower on older hardware)
- Basic execution (no visual understanding)
Performance Benchmarks
- i5 (2018+) with SSD: 3-5 seconds
- i3 (2015+) with SSD: 5-10 seconds
- Older hardware (Pentium + HDD): 30-90 seconds
The developer is seeking feedback on performance across different hardware, edge cases that break the model, and feature requests for v0.2.
📖 Read the full source: r/LocalLLaMA
👀 See Also

MCP server connects AI agents to existing Chrome sessions with cookies and auth
@playwright-repl/mcp is an MCP server that connects AI agents to your existing Chrome browser via the Dramaturg extension, providing access to your real browser session including cookies and authentication. It supports full Playwright JavaScript, assertions, and works with Claude Desktop, Claude Code, Cursor, or any MCP client.

AI Team OS: Self-Driving Organization Layer for Claude Code
AI Team OS is an operating system layer for Claude Code that creates autonomous workflows where the system continues working without waiting for user prompts. It includes 22 specialized agent templates, 40+ MCP tools, and runs entirely within Claude Code subscription without external API costs.

Clawforce: Open-Source Control Plane for Managing Clawbot Agent Teams
Clawforce is an open-source control plane for managing Clawbot agent teams that allows deployment in a few clicks. It provides configuration of characters, skills, MCP integrations, and tools through a UI, with agents capable of planning, coordinating, and executing tasks collaboratively.
Claude Code vs Codex: 36 vs 28 files, $2.50 vs $2.04, infinite loop caught — real-world comparison
A developer runs the same two tasks on Claude Code and Codex (Cursor): PR triage bot and real-time code review UI. Results: 36 vs 28 files, $2.50 vs $2.04 cost, Claude produced fewer TypeScript errors, Codex had an infinite React loop.