Multi-Agent Career Mentor Built with Ollama and MCP for Local AI

A developer has built a multi-agent AI system called "AI Career Mentor" that reads a resume and produces a comprehensive career intelligence report. The system runs entirely locally using Ollama with llama3, so it needs no API keys and incurs no usage costs.
Technical Architecture
The system consists of five specialized agents that chain their outputs:
- Each agent receives the previous agent's output as shared context
- The roadmap agent knows your skill gaps from the analysis agent
- The salary agent knows your roadmap from the previous agent
- Because of this chaining, each stage of the report builds on the findings of the stages before it
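The chaining described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names, the dictionary-based context, and the placeholder agent outputs are all assumptions, and a real agent would build a prompt from the context and send it to the local model instead of returning a fixed string.

```python
from typing import Callable, Dict, List, Tuple

# An agent takes the resume text plus everything earlier agents produced
# and returns its own section of the report.
Agent = Callable[[str, Dict[str, str]], str]


def run_pipeline(resume_text: str, agents: List[Tuple[str, Agent]]) -> Dict[str, str]:
    """Run agents in order, passing each one the outputs accumulated so far."""
    context: Dict[str, str] = {}
    for name, agent in agents:
        # Pass a copy so an agent cannot mutate earlier results.
        context[name] = agent(resume_text, dict(context))
    return context


# Placeholder agents (hypothetical): the returned strings are made-up
# stand-ins for model output.
def analysis_agent(resume: str, context: Dict[str, str]) -> str:
    return "skill gaps: Docker, SQL"


def roadmap_agent(resume: str, context: Dict[str, str]) -> str:
    return f"roadmap addressing [{context['analysis']}]"


def salary_agent(resume: str, context: Dict[str, str]) -> str:
    return f"salary strategy based on [{context['roadmap']}]"


report = run_pipeline(
    "example resume text",
    [("analysis", analysis_agent), ("roadmap", roadmap_agent), ("salary", salary_agent)],
)
print(report["salary"])
```

Keeping the context as a plain dictionary keyed by agent name means a later agent can read any earlier section, not just the one immediately before it.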
Tech Stack Details
- AI Engine: Ollama + llama3 (100% local)
- RAG System: FAISS + SentenceTransformers for indexing your knowledge base
- Tool Layer: MCP (Model Context Protocol) - FastAPI spawns the MCP server as a subprocess and communicates via stdio JSON-RPC
- Resume Processing: pdfplumber for reading PDF resumes
- Frontend: React
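To make the stdio JSON-RPC link in the tool layer more concrete, here is a small sketch of how a single MCP tool call could be framed before being written to the server subprocess's stdin. It uses only the standard library; the tool name `analyze_resume` and its argument are hypothetical, and a real session would also need the protocol's initialization handshake before any tool call, which is omitted here.

```python
import json
from typing import Any, Dict


def build_tool_call(request_id: int, tool_name: str, arguments: Dict[str, Any]) -> bytes:
    """Encode one MCP `tools/call` request as a newline-terminated JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    # The stdio transport carries one JSON message per line; json.dumps
    # escapes newlines inside strings, so the frame stays on a single line.
    return (json.dumps(message) + "\n").encode("utf-8")


def parse_message(line: bytes) -> Dict[str, Any]:
    """Decode one line read from the other side of the pipe."""
    return json.loads(line.decode("utf-8"))


# Hypothetical tool name and arguments, for illustration only.
frame = build_tool_call(1, "analyze_resume", {"resume_text": "line one\nline two"})
print(frame)
```

In the setup described here, the FastAPI process would write such frames to the subprocess and match each response to its request by `id`.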
MCP Implementation Notes
The developer found MCP particularly interesting to build with. MCP is Anthropic's open standard for connecting AI models to tools, so a single server can work with any compatible client. The system also connects to Claude Desktop through a configuration file, which lets Claude call all 9 tools directly.
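For reference, a Claude Desktop entry for a local MCP server generally takes the shape generated below. This is a sketch under assumptions: the server name `career-mentor` and the script path are placeholders, not values from the project, and the snippet only prints the JSON rather than writing to the real configuration file.

```python
import json

# Hypothetical entry: the server name and script path are placeholders.
config = {
    "mcpServers": {
        "career-mentor": {
            "command": "python",
            "args": ["/path/to/mcp_server.py"],
        }
    }
}

# Print the JSON that would go into claude_desktop_config.json.
config_json = json.dumps(config, indent=2)
print(config_json)
```

Claude Desktop launches the listed command itself, so the same server script can be used by both the FastAPI backend and the desktop client.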
One notable bug: MCP SDK v1.x changed the handler signatures completely. Old-style handlers receive a full request object, while new-style handlers receive the tool name and arguments directly. Tracking this down took significant debugging time.
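The difference between the two handler shapes can be illustrated with plain Python stand-ins. None of the classes below come from the MCP SDK; `ToolRequest`, `ToolParams`, and the adapter are hypothetical names used only to show why code written for one signature fails when called with the other. Real SDK handlers are also asynchronous, which this sketch leaves out.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict


@dataclass
class ToolParams:
    name: str
    arguments: Dict[str, Any] = field(default_factory=dict)


@dataclass
class ToolRequest:
    """Hypothetical stand-in for a full request object."""
    params: ToolParams


def old_style_handler(request: ToolRequest) -> str:
    # Old shape: dig the tool name and arguments out of the request.
    return f"{request.params.name}({sorted(request.params.arguments)})"


def new_style_handler(name: str, arguments: Dict[str, Any]) -> str:
    # New shape: the name and arguments arrive already unpacked.
    return f"{name}({sorted(arguments)})"


def adapt_old_handler(
    handler: Callable[[ToolRequest], str],
) -> Callable[[str, Dict[str, Any]], str]:
    """Wrap an old-style handler so it can be called with (name, arguments)."""
    def wrapper(name: str, arguments: Dict[str, Any]) -> str:
        return handler(ToolRequest(ToolParams(name, arguments or {})))
    return wrapper


adapted = adapt_old_handler(old_style_handler)
result = adapted("analyze_resume", {"resume_text": "example"})
print(result)
```

Calling `old_style_handler("analyze_resume", {...})` directly would raise a `TypeError`, which is the kind of failure a signature change like this tends to produce.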
Report Outputs
The system generates a complete career intelligence report including:
- Resume analysis
- Skill gap identification
- 6-month roadmap
- Salary strategy
- Interview preparation
All components are generated in a single run once the resume has been processed.
Resources
The project is available on GitHub with a video walkthrough. The developer notes that the RAG setup and MCP client/server wiring were the trickiest parts to implement.
📖 Read the full source: r/LocalLLaMA