Building an Autonomous Research Agent with C# and Local LLMs

Here's a look at a new autonomous research agent built in C# utilizing local LLMs, specifically Ollama along with the llama3.1:8b model. The agent automates the process of URL handling by generating search queries, conducting web searches through the Brave Search API, and extracting relevant data, all culminating in a structured markdown report.
Key Details
- The agent accepts a topic input, for instance, "persistent memory for AI agents".
- It autonomously formulates 5-8 search queries.
- Searches are executed via the Brave Search API, and the top sources are fetched and analyzed.
- The agent reads through 8-12 sources and extracts 5-8 key findings.
- All data processing is executed locally using the Ollama (llama3.1:8b) model without reliance on OpenAI/Anthropic APIs.
- The output is a markdown report complete with citations.
Performance and Architecture
The current setup runs on a Ryzen 5 5500, CPU-only, with 16GB RAM, taking approximately 15 minutes per research cycle. The developer notes that 3B models, such as llama3.2, are inadequate for tool calling, thereby making 8B a necessary minimum for reliable performance.
Key challenges include the need for findings truncation before synthesis to prevent the model from stalling on lengthy contexts, and occasional malformed tool calls even with 8B models, resolved by retrying with altered prompts. The agent utilizes SQLite paired with embeddings for managing memory at a personal scale, obviating the need for a vector database.
Technology Stack
- C# / .NET 8
- Ollama
- SQLite
- Brave Search API (free tier)
For developers interested in building their own agent, there's a starter kit and an 8-chapter guide available on the project's GitHub repository, provided under the MIT license, along with the full source code: hex-dynamics.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Statespace: Build Interactive Web Apps for OpenClaw Agents with Markdown
Statespace is a free, open-source framework for building and sharing AI-friendly web apps that OpenClaw agents can navigate and interact with using pure Markdown. It allows you to define tools, components, and instructions in Markdown files that agents access over HTTP.

Claude Code Plan Mode Reduces Redo Rate from 40% to Near Zero
A developer tracked 30+ coding sessions with Claude Code and found that skipping Plan Mode resulted in redoing tasks from scratch 40% of the time. With Plan Mode, the redo rate dropped to basically zero, with one feature taking 17 minutes total versus 35+ minutes without planning.

Sonicker: Voice Cloning Web App Built with Claude Code in 4 Days
Sonicker is a voice cloning web app that requires only 3 seconds of audio input and supports 10 languages. The developer built it solo in 4 days using Claude Code for the entire frontend, API integration, and deployment.

Building a voice-controlled multi-agent system on top of Claude Code
A developer built a wake-word-activated voice loop for Claude Code that spawns sub-agents, parallelizes work, and auto-QAs results. Full technical breakdown including speaker verification and PID watcher.