OpenEvol: Offline Self-Improvement Pipeline for LLMs Using Conversation History

✍️ OpenClawRadar📅 Published: March 31, 2026🔗 Source
OpenEvol: Offline Self-Improvement Pipeline for LLMs Using Conversation History
Ad

What OpenEvol Does

OpenEvol is an offline self-improvement pipeline for large language models that automatically converts AI conversation history into training data. The tool mines high-value exchanges from conversations, judges their quality, and generates fine-tuning datasets without manual labeling or proprietary data flywheels.

How It Works

The pipeline runs through four automated stages:

  • Mine high-value exchanges from conversations
  • Judge quality using rules with an optional teacher LLM
  • Synthesize SFT, preference, and pretraining datasets
  • Fine-tune with one command

This creates a closed loop where the model learns from its own experience.

Technical Details

No GPU is required to get started - the full pipeline runs on CPU with a mock or OpenAI-compatible teacher backend. You can bring a GPU when ready to train.

Five teacher backends are supported:

  • Mock
  • Rule-based
  • OpenAI-compatible API (any local proxy works)
  • HuggingFace Transformers
  • vLLM
Ad

Usage Options

Three ways to use OpenEvol:

  • CLI for offline batch runs
  • REST API server for automation
  • OpenClaw desktop plugin that lets you trigger pipeline runs directly from chat

Quality Control

Every batch is automatically scored. If the approval rate drops below 80%, training is blocked and flagged for human review, giving users control over what data gets used for training.

This type of tool is useful for developers who want to improve their AI coding agents using actual conversation history without sending data to external services.

📖 Read the full source: r/openclaw

Ad

👀 See Also

Superglue CLI: Let AI Agents Execute API Calls Without Pre-Built Tools
Tools

Superglue CLI: Let AI Agents Execute API Calls Without Pre-Built Tools

Superglue CLI provides a skill that teaches AI coding agents how to use its commands, handle authentication, build tools, and debug failures. Instead of creating pre-built tools for every API integration, agents can read API specs at runtime and plan multi-step calls.

OpenClawRadar
Karpathy Coding Skill Rewritten for Free Plan, Unlocks Claude Coding Discipline Without Pro
Tools

Karpathy Coding Skill Rewritten for Free Plan, Unlocks Claude Coding Discipline Without Pro

A Reddit user rewrote Karpathy's coding discipline guidelines for Claude's free plan, removing terminal and subagent dependencies. The system prompt auto-triggers on coding requests and enforces verification-first thinking.

OpenClawRadar
Local RAG Tool Built with Nemotron Nano 9B v2 and vLLM Tool Calling
Tools

Local RAG Tool Built with Nemotron Nano 9B v2 and vLLM Tool Calling

A developer built a local-first RAG research tool that runs entirely on a single GPU using Nemotron Nano 9B v2 Japanese on vLLM with custom parser plugins for tool calling. The system features a two-step extract-execute flow with bilingual keyword extraction and parallel FTS5/DuckDuckGo search.

OpenClawRadar
MAGELLAN: A 15-Agent Autonomous Scientific Discovery System Built on Claude Code
Tools

MAGELLAN: A 15-Agent Autonomous Scientific Discovery System Built on Claude Code

MAGELLAN is a 15-agent autonomous scientific discovery system built entirely on Claude Code. It uses Opus for deep reasoning and Sonnet for structured tasks, generating cross-disciplinary hypotheses without human direction, with 260 hypotheses proposed and 60% killed by adversarial validation in 19 sessions.

OpenClawRadar