Open-source structural hallucination checker for AI agent pipelines

What it does
A structural hallucination checker specifically designed for AI agent pipelines. Unlike truth checking, this tool focuses on catching structural failures that break downstream tools.
Problem it solves
Most agent issues aren't factual errors but structural problems like:
- The model inventing a field in a JSON tool response
- Citing a source that wasn't in the retrieved set
- Prompt injection hidden inside retrieved content
- Claiming a tool returned something it didn't
Four suppressors
The tool includes four suppressors built in Claude Code that run as a single step before agent output reaches users:
grounding_enforcer- checks whether model output is actually supported by the sources you passed inprompt_suppressor- catches injection attempts in retrieved content and tool resultsjson_suppressor- validates structured tool responses against expected schemastool_response_suppressor- flags when a tool's claimed output doesn't match what it returned
Availability
The tool is available in two formats:
- A REST API
- An MCP server (works with Claude Desktop, Cursor, Windsurf, etc.)
Free tier offers 500 requests/month with no credit card required.
Source and documentation
GitHub repository: https://github.com/steveswain14/mcp-hallucination-suite
API and documentation: https://certifai.dev
📖 Read the full source: r/ClaudeAI
👀 See Also

Forge: Turn a Mac or Linux Machine into an Always-On Dev Host for AI Coding Agents
Forge is an open-source tool that installs a daemon to turn any Mac or Linux machine into a permanent, always-on development host. It keeps AI coding agents running when you walk away, provides a web dashboard for monitoring, and uses Tailscale for secure remote access via SSH.

OpenClaw User Critiques Tool's Architecture and Safety Gaps
A Reddit user describes OpenClaw as the only tool making agent automation this accessible but criticizes its architecture for lacking a control layer for file operations, a protected kernel, proper context management, and built-in versioning or tests.

Baton: A Desktop App for Managing Multiple AI Coding Agents
Baton is a desktop application that helps developers manage multiple AI coding agents across isolated workspaces. It provides real terminal sessions, git worktree isolation, and status monitoring for agents like Claude Code, Codex CLI, OpenCode, and Gemini CLI.

Claude Banana: A Claude Code plugin for image generation with design system awareness
Claude Banana is a Claude Code plugin that generates images using Google's Gemini API with context-aware prompt crafting. It reads Tailwind configs, CSS variables, design tokens, and existing assets to understand project visual styles.