Argus: A GitHub App That Reviews CLAUDE.md Files and Posts Scores on PRs

Argus is a GitHub App that automatically reviews CLAUDE.md files and posts a score on every pull request. The tool was built using Claude Code, which helped design the scoring standard and write the webhook handler.
What Argus Does
The app analyzes CLAUDE.md files against a scoring standard and posts feedback directly on GitHub pull requests. After running it against multiple repositories, the developer found that most CLAUDE.md files fail the same two checks:
- No explicit scope limits (what the AI agent cannot do)
- No escalation path (when the agent should stop and ask a human)
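To make the two checks concrete, here is a minimal sketch of how a reviewer might test a CLAUDE.md file for them. It is not Argus's implementation: the article does not publish the scoring standard, so the keyword patterns and the function names `check_claude_md` and `score` are assumptions chosen purely for illustration.

```python
import re

# Hypothetical keyword heuristics. Argus's actual scoring standard is not
# described in the source, so these patterns are illustrative assumptions.
SCOPE_PATTERNS = [
    r"\bdo not\b",
    r"\bnever\b",
    r"\bmust not\b",
    r"\bout of scope\b",
    r"\boff[- ]limits\b",
]
ESCALATION_PATTERNS = [
    r"\bstop and ask\b",
    r"\bask (a|the) (human|user|maintainer)\b",
    r"\bescalat",  # matches "escalate", "escalation", ...
]


def check_claude_md(text: str) -> dict[str, bool]:
    """Report whether the text appears to state scope limits and an escalation path."""
    lowered = text.lower()
    return {
        "scope_limits": any(re.search(p, lowered) for p in SCOPE_PATTERNS),
        "escalation_path": any(re.search(p, lowered) for p in ESCALATION_PATTERNS),
    }


def score(text: str) -> int:
    """Count how many of the two checks pass (0, 1, or 2)."""
    return sum(check_claude_md(text).values())


if __name__ == "__main__":
    sample = "Never edit files under infra/. If a migration fails, stop and ask a human."
    print(check_claude_md(sample), score(sample))
```

Keyword matching like this only detects whether the topics are mentioned at all. It cannot judge whether the stated limits are clear or complete, which a real scoring standard would presumably need to assess.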
Availability and Installation
Argus can be installed for free on public repositories at github.com/apps/argusreview. Once installed, it shows developers how their CLAUDE.md files score against its scoring standard.
CLAUDE.md files are configuration files that guide AI coding assistants such as Claude Code, providing instructions, constraints, and guidelines for how the AI should approach coding tasks. Clear scope limits help prevent the AI from making inappropriate changes, and a defined escalation path keeps it from getting stuck on tasks that require human intervention.
📖 Read the full source: r/ClaudeAI