agent-audit
Agent-audit validates Claude agent configurations across model selection, tool permissions, focus areas, and approach methodology using reference guidance on best practices. Use this skill when reviewing existing agents for correctness, auditing agent quality, improving agent designs, or learning how to structure agents effectively. It generates standardized reports identifying critical issues, important improvements, and specific recommendations based on an established checklist.
git clone --depth 1 https://github.com/majiayu000/claude-skill-registry /tmp/agent-audit && cp -r /tmp/agent-audit/skills/agent/agent-audit-philoserf-claude-code-setup ~/.claude/skills/agent-auditSKILL.md
## Reference Files Advanced agent validation guidance: - [model-selection.md](model-selection.md) - Model choice decision matrix, use cases, and appropriateness criteria - [tool-restrictions.md](tool-restrictions.md) - Tool permission patterns, security implications, and restriction fit - [focus-area-quality.md](focus-area-quality.md) - Focus area specificity assessment, quality scoring, and criteria - [approach-methodology.md](approach-methodology.md) - Approach completeness, required components, and methodology patterns - [resource-organization.md](resource-organization.md) - Resource directory validation and progressive disclosure - [examples.md](examples.md) - Good vs poor agent comparisons and full audit reports - [report-format.md](report-format.md) - Standardized audit report template and structure - [common-issues.md](common-issues.md) - Frequent problems, fixes, and troubleshooting patterns --- # Agent Auditor Validates agent configurations for model selection, tool restrictions, focus areas, and approach methodology. ## Quick Start **Basic audit workflow**: 1. Read agent file 2. Check model selection appropriateness 3. Validate tool restrictions 4. Assess focus area quality 5. Review approach methodology 6. Generate audit report **Example usage**: ```text User: "Audit my evaluator skill" → Reads skills/evaluator/SKILL.md → Validates model (Sonnet), tools, focus areas, approach → Generates report with findings and recommendations ``` ## Agent Audit Checklist ### Critical Issues Must be fixed for agent to function correctly: - [ ] **Valid YAML frontmatter** - Proper syntax, required fields present - [ ] **name field matches filename** - Name consistency - [ ] **model field present and valid** - Sonnet, Haiku, or Opus only - [ ] **At least 3 focus areas** - Minimum viable expertise definition - [ ] **Tool restrictions present** - allowed_tools or allowed-patterns specified - [ ] **No security vulnerabilities** - Tools don't expose dangerous capabilities ### Important Issues Should be fixed for optimal agent performance: - [ ] **Model matches complexity** - Haiku for simple, Sonnet default, Opus rare - [ ] **5-15 focus areas** - Not too few (vague) or too many (unfocused) - [ ] **Focus areas specific** - Concrete, not generic statements - [ ] **Tools match usage** - No missing or excessive permissions - [ ] **Approach section complete** - Methodology defined, output format specified - [ ] **File size reasonable** - <500 lines or uses progressive disclosure ### Nice-to-Have Improvements Polish for excellent agent quality: - [ ] **Model choice justified** - Clear reason for non-default model - [ ] **Focus areas have examples** - Technology/framework specificity - [ ] **Approach has decision frameworks** - If/then logic for complex tasks - [ ] **Tool restrictions documented** - Why specific tools are allowed/restricted - [ ] **Resource organization** - Uses references/ when needed, proper structure - [ ] **Context economy** - Concise without sacrificing clarity ## Audit Workflow ### Step 1: Read Agent File Identify the agent file to audit: ```bash # Single agent Read skills/evaluator/SKILL.md # Find all agents Glob agents/*.md ``` ### Step 2: Validate Model Selection **Check model field**: ```yaml model: sonnet # Good - default choice model: haiku # Check: Is agent simple enough? model: opus # Check: Is complexity justified? ``` **Decision criteria**: - **Haiku** (`haiku`): Simple read-only analysis, fast response needed, low cost priority - **Sonnet** (`sonnet`): Default for most agents, balanced cost/capability - **Opus** (`opus`): Complex reasoning required, highest capability needed **Common issues**: - Opus overuse: Using expensive model when Sonnet sufficient - Haiku underperformance: Too simple for task complexity - Missing model: No model field specified (defaults to Sonnet) See [model-selection.md](model-selection.md) for detailed decision matrix. ### Step 3: Validate Tool Restrictions **Check allowed_tools or allowed-patterns**: ```yaml allowed_tools: - Read - Grep - Glob - Bash ``` **Validation checklist**: 1. **Tools specified**: Has allowed_tools field (not unrestricted) 2. **Tools match usage**: All mentioned tools are allowed 3. **No missing tools**: All needed tools are included 4. **No excessive tools**: No unnecessary permissions 5. **Security implications**: No dangerous tool combinations **Common patterns**: - **Read-only analyzer**: [Read, Grep, Glob, Bash (read commands)] - **Code generator**: [Read, Write, Edit, Grep, Glob, Bash] - **Orchestrator**: [Task, Skill, Read, AskUserQuestion] See [tool-restrictions.md](tool-restrictions.md) for security analysis. ### Step 4: Assess Focus Area Quality **Target**: 5-15 focus areas that are specific, concrete, and comprehensive **Quality criteria**: **Specific vs Generic**: - ✗ Generic: "Python programming" - ✓ Specific: "FastAPI REST APIs with SQLAlchemy ORM" **Concrete vs Vague**: - ✗ Vague: "Best practices" - ✓ Concrete: "Defensive programming with strict error handling" **Coverage**: - Too few (<5): Expertise unclear or overly narrow - Sweet spot (5-15): Comprehensive, focused expertise - Too many (>15): Unfocused, trying to do everything **Example analysis**: ```markdown ## Focus Areas - Defensive programming with strict error handling ✓ - POSIX compliance and cross-platform portability ✓ - Safe argument parsing and input validation ✓ - Robust file operations and temporary resource management ✓ - Production-grade logging and error reporting ✓ ``` **Score**: 5/5 areas, all specific and concrete → GOOD See [focus-area-quality.md](focus-area-quality.md) for scoring methodology. ### Step 5: Review Approach Methodology **Check approach section completeness**: Required elements: - [ ] **Methodology defined** - Step-by-step process - [ ] **Decision frameworks** - How to handle different scenarios - [ ] **Output format** - What the agent produces - [ ]
Use when you need to install the embedded robot agents into either .cursor/agents or .claude/agents, selecting the destination interactively and copying the embedded agent definitions from project assets. This should trigger for requests such as Install embedded agents; Bootstrap .cursor/agents; Bootstrap .claude/agents; Copy robot agents. Part of cursor-rules-java project
Use when you need to generate an AGENTS.md file for a Java repository — covering project conventions, tech stack, file structure, commands, Git workflow, and contributor boundaries — through a modular, step-based interactive process that adapts to your specific project needs. This should trigger for requests such as Create AGENTS.md; Update AGENTS.md file; Add agent instructions. Part of cursor-rules-java project
>
Generated skill from request: trinity auto-boot validator
Create your OpenAI Agents SDK skill in one prompt, then learn to improve it throughout the chapter
Create your OpenAI Agents SDK skill in one prompt, then learn to improve it throughout the chapter
Create your Google Agent Development Kit skill in one prompt, then learn to improve it throughout the chapter