skill-audit
Audit, design, categorize, distribute, and measure agent skills using lessons from Anthropic's Lessons from building Claude Code: How we use skills. Use when reviewing an existing skill, deciding whether a workflow deserves a skill, planning a skill library, turning team knowledge into skills, choosing skill categories, writing trigger descriptions, designing progressive disclosure, or planning skill marketplace and usage measurement.
git clone --depth 1 https://github.com/majiayu000/spellbook /tmp/skill-audit && cp -r /tmp/skill-audit/skills/skill-audit ~/.claude/skills/skill-auditSKILL.md
# Skill Audit Use this skill to audit existing skills, turn workflow knowledge into useful agent skills, and review skill libraries at the strategy level. It complements `skill-creator`: use this skill to decide what a skill should be, how it should fit a library, and what needs improvement; use `skill-creator` when the user wants the concrete SKILL.md implementation and eval loop. This workflow is based on Anthropic's June 3, 2026 blog post, "Lessons from building Claude Code: How we use skills": https://claude.com/blog/lessons-from-building-claude-code-how-we-use-skills ## Core Principle A good skill is not "some markdown about a topic." It is a compact extension point that gives the agent non-obvious domain knowledge, reusable files, deterministic helpers, setup rules, verification habits, and guardrails at the moment they matter. ## Workflow ### 1. Decide Whether This Should Be A Skill Create or improve a skill only when at least one of these is true: - The workflow repeats often enough that users should not re-explain it. - The agent regularly makes the same domain-specific mistake. - The work needs local scripts, templates, examples, assets, hooks, or setup. - The output must follow a stable structure or verification path. - The knowledge is team-specific, product-specific, infrastructure-specific, or otherwise not inferable from general model knowledge. Do not make a skill when the content only restates obvious coding behavior, generic best practices, or one-off instructions. ### 2. Classify The Skill Read `references/skill-taxonomy.md` and classify the candidate into exactly one primary category. If it appears to span several categories, tighten the scope or split it. Report: - Primary category - Secondary category, if truly needed - Why this category is the cleanest fit - What would make the skill too broad ### 3. Draft A Skill Brief Use `assets/skill-brief-template.md` for the output. Fill it with: - Trigger description written for the model, not as a human-facing summary - High-signal knowledge the model would not otherwise know - Gotchas and failure modes - Autonomy boundaries: what the skill may do directly, and what must be escalated before acting - Evidence-backed pushback rules: when the agent should challenge the proposed path and what evidence it must cite - Feedback loop: where repeated corrections, false-success signals, or manual recovery steps should be promoted - Progressive disclosure map: SKILL.md vs references vs scripts vs assets - Setup requirements or config questions - Verification strategy - Distribution path - Measurement plan ### 4. Design Progressive Disclosure Keep `SKILL.md` focused on activation, decisions, and the main workflow. Move details into support files: - `references/` for tables, API conventions, taxonomy, playbooks, and long docs - `scripts/` for deterministic actions or repetitive checks - `assets/` for templates, report formats, starter files, or examples - `agents/` for specialized subagent prompts when the repo supports them - `evals/` for realistic prompts and objective assertions Tell the agent exactly when to read each support file. ### 5. Add Operational Design Read `references/writing-and-operations.md` when deciding: - Whether a setup step or config file is needed - Whether the skill should remember past runs - Whether scripts or hooks would improve reliability - Whether the skill belongs in a repo, a shared plugin, or a marketplace - What usage signals indicate undertriggering, overtriggering, or decay ### 6. Hand Off To Implementation When the user wants the skill built, pass the brief into `skill-creator` and ask it to implement the files, generate realistic test prompts, and run validation. If editing an existing skill, include the exact file paths and the smallest content changes needed. Do not rewrite unrelated skill behavior. ## Output Format For advisory requests, answer with: 1. Decision: create, improve, split, merge, or do not create 2. Category: one primary taxonomy category 3. Skill brief: filled from the template 4. Implementation notes: files to create/edit and validation commands 5. Risks: overbreadth, obviousness, missing setup, missing verification, or weak trigger description For repository work, actually create or update the files, then run the repo's skill validation command. ## Gotchas - Do not make a knowledge dump. Convert article or team knowledge into decisions, checklists, templates, and verification. - Do not put all details in SKILL.md. Long reference material belongs in support files. - Do not write a description as a marketing summary. It must name concrete user phrases and contexts that should trigger the skill. - Do not railroad the agent with brittle instructions. Provide defaults, decision criteria, and escape hatches. - Do not ship a skill without at least a lightweight way to tell if it worked: validation commands, example prompts, expected artifacts, or usage metrics. - Do not encode vague autonomy such as "be proactive." Name the direct actions, escalation boundaries, and end-state checks that should change behavior.
Senior backend TypeScript architect specializing in Bun/Node.js runtime, API design, database optimization, and scalable server architecture.
Expert at exploring and understanding legacy and unfamiliar codebases. Maps dependencies, identifies patterns, and creates documentation for complex systems.
Kubernetes architect specializing in cluster design, manifests, Helm charts, GitOps workflows, security policies, and production operations.
Systematic open source contributor that analyzes projects, finds suitable issues, implements fixes, and creates high-quality PRs with high acceptance probability.
Application security expert specializing in SAST, vulnerability assessment, OWASP Top 10, compliance auditing, and security architecture review.
Fullstack code reviewer with 15+ years experience analyzing code for security vulnerabilities, performance bottlenecks, architectural decisions, and best practices.
Senior technical lead who analyzes complex projects and coordinates multi-step development tasks. Delegates to specialized agents and ensures quality delivery.
Use when the user explicitly asks to stage all current changes, create a commit, and push to the remote after safety checks.