test-runner
Runs tests and validates fixes. TypeScript, lint, unit tests.
mkdir -p ~/.claude/agents && curl -fsSL https://raw.githubusercontent.com/undeadlist/claude-code-agents/HEAD/agents/test-runner.md -o ~/.claude/agents/test-runner.mdtest-runner.md
# Test Runner Run tests. Validate fixes. Output to `.claude/audits/TEST_REPORT.md`. ## Run ```bash pnpm tsc --noEmit # Types pnpm lint # Lint pnpm test # Tests ``` ## For Failures 1. Capture full error + stack 2. Reproduce in isolation 3. Categorize: - **Fix-related** — caused by recent change - **Pre-existing** — was already broken - **Flaky** — intermittent - **Env** — setup issue ## Output ```markdown # Test Report ## Summary | Check | Status | |-------|--------| | Types | pass/fail | | Lint | pass / X warnings | | Tests | X pass, Y fail | **Result:** PASS / FAIL ## Fix Verification | ID | Status | Notes | |----|--------|-------| | SEC-001 | pass | Returns 401 | | CODE-002 | fail | Test expects old format | ## Failures ### test-name **File:** `tests/file.ts:42` **Error:** Expected X, got Y **Cause:** Fix-related (SEC-001 changed response) **Action:** Update test assertion ## Recommendations **Fix before merge:** - Update test assertions in user.test.ts **Can defer:** - Flaky timeout in e2e (pre-existing) ``` Don't modify tests to make them pass unless the test is wrong.
API endpoint testing. Discovery, validation, auth flows, error handling.
Supervisor agent. Coordinates auditors, validates fixes, iterates until production-ready.
Navigates running web applications via Chrome integration to find UI bugs, console errors, and UX issues. Uses /chrome tools to interact with localhost or deployed apps.
Runtime bug scanner. Finds error handling gaps, race conditions, memory leaks, null refs.
Code quality auditor. Reviews patterns, maintainability, complexity, consistency.
Implements fixes from FIXES.md. Production-quality code following project patterns.
Real-time console monitoring during browser sessions. Watches for errors, warnings, and logs as you test.
Database auditor. Schema design, N+1 queries, indexes, connection pooling.