Skill · 389 repo stars · updated 20d ago

attack-resilience-scoring

This Claude Code skill computes a quantitative resilience score (0.0-1.0) for artifacts evaluated through red team testing by analyzing aggregated vulnerabilities, coverage metrics, and severity distribution across logical, empirical, methodological, and practical dimensions. Use it when you need an objective, bias-independent assessment of how well a system withstands identified attacks relative to test coverage.

Repository: de-anthropocentric-research-engine

Install in Claude Code

git clone --depth 1 https://github.com/yogsoth-ai/de-anthropocentric-research-engine /tmp/attack-resilience-scoring && cp -r /tmp/attack-resilience-scoring/skills/attack-resilience-scoring ~/.claude/skills/attack-resilience-scoring

Then open a new Claude Code session; the skill loads automatically.

Definition

SKILL.md

# Attack Resilience Scoring

Computes a quantitative resilience score for the artifact based on red team results.

## Execution

Subagent — spawned via subagent-spawning/spawn-agent.

## Why Subagent

Scoring requires calibrated judgment independent of attack or defense bias. The scorer must weigh findings objectively against coverage.

## Input

- **aggregated_findings**: Deduplicated vulnerability report from finding-aggregation
- **coverage_data**: What percentage of threat surfaces were tested, at what depth
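
The skill does not define a schema for these two inputs. As a rough illustration only, they could be modeled as below. Every name here (`Finding`, `CoverageData`, `fraction_tested`, `depth`, the severity labels) is a hypothetical choice made for this sketch, not something the skill or the repository specifies.

```python
from dataclasses import dataclass

# The four dimensions come from the skill's Output section.
DIMENSIONS = ("logical", "empirical", "methodological", "practical")
# Assumed severity scale; the skill does not name its severity levels.
SEVERITIES = ("low", "medium", "high", "critical")
# Assumed depth labels; the skill only says coverage has a "depth".
DEPTHS = ("shallow", "deep")


@dataclass(frozen=True)
class Finding:
    """One deduplicated vulnerability from finding-aggregation (assumed fields)."""
    title: str
    dimension: str
    severity: str

    def __post_init__(self):
        if self.dimension not in DIMENSIONS:
            raise ValueError(f"unknown dimension: {self.dimension!r}")
        if self.severity not in SEVERITIES:
            raise ValueError(f"unknown severity: {self.severity!r}")


@dataclass(frozen=True)
class CoverageData:
    """Share of threat surfaces tested, and how deeply (assumed fields)."""
    fraction_tested: float  # 0.0-1.0
    depth: str = "shallow"

    def __post_init__(self):
        if not 0.0 <= self.fraction_tested <= 1.0:
            raise ValueError("fraction_tested must be within [0.0, 1.0]")
        if self.depth not in DEPTHS:
            raise ValueError(f"unknown depth: {self.depth!r}")


# Example inputs a caller might hand to the scoring subagent.
aggregated_findings = [
    Finding("Circular premise in section 2", "logical", "high"),
    Finding("Benchmark not reproducible", "empirical", "medium"),
]
coverage_data = CoverageData(fraction_tested=0.6, depth="deep")
```

Validating at construction time keeps malformed findings from reaching the scorer, which matters when the score is meant to be weighed against coverage.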

## Output

- **resilience_score**: 0.0-1.0 overall score
- **dimension_scores**: Per-dimension breakdown (logical, empirical, methodological, practical)
- **confidence_in_score**: How much to trust the score given coverage gaps
- **verdict**: Pass/conditional-pass/fail with justification
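
The skill delegates scoring to a subagent's calibrated judgment and does not publish a formula. The sketch below substitutes a simple deterministic rule to show how the four outputs could relate to each other. The penalty weights, the use of coverage as the confidence value, and the verdict thresholds are all assumptions made for illustration; `score_resilience` is a hypothetical function, not part of the skill.

```python
DIMENSIONS = ("logical", "empirical", "methodological", "practical")

# Assumed penalty per finding, by severity (not defined by the skill).
SEVERITY_PENALTY = {"low": 0.05, "medium": 0.15, "high": 0.30, "critical": 0.60}


def score_resilience(findings, fraction_tested):
    """Score an artifact from (dimension, severity) pairs and tested coverage.

    findings: list of (dimension, severity) tuples.
    fraction_tested: share of threat surfaces tested, 0.0-1.0.
    """
    # Each dimension starts at 1.0 and loses a penalty per finding, floored at 0.
    dimension_scores = {}
    for dim in DIMENSIONS:
        penalty = sum(SEVERITY_PENALTY[sev] for d, sev in findings if d == dim)
        dimension_scores[dim] = max(0.0, 1.0 - penalty)

    # Overall score: unweighted mean of the dimension scores (assumed).
    resilience_score = sum(dimension_scores.values()) / len(DIMENSIONS)

    # Assumed rule: trust in the score is capped by how much was tested.
    confidence_in_score = fraction_tested

    # Assumed thresholds for the three verdicts.
    if resilience_score >= 0.8 and confidence_in_score >= 0.7:
        verdict = "pass"
    elif resilience_score >= 0.5:
        verdict = "conditional-pass"
    else:
        verdict = "fail"

    justification = (
        f"{len(findings)} finding(s), score {resilience_score:.2f}, "
        f"coverage {fraction_tested:.0%}"
    )
    return {
        "resilience_score": resilience_score,
        "dimension_scores": dimension_scores,
        "confidence_in_score": confidence_in_score,
        "verdict": verdict,
        "justification": justification,
    }


result = score_resilience([("logical", "high"), ("empirical", "medium")], 0.6)
print(result["verdict"])  # conditional-pass: the score is high but coverage is below 0.7
```

Tying the verdict to both the score and the confidence reflects the stated goal of weighing findings against coverage: under these assumed thresholds, a clean result from a thinly tested artifact cannot earn an unconditional pass.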

<!-- BEGIN available-tables (generated) -->

## Available SOPs

Optional, no fixed order; the final leaf is always a sop.

| SOP | When to use |
| --- | --- |
| spawn-agent | Spawn a customized CC subagent with full MCP tool access. Used by SOPs that declare execution: subagent. |

<!-- END available-tables (generated) -->

From the same repository

formated-result (Skill)

Experiment-specific - summarize the DARE executor's research design into a clean research_result report, forced to write back into the spec file produced by formated-specs.

formated-specs (Skill)

Experiment-specific - replaces writing-specs and emits DARE's 4-layer call plan as a clean research_graph schema. The last step forces loading formated-result.

injection-fidelity (Skill)

loss-1 judge - read a sample's full dialogue and decide whether the user simulator semantically enacted its Policy Card. check-blind.

ladder-quality-order (Skill)

loss-2 judge - pairwise quality comparison across the n rungs within one topic; decide monotonicity and endpoint separation. check-blind, D1-D5 only.

abductive-hypothesis-generation (Skill)

Strategy: Inference to the best explanation in the face of anomalies

ablation-brainstorm (Skill)

Remove components one by one, observe system changes to reveal hidden

ablation-component-mapping (Skill)

Map system architecture to ablatable units for ablation studies

ablation-design (Skill)

Design ablation studies to isolate component contributions in ML systems