Skill389 estrellas del repoactualizado 19d ago

adaptive-pair-selection

Adaptive pair selection iteratively identifies the most informationally valuable item comparisons, executes them, updates a rating model based on results, and monitors convergence of the ranking. Use this skill when building robust preference rankings where comparison budget is limited and you need confidence intervals around final positions.

Ver fuente Repositorio: de-anthropocentric-research-engine

Instalar en Claude Code

Copiar

git clone --depth 1 https://github.com/yogsoth-ai/de-anthropocentric-research-engine /tmp/adaptive-pair-selection && cp -r /tmp/adaptive-pair-selection/skills/adaptive-pair-selection ~/.claude/skills/adaptive-pair-selection

Después abre una sesión nueva de Claude Code; el skill carga automáticamente.

Definición

SKILL.md

# Adaptive Pair Selection

Select the next comparison pair by information gain, execute the comparison, update ratings, and check for convergence. Repeats until the ranking stabilizes or the comparison budget is exhausted.

## Stages

1. **Select** — pair-selector identifies the pair whose comparison would most reduce uncertainty
2. **Compare** — comparison-executor produces a judgment with confidence and reasoning
3. **Update** — rating-update incorporates the new judgment into the rating model
4. **Check** — convergence-check determines if ranking has stabilized

Loop stages 1-4 until convergence or budget exhaustion.

## Available SOPs

| Stage | SOP | Input | Output |
|-------|-----|-------|--------|
| Select | pair-selector | current_ratings, comparison_history | next_pairs[] |
| Compare | comparison-executor | pair, context | judgment |
| Update | rating-update | judgment, current_ratings, method | updated_ratings |
| Check | convergence-check | rating_history | converged, stability_score |

## Execution Guidance

- Start with high-uncertainty pairs (largest sigma or most uncertain boundary)
- For small N: may complete all pairs in first pass, then focus on inconsistencies
- For large N: prioritize pairs near rank boundaries (positions k and k+1)
- Track comparison count against budget; exit gracefully if budget hit
- Pass full rating_history to convergence-check (not just latest snapshot)

## Minimum Yield

- Global ranking + confidence intervals + convergence curve
- Global ranking with confidence intervals for each position
- Convergence curve showing stability score over iterations
- Comparison log with all judgments made

<!-- BEGIN available-tables (generated) -->

## Available SOPs

Optional, no fixed order; the final leaf is always a sop.

| SOP | When to use |
| --- | --- |
| comparison-executor | Execute a pairwise comparison between two candidates, producing a judgment with winner, confidence, and reasoning. |
| convergence-check | Evaluate whether the ranking has stabilized by analyzing rating history and computing stability metrics. |
| pair-selector | Select the next comparison pairs that maximize information gain given current ratings and comparison history. |
| rating-update | Incorporate a new judgment into the rating model and return updated ratings for all candidates. |

<!-- END available-tables (generated) -->

Del mismo repositorio

formated-resultSkill

Experiment-specific - summarize the DARE executor's research design into a clean research_result report, forced to write back into the spec file produced by formated-specs.

formated-specsSkill

Experiment-specific - replaces writing-specs, emits DARE's 4-layer call plan as a clean research_graph schema. Last step forces load formated-result.

injection-fidelitySkill

loss-1 judge - read a sample's full dialogue and decide whether the user simulator semantically enacted its Policy Card. check-blind.

ladder-quality-orderSkill

loss-2 judge - pairwise quality comparison across the n rungs within one topic; decide monotonicity and endpoint separation. check-blind, D1-D5 only.

abductive-hypothesis-generationSkill

Strategy: Inference to the best explanation in the face of anomalies

ablation-brainstormSkill

Remove components one by one, observe system changes to reveal hidden

ablation-component-mappingSkill

Map system architecture to ablatable units for ablation studies

ablation-designSkill

Design ablation studies to isolate component contributions in ML systems