Skill · 94 repo stars · updated 1mo ago

harness-engineering

Harness engineering improves how AI agents work on codebases by structuring project setup, context design, constraints, evaluation systems, and multi-agent coordination. Use this skill when setting up a new project for agents, debugging why agents underperform or ignore instructions, or systematically improving existing agent infrastructure through linters, documentation standards, and feedback loops.

Repository: harness-engineering

Install in Claude Code

git clone --depth 1 https://github.com/10xChengTu/harness-engineering /tmp/harness-engineering && mkdir -p ~/.claude/skills && cp -r /tmp/harness-engineering/skills/harness-engineering ~/.claude/skills/harness-engineering

Then open a new Claude Code session; the skill loads automatically.

Definition

SKILL.md

# Harness Engineering

Harness = the operating system for AI agents working on your project. Model is CPU, context window is RAM, harness is OS.

## Core Principle

**Start simple, add complexity only when needed.** Every harness component encodes an assumption about what the model can't do alone. Pressure-test these assumptions — they expire as models improve. Build for deletion.
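One way to make "build for deletion" concrete is to record, next to each harness component, the assumption it encodes and when that assumption was last pressure-tested. The sketch below is illustrative only: `HarnessComponent`, `due_for_review`, the sample components, and the 90-day interval are hypothetical and not part of this skill.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass
class HarnessComponent:
    name: str           # e.g. a lint rule or a section of AGENTS.md (hypothetical examples)
    assumption: str     # what the model supposedly cannot do alone
    last_checked: date  # when the assumption was last pressure-tested


def due_for_review(components, today, max_age_days=90):
    """Return components whose assumption has not been re-tested within max_age_days."""
    cutoff = today - timedelta(days=max_age_days)
    return [c for c in components if c.last_checked < cutoff]


components = [
    HarnessComponent("import-order linter", "agent misorders imports", date(2025, 1, 10)),
    HarnessComponent("handoff template", "agent loses state across resets", date(2025, 5, 1)),
]

# Components flagged here are candidates for re-testing and, possibly, deletion.
stale = due_for_review(components, today=date(2025, 6, 1))
stale_names = [c.name for c in stale]
print(stale_names)
```

A component that still fails without its guardrail stays; one whose assumption no longer holds can be removed.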

## When This Skill Activates

| Signal | Action |
|--------|--------|
| Empty/new project | → Full project setup (`references/01-project-setup.md`) |
| User frustrated with agent | → Diagnose & fix harness gaps (`references/07-diagnosis.md`) |
| Existing project needs improvement | → Assess & incrementally improve |
| Explicit harness question | → Reference relevant sections |
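The activation table can be read as a small dispatch map from signal to action. The sketch below restates it in Python, pairing each action with the first reference file to open where the workflow names one. The `Signal` enum and `route` function are hypothetical names introduced for illustration.

```python
from enum import Enum


class Signal(Enum):
    NEW_PROJECT = "empty or new project"
    USER_FRUSTRATED = "user frustrated with agent"
    NEEDS_IMPROVEMENT = "existing project needs improvement"
    HARNESS_QUESTION = "explicit harness question"


# (action, first reference to read); None means the skill names no single entry point.
ROUTES = {
    Signal.NEW_PROJECT: ("Full project setup", "references/01-project-setup.md"),
    Signal.USER_FRUSTRATED: ("Diagnose and fix harness gaps", "references/07-diagnosis.md"),
    Signal.NEEDS_IMPROVEMENT: ("Assess and incrementally improve", None),
    Signal.HARNESS_QUESTION: ("Reference relevant sections", None),
}


def route(signal):
    """Look up the action and first reference for a detected signal."""
    return ROUTES[signal]


action, first_read = route(Signal.USER_FRUSTRATED)
print(action, "->", first_read)
```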

## Workflow

### For New Projects

1. **Assess** — What's the project? Tech stack? Team size? How will agents be used?
2. **Setup** — Create foundational harness files → read `references/01-project-setup.md`
3. **Context** — Design information architecture → read `references/02-context-engineering.md`
4. **Constraints** — Add guardrails and linters → read `references/03-constraints.md`
5. **Evaluate** — Set up feedback loops → read `references/05-eval-feedback.md`
6. If the project involves multiple agents or long-running tasks → read `references/04-multi-agent.md` and `references/06-long-running.md`
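The steps above amount to a reading list that grows with the project's needs. A minimal sketch follows, assuming the multi-agent reference is only needed for multi-agent projects and the long-running reference only for long tasks; the skill itself lists both files together. `reading_list` and its parameters are hypothetical names.

```python
def reading_list(multi_agent=False, long_running=False):
    """Return the reference files for a new project, in the order the workflow reads them."""
    refs = [
        "references/01-project-setup.md",        # step 2: Setup
        "references/02-context-engineering.md",  # step 3: Context
        "references/03-constraints.md",          # step 4: Constraints
        "references/05-eval-feedback.md",        # step 5: Evaluate
    ]
    # Step 6 is conditional; splitting it per flag is an assumption of this sketch.
    if multi_agent:
        refs.append("references/04-multi-agent.md")
    if long_running:
        refs.append("references/06-long-running.md")
    return refs


basic = reading_list()
extended = reading_list(multi_agent=True, long_running=True)
print(len(basic), len(extended))
```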

### For Diagnosis (Agent Not Performing Well)

1. Read `references/07-diagnosis.md` immediately
2. Identify which harness layer is failing
3. Apply targeted fix from the relevant reference

### For Incremental Improvement

Assess current harness maturity, identify weakest layer, improve one layer at a time.
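As a rough sketch of "identify weakest layer", the snippet below scores each harness layer and returns the lowest-scoring one together with its reference file. The 0 to 3 scale, the sample scores, and the `weakest_layer` helper are assumptions for illustration; the skill does not define a scoring scheme.

```python
# Layer names and reference files as listed in this skill's quick reference.
LAYER_REFERENCES = {
    "Project Setup": "01-project-setup.md",
    "Context Engineering": "02-context-engineering.md",
    "Constraints & Guardrails": "03-constraints.md",
    "Multi-Agent Architecture": "04-multi-agent.md",
    "Eval & Feedback": "05-eval-feedback.md",
    "Long-Running Tasks": "06-long-running.md",
}


def weakest_layer(scores):
    """Return (layer, reference) for the lowest score; ties go to the first layer given."""
    layer = min(scores, key=scores.get)
    return layer, LAYER_REFERENCES[layer]


# Hypothetical self-assessment: 0 = absent, 3 = mature.
scores = {
    "Project Setup": 3,
    "Context Engineering": 2,
    "Constraints & Guardrails": 1,
    "Eval & Feedback": 0,
}
layer, reference = weakest_layer(scores)
print(layer, "->", reference)
```

Improving only the returned layer before re-assessing keeps each change small enough to attribute its effect.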

## Harness Layers (Quick Reference)

| Layer | What | Reference |
|-------|------|-----------|
| **Project Setup** | AGENTS.md, docs/, directory conventions | `01-project-setup.md` |
| **Context Engineering** | What info agents see, progressive disclosure, working state | `02-context-engineering.md` |
| **Constraints & Guardrails** | Linters, type systems, architecture enforcement, safe autonomy | `03-constraints.md` |
| **Multi-Agent Architecture** | Agent separation, coordination protocols, delegation patterns | `04-multi-agent.md` |
| **Eval & Feedback** | Testing, grading, GC agents, observability | `05-eval-feedback.md` |
| **Long-Running Tasks** | Progress tracking, context resets, handoff artifacts | `06-long-running.md` |
| **Diagnosis** | When agents fail — identify root cause in harness, not model | `07-diagnosis.md` |

## Self-Update Protocol

When you discover a new reusable harness pattern during a project:

1. Identify which reference file it belongs to (or if it needs a new one)
2. Add the pattern with: **what** it solves, **when** to use it, **how** to implement it
3. Keep it concise — no fluff, just the pattern
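The protocol can be sketched as a helper that appends a what/when/how entry to a reference file. The entry layout (a level-3 heading followed by three bullets), the function names, and the sample pattern are assumptions; the skill does not prescribe a format. The example writes to a temporary directory rather than a real `references/` folder.

```python
import tempfile
from pathlib import Path


def format_pattern(title, what, when, how):
    """Render one pattern entry; this Markdown layout is a hypothetical convention."""
    return (
        f"\n### {title}\n\n"
        f"- **What:** {what}\n"
        f"- **When:** {when}\n"
        f"- **How:** {how}\n"
    )


def add_pattern(reference_file, title, what, when, how):
    """Append the pattern to the chosen reference file, creating the file if missing."""
    with Path(reference_file).open("a", encoding="utf-8") as f:
        f.write(format_pattern(title, what, when, how))


with tempfile.TemporaryDirectory() as tmp:
    target = Path(tmp) / "03-constraints.md"
    add_pattern(
        target,
        title="Lint error messages that state the fix",
        what="Agents repeat the same violation after a bare lint failure.",
        when="A rule fails more than once in a session.",
        how="Put the remediation step in the linter's error message.",
    )
    text = target.read_text(encoding="utf-8")

print(text)
```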

From the same repository

harness-engineering-zh · Skill

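Set up and improve harness engineering for AI-agent-friendly codebases (including AGENTS.md, docs/, lint rules, eval systems, and project-level prompt engineering). Triggers: setting up a new or empty project for AI agents, creating AGENTS.md or CLAUDE.md, questions about harness engineering, and making agents work more efficiently on a codebase. Also triggers when the user is frustrated or complains about agent quality (for example: 'the agent always ignores the conventions', 'it never follows instructions', 'why does it always get X wrong', 'the agent is broken'), because poor agent output almost always points to a gap in the harness rather than a model problem. Covers: context engineering, architectural constraints, multi-agent collaboration, evaluation, harnesses for long-running tasks, and diagnosis of agent quality problems.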