Skip to main content
ClaudeWave
Skill1.4k repo starsupdated today

tooluniverse-antibody-engineering

This Claude Code skill provides an AI-guided pipeline for optimizing therapeutic antibodies from preclinical lead compounds toward clinical-ready candidates. It covers sequence humanization through germline alignment and framework retention, structure-guided CDR analysis using AlphaFold and PDB databases, affinity maturation strategies, and comprehensive developability assessment including aggregation risk, stability predictions, post-translational modification analysis, and immunogenicity scoring. Use it when designing or optimizing monoclonal antibodies, engineering bispecific antibodies, assessing biosimilar candidates, reducing manufacturing liabilities, or benchmarking sequences against approved clinical antibodies.

Install in Claude Code
Copy
git clone --depth 1 https://github.com/mims-harvard/ToolUniverse /tmp/tooluniverse-antibody-engineering && cp -r /tmp/tooluniverse-antibody-engineering/plugin/skills/tooluniverse-antibody-engineering ~/.claude/skills/tooluniverse-antibody-engineering
Then start a new Claude Code session; the skill loads automatically.

SKILL.md

# Antibody Engineering & Optimization

AI-guided antibody optimization pipeline from preclinical lead to clinical candidate. Covers sequence humanization, structure modeling, affinity optimization, developability assessment, immunogenicity prediction, and manufacturing feasibility.

**KEY PRINCIPLES**:
1. **Report-first approach** - Create optimization report before analysis
2. **Evidence-graded humanization** - Score based on germline alignment and framework retention
3. **Developability-focused** - Assess aggregation, stability, PTMs, immunogenicity
4. **Structure-guided** - Use AlphaFold/PDB structures for CDR analysis
5. **Clinical precedent** - Reference approved antibodies for validation
6. **Quantitative scoring** - Developability score (0-100) combining multiple factors
7. **English-first queries** - Always use English terms in tool calls, even if user writes in another language. Respond in user's language

---

## LOOK UP, DON'T GUESS
When uncertain about any scientific fact, SEARCH databases first (PubMed, UniProt, ChEMBL, ClinVar, etc.) rather than reasoning from memory. A database-verified answer is always more reliable than a guess.

---

## When to Use

Apply when user asks:
- "Humanize this mouse antibody sequence"
- "Optimize antibody affinity for [target]"
- "Assess developability of this antibody"
- "Predict immunogenicity risk for [sequence]"
- "Engineer bispecific antibody against [targets]"
- "Reduce aggregation in antibody formulation"
- "Design pH-dependent binding antibody"
- "Analyze CDR sequences and suggest mutations"

---

## Critical Workflow Requirements

### 1. Report-First Approach (MANDATORY)

1. **Create the report file FIRST**: `antibody_optimization_report.md`
2. **Progressively update** as analysis completes
3. **Output separate files**:
   - `optimized_sequences.fasta` - All optimized variants
   - `humanization_comparison.csv` - Before/after comparison
   - `developability_assessment.csv` - Detailed scores

See `REPORT_TEMPLATE.md` for the full report template with section formats.

### 2. Documentation Standards (MANDATORY)

Every optimization MUST include per-variant documentation with:
- Original and optimized sequences
- Humanization score (% human framework)
- CDR preservation confirmation
- Metrics table (humanness, aggregation risk, predicted KD, immunogenicity)
- Data source citations

---

## Phase 0: Tool Verification

### Required Tools

| Tool | Purpose | Category |
|------|---------|----------|
| `IMGT_search_genes` | Germline gene identification | Humanization |
| `IMGT_get_sequence` | Human framework sequences | Humanization |
| `SAbDab_search_structures` | Antibody structure precedents | Structure |
| `TheraSAbDab_search_by_target` | Clinical antibody benchmarks | Validation |
| `alphafold_get_prediction` | Structure modeling | Structure |
| `iedb_search_epitopes` | Epitope identification | Immunogenicity |
| `iedb_search_bcell` | B-cell epitope prediction | Immunogenicity |
| `UniProt_get_entry_by_accession` | Target antigen information | Target |
| `STRING_get_interaction_partners` | Protein interaction network | Bispecifics |
| `PubMed_search_articles` | Literature precedents | Validation |

**CRITICAL**: SOAP tools (IMGT, SAbDab, TheraSAbDab) require an `operation` parameter. See `QUICK_START.md` for correct usage.

---

## Workflow Overview

```
Phase 1: Input Analysis & Characterization
├── Sequence annotation (CDRs, framework)
├── Species identification
├── Target antigen identification
├── Clinical precedent search
└── OUTPUT: Input characterization
    ↓
Phase 2: Humanization Strategy
├── Germline gene alignment (IMGT)
├── Framework selection
├── CDR grafting design
├── Backmutation identification
└── OUTPUT: Humanization plan
    ↓
Phase 3: Structure Modeling & Analysis
├── AlphaFold prediction
├── CDR conformation analysis
├── Epitope mapping
├── Interface analysis
└── OUTPUT: Structural assessment
    ↓
Phase 4: Affinity Optimization
├── In silico mutation screening
├── CDR optimization strategies
├── Interface improvement
└── OUTPUT: Affinity variants
    ↓
Phase 5: Developability Assessment
├── Aggregation propensity
├── PTM site identification
├── Stability prediction
├── Expression prediction
└── OUTPUT: Developability score
    ↓
Phase 6: Immunogenicity Prediction
├── MHC-II epitope prediction (IEDB)
├── T-cell epitope risk
├── Aggregation-related immunogenicity
└── OUTPUT: Immunogenicity risk score
    ↓
Phase 7: Manufacturing Feasibility
├── Expression level prediction
├── Purification considerations
├── Formulation stability
└── OUTPUT: Manufacturing assessment
    ↓
Phase 8: Final Report & Recommendations
├── Ranked variant list
├── Experimental validation plan
├── Next steps
└── OUTPUT: Comprehensive report
```

---

## Phase 1: Input Analysis & Characterization

**Goal**: Annotate sequences, identify species/germline, find clinical precedents.

**Key steps**:
1. Annotate CDRs using IMGT numbering (CDR-H1: 27-38, CDR-H2: 56-65, CDR-H3: 105-117)
2. Identify closest human germline genes via `IMGT_search_genes`
3. Search clinical precedents via `TheraSAbDab_search_by_target`
4. Get target antigen info via `UniProt_get_entry_by_accession`

**Output**: Sequence information table, CDR annotation, target info, clinical precedent list.

See `WORKFLOW_DETAILS.md` Phase 1 for code examples.

---

## Phase 2: Humanization Strategy

**Goal**: Select human framework, design CDR grafting, identify backmutations.

**Key steps**:
1. Search IMGT for IGHV/IGKV human germline genes
2. Score candidate frameworks by identity, CDR compatibility, and clinical use
3. Design CDR grafting onto selected framework
4. Identify Vernier zone residues that may need backmutation (positions 2, 27-30, 47-48, 67, 69, 71, 78, 93-94)
5. Generate at least 2 variants: full humanization and with key backmutations
6. Calculate humanization score (framework humanness, CDR preservation, T-cell epitopes, aggregation risk)

**Output**: Framework selection
setup-tooluniverseSkill

Install and configure ToolUniverse for any use case — MCP server (chat-based), CLI (command line with 9 subcommands), or Python SDK (Coding API with 3 calling patterns). Covers uv/uvx setup, MCP configuration for 12+ AI clients (Cursor, Claude Desktop, Windsurf, VS Code, Codex, Gemini CLI, Trae, Cline, etc.), full CLI reference (tu list/grep/find/info/run/test/status/build/serve), Coding API quickstart, agentic tools, code executor, API key walkthrough, skill installation, and upgrading. Use when user asks how to set up ToolUniverse, which access mode to use (MCP vs CLI vs SDK), configuring MCP servers, using the CLI, troubleshooting installation, upgrading, or mentions installing ToolUniverse or setting up scientific tools. Also triggers for "how do I use ToolUniverse", "what's the best way to access tools", "command line", "tu command", "coding API", "tu build".

tooluniverse-acmg-variant-classificationSkill

Systematic ACMG/AMP germline variant classification with all 28 criteria (PVS1, PS1-4, PM1-6, PP1-5, BA1, BS1-4, BP1-7) for clinical significance. Produces 5-tier verdict (Pathogenic / Likely Pathogenic / VUS / Likely Benign / Benign) with cited evidence per criterion. Use for variant interpretation, VUS resolution, and pathogenicity assessment. Combines ClinVar, gnomAD, computational predictors, and gene-mechanism context.

tooluniverse-admet-predictionSkill

Comprehensive ADMET (Absorption, Distribution, Metabolism, Excretion, Toxicity) profiling for drug candidates. Integrates ADMET-AI predictions, SwissADME drug-likeness, PubChemTox experimental toxicity, ChEMBL clinical data, Lipinski rule-of-five, and CYP interaction data. Use for drug-likeness assessment, BBB penetration, bioavailability, hepatotoxicity prediction, ADME/PK profiling, or screening compound libraries before lab testing.

tooluniverse-adverse-event-detectionSkill

Detect and analyze adverse drug event signals using FDA FAERS reports, drug labels, and disproportionality statistics (PRR, ROR, IC). Generates quantitative safety signal scores (0-100) with evidence grading. Use for post-market surveillance, pharmacovigilance, drug safety assessment, regulatory submissions, and detecting rare AE signals not visible in clinical trials.

tooluniverse-adverse-outcome-pathwaySkill

Map environmental and industrial chemicals to adverse outcome pathways (AOPs) — molecular initiating event to organ-level toxicity. Uses AOPWiki, GHS classification, IARC carcinogen status, and LD50 data. Use for environmental/industrial chemical risk assessment, regulatory-grade hazard characterization, and AOP stressor mapping. Distinct from drug-safety analysis (use tooluniverse-pharmacovigilance for drugs).

tooluniverse-aging-senescenceSkill

Aging biology, cellular senescence, and longevity research. Covers senescence markers (p16/CDKN2A, SASP, SA-beta-gal), aging hallmarks, senolytic drug discovery (dasatinib+quercetin, fisetin, navitoclax), epigenetic clocks, telomere biology, and longevity GWAS. Use for senescence-pathway analysis, age-related disease genetics, senolytic-target discovery, and centenarian-genetics queries. Distinguishes correlative vs causal evidence (knockout, intervention).

tooluniverse-binder-discoverySkill

Discover novel small-molecule binders for protein targets using structure-based and ligand-based screening. Covers druggability assessment, known-ligand mining (ChEMBL, BindingDB), similarity expansion, ADMET filtering, and synthesis feasibility. Use for hit identification, virtual screening, target-to-compounds workflows, and lead-finding before commit-to-medchem.

tooluniverse-cancer-classificationSkill

Translate free-text tumor descriptions to OncoTree codes and resolve cancer subtypes/tissue hierarchy. Cross-references UMLS/NCI vocabularies. Use for standardizing cancer-type nomenclature in EHR free-text, building cohorts in OncoKB or GDC, mapping tumor-board notes to ontology codes, and ensuring consistent terminology across cancer-genomics pipelines.