Skill1.6k repo starsupdated today

tooluniverse-electron-microscopy

This Claude Code skill searches and analyzes electron microscopy structural data across EMDB density maps, PDB fitted atomic models, EMPIAR raw micrograph datasets, and CryoET Data Portal volumes. Use it when seeking 3D structural information on proteins or complexes, comparing experimental cryo-EM resolution to AlphaFold predictions, accessing raw EM datasets for reprocessing, or cross-referencing density maps with atomic models across multiple resolution scales.

View source Repository: ToolUniverse

Install in Claude Code

Copy

git clone --depth 1 https://github.com/mims-harvard/ToolUniverse /tmp/tooluniverse-electron-microscopy && cp -r /tmp/tooluniverse-electron-microscopy/plugin/skills/tooluniverse-electron-microscopy ~/.claude/skills/tooluniverse-electron-microscopy

Then start a new Claude Code session; the skill loads automatically.

Definition

SKILL.md

# Electron Microscopy Structure Analysis

Pipeline for discovering and analyzing electron microscopy data across the full resolution spectrum: from 3D density maps (EMDB) to fitted atomic models (PDB), raw micrograph datasets (EMPIAR), and cryo-electron tomography volumes (CryoET Data Portal). Connects EM data to structural biology context via PDB and AlphaFold.

**Guiding principles**:
1. **Resolution awareness** -- always report and interpret map resolution; sub-4A enables atomic modeling, 4-8A enables domain fitting, >8A is shape-level
2. **Map before model** -- the density map is the primary experimental data; fitted models are interpretations
3. **Method matters** -- single particle analysis, tomography, 2D crystallography, and helical reconstruction have different strengths and limitations
4. **Raw data value** -- EMPIAR raw data enables reprocessing with newer algorithms; always note availability
5. **Cross-reference structures** -- connect EMDB maps to PDB entries and AlphaFold predictions for completeness
6. **English-first queries** -- use English terms in tool calls

EM resolution determines what you can see. TEM resolves individual protein complexes (~2nm). Cryo-EM achieves near-atomic resolution (<4Å) for large complexes. SEM shows surface topology. Choose the right EM modality for the question.

## LOOK UP, DON'T GUESS
When uncertain about any scientific fact, SEARCH databases first rather than reasoning from memory. A database-verified answer is always more reliable than a guess.

---

## COMPUTE, DON'T DESCRIBE
When analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do — execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.

## When to Use

Typical triggers:
- "Find cryo-EM structures of [protein/complex]"
- "What EMDB maps are available for [target]?"
- "Get raw micrograph data for [structure]"
- "Find tomography datasets for [organelle/cell type]"
- "What is the resolution of [EMDB entry]?"
- "Cross-reference this EM map with PDB models"
- "Find cryo-ET datasets for [sample]"

**Not this skill**: For X-ray crystallography or NMR structures, use PDB search tools directly. For protein structure prediction, use `tooluniverse-protein-structure`.

---

## Core Databases

| Database | Content | Best For |
|----------|---------|----------|
| **EMDB** | 3D EM density maps (>40K entries) | Finding processed maps, resolution data, fitting info |
| **EMPIAR** | Raw micrograph/tilt series datasets | Accessing original image data for reprocessing |
| **CryoET Data Portal** | Cryo-electron tomography data | Tomographic volumes, cellular context, in-situ structures |
| **PDB (RCSB)** | Atomic models fitted to EM maps | Structural models derived from EM data |
| **AlphaFold** | AI-predicted protein structures | Complementary models when EM resolution is limited |

---

## Workflow Overview

```
Phase 0: Query Parsing
  Identify target protein/complex, method preference, resolution needs
    |
Phase 1: Map & Image Search (EMDB)
  Find EM density maps, resolution, method, sample details
    |
Phase 2: Structure Fitting (EMDB + PDB)
  Identify fitted atomic models, fitting quality
    |
Phase 3: Raw Data Access (EMPIAR)
  Find raw micrographs, tilt series, particle stacks
    |
Phase 4: Tomography (CryoET Data Portal)
  Search cryo-ET datasets, reconstructed volumes
    |
Phase 5: Cross-Reference & Context (PDB + AlphaFold)
  Connect to atomic models, predicted structures, literature
    |
Phase 6: Report Synthesis
  Integrated EM data landscape for the target
```

---

## Phase Details

### Phase 0: Query Parsing

Identify from the user's request:
- **Target**: protein name, complex name, or organism
- **Method preference**: single particle, tomography, micro-ED, helical
- **Resolution needs**: atomic modeling (<4A), domain fitting (4-8A), shape (>8A)
- **Data type**: processed maps, raw data, fitted models, or all

### Phase 1: Map & Image Search (EMDB)

**Objective**: Find EM density maps matching the query.

**Tools**:
- `EMDB_search_structures` -- search EMDB by keyword, organism, resolution
  - Input: `query` (search term), optional `resolution_min`, `resolution_max`, `method`, `limit`
  - Output: entries with EMDB ID, title, resolution, method, sample
- `EMDB_get_structure` -- get full details for an EMDB entry
  - Input: `emdb_id` (e.g., "EMD-1234")
  - Output: map details, resolution, sample, processing info, citations
- `EMDB_get_map_info` -- get map-specific info (resolution, contour, dimensions)
  - Input: `emdb_id`
- `EMDB_get_sample_info` -- get sample preparation details
  - Input: `emdb_id`

**Workflow**:
1. Search EMDB for the target protein/complex
2. Sort results by resolution (best first)
3. For top entries, get full details including sample preparation and processing
4. Note the EM method used (single particle, tomography, helical, etc.)
5. Record associated PDB and EMPIAR accessions

**Resolution interpretation**:
- < 2.5A: near-atomic; side chains visible
- 2.5-4.0A: atomic; backbone and large side chains traceable
- 4.0-8.0A: domain level; secondary structure elements visible
- > 8.0A: shape; overall architecture only

### Phase 2: Structure Fitting (EMDB + PDB)

**Objective**: Find atomic models fitted into EM maps and assess fitting quality.

**Tools**:
- `EMDB_get_validation` -- get fitting/validation data for an EMDB entry
  - Input: `emdb_id`
  - Output: fitted PDB models, fitting statistics, validation scores
- `RCSBData_get_entry` -- get PDB entry details
  - Input: `entry_id` (PDB ID)
  - Output: structure details, resolution, method, citation
- `RCSBAdvSearch_search_structures` -- advanced PDB search
  - Input: `query` (search term), optional `experimental_method`, `resolution_max`, `limit`
  - Output: PDB entries matching criteria

**Workflow**:
1. For each EMDB entry

More from this repository

setup-tooluniverseSkill

Install and configure ToolUniverse for any use case — MCP server (chat-based), CLI (command line with 9 subcommands), or Python SDK (Coding API with 3 calling patterns). Covers uv/uvx setup, MCP configuration for 12+ AI clients (Cursor, Claude Desktop, Windsurf, VS Code, Codex, Gemini CLI, Trae, Cline, etc.), full CLI reference (tu list/grep/find/info/run/test/status/build/serve), Coding API quickstart, agentic tools, code executor, API key walkthrough, skill installation, and upgrading. Use when user asks how to set up ToolUniverse, which access mode to use (MCP vs CLI vs SDK), configuring MCP servers, using the CLI, troubleshooting installation, upgrading, or mentions installing ToolUniverse or setting up scientific tools. Also triggers for "how do I use ToolUniverse", "what's the best way to access tools", "command line", "tu command", "coding API", "tu build".

tooluniverse-acmg-variant-classificationSkill

Systematic ACMG/AMP germline variant classification with all 28 criteria (PVS1, PS1-4, PM1-6, PP1-5, BA1, BS1-4, BP1-7) for clinical significance. Produces 5-tier verdict (Pathogenic / Likely Pathogenic / VUS / Likely Benign / Benign) with cited evidence per criterion. Use for variant interpretation, VUS resolution, and pathogenicity assessment. Combines ClinVar, gnomAD, computational predictors, and gene-mechanism context.

tooluniverse-admet-predictionSkill

Comprehensive ADMET (Absorption, Distribution, Metabolism, Excretion, Toxicity) profiling for drug candidates. Integrates ADMET-AI predictions, SwissADME drug-likeness, PubChemTox experimental toxicity, ChEMBL clinical data, Lipinski rule-of-five, and CYP interaction data. Use for drug-likeness assessment, BBB penetration, bioavailability, hepatotoxicity prediction, ADME/PK profiling, or screening compound libraries before lab testing.

tooluniverse-adverse-event-detectionSkill

Detect and analyze adverse drug event signals using FDA FAERS reports, drug labels, and disproportionality statistics (PRR, ROR, IC). Generates quantitative safety signal scores (0-100) with evidence grading. Use for post-market surveillance, pharmacovigilance, drug safety assessment, regulatory submissions, and detecting rare AE signals not visible in clinical trials.

tooluniverse-adverse-outcome-pathwaySkill

Map environmental and industrial chemicals to adverse outcome pathways (AOPs) — molecular initiating event to organ-level toxicity. Uses AOPWiki, GHS classification, IARC carcinogen status, and LD50 data. Use for environmental/industrial chemical risk assessment, regulatory-grade hazard characterization, and AOP stressor mapping. Distinct from drug-safety analysis (use tooluniverse-pharmacovigilance for drugs).

tooluniverse-aging-senescenceSkill

Aging biology, cellular senescence, and longevity research. Covers senescence markers (p16/CDKN2A, SASP, SA-beta-gal), aging hallmarks, senolytic drug discovery (dasatinib+quercetin, fisetin, navitoclax), epigenetic clocks, telomere biology, and longevity GWAS. Use for senescence-pathway analysis, age-related disease genetics, senolytic-target discovery, and centenarian-genetics queries. Distinguishes correlative vs causal evidence (knockout, intervention).

tooluniverse-antibody-engineeringSkill

Therapeutic antibody engineering and optimization, lead-to-clinical-candidate. Covers sequence humanization (germline alignment, framework retention), affinity maturation, developability (aggregation, stability, PTMs), structure modeling (AlphaFold/PDB CDR analysis), immunogenicity prediction, and manufacturing feasibility. Use for biologic-drug optimization, mAb design review, biosimilar engineering, and clinical-precedent comparison.

tooluniverse-binder-discoverySkill

Discover novel small-molecule binders for protein targets using structure-based and ligand-based screening. Covers druggability assessment, known-ligand mining (ChEMBL, BindingDB), similarity expansion, ADMET filtering, and synthesis feasibility. Use for hit identification, virtual screening, target-to-compounds workflows, and lead-finding before commit-to-medchem.