Skip to main content
ClaudeWave
Skill1.4k estrellas del repoactualizado today

tooluniverse-functional-genomics-screens

This skill prioritizes genetic screen hits (CRISPR-KO, CRISPRi, shRNA) by integrating essentiality data from DepMap, constraint metrics from gnomAD, pathway enrichment via Reactome and STRING, druggability assessment through DGIdb, and clinical evidence from CIViC and COSMIC databases. Use it to validate screen results, rank candidate targets by selectivity and actionability, and convert raw hit lists into evidence-backed target shortlists for drug discovery.

Instalar en Claude Code
Copiar
git clone --depth 1 https://github.com/mims-harvard/ToolUniverse /tmp/tooluniverse-functional-genomics-screens && cp -r /tmp/tooluniverse-functional-genomics-screens/plugin/skills/tooluniverse-functional-genomics-screens ~/.claude/skills/tooluniverse-functional-genomics-screens
Después abre una sesión nueva de Claude Code; el skill carga automáticamente.

SKILL.md

# Functional Genomics Screen Interpretation

Pipeline for validating and prioritizing hits from genetic screens (CRISPR-KO, CRISPRi, shRNA) by integrating essentiality (DepMap), constraint (gnomAD), pathways (Reactome, STRING), druggability (DGIdb), and clinical evidence (CIViC, COSMIC).

**Guiding principles**:
1. **Hits are hypotheses** -- screen results contain false positives; validate through orthogonal evidence
2. **Selectivity matters** -- pan-essential genes are poor drug targets; context-specific essentiality is high-value
3. **Pathway over gene** -- enriched pathways are more robust than individual hits
4. **Druggability is practical** -- prioritize chemically modulable targets
5. **English-first queries** -- use English gene names in tool calls

## LOOK UP, DON'T GUESS
When uncertain about any scientific fact, SEARCH databases first.

---

## COMPUTE, DON'T DESCRIBE
When analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do — execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.

## Workflow

```
Phase 0: Input Processing → gene list, screen type, cell line, disease context
Phase 1: Hit Validation → DepMap dependency, gnomAD constraint, UniProt function
Phase 2: Pathway & Network → Reactome enrichment, STRING network, functional clusters
Phase 3: Druggability → DGIdb interactions, druggable categories, PharmacoDB
Phase 4: Clinical Evidence → CIViC, COSMIC mutations
Phase 5: Literature → PubMed for key hits
Phase 6: Prioritized Report → ranked target list with multi-dimensional scoring
```

---

## Phase Details

### Phase 1: Hit Validation

**Tools**:
- `DepMap_get_gene_dependencies(gene_symbol=...)` -- returns gene metadata only (NOT per-cell-line scores)
- `DepMap_search_cell_lines(query=...)` -- cell line metadata
- `gnomad_get_gene_constraints(gene_symbol=...)` -- pLI, LOEUF (may return "Service overloaded")
- `UniProt_get_function_by_accession(accession=...)` -- function summary

**Classification**: Pan-essential (>90% lines), Selectively essential (specific lineages), Context-specific (screen model only). Chronos < -0.5 = likely essential, < -1.0 = strongly essential.

**DepMap per-cell-line Chronos scores**: `DepMap_get_gene_dependencies` returns metadata only. For the actual per-cell-line scores, use the bundled script in the cell-line-profiling skill — `tooluniverse-cell-line-profiling/scripts/depmap_gene_dependency.py` (downloads the current DepMap Public CRISPRGeneEffect.csv once, cached; queries by `gene` or `cell-line`):

```bash
python depmap_gene_dependency.py gene KRAS --lineage Lung --top 20   # most-dependent lines
python depmap_gene_dependency.py cell-line A375 --top 25             # genes the line needs
```
Chronos < -0.5 ≈ dependency, < -1.0 strongly essential. Fallback if you can't run it: gnomAD constraint + `PubMed_search_articles(query="[gene] CRISPR screen [cancer]")`.

### Phase 2: Pathway & Network

- `ReactomeAnalysis_pathway_enrichment(identifiers="TP53 BRCA1 EGFR")` -- space-separated string
- `STRING_get_network(identifiers="GENE1\rGENE2\rGENE3", species=9606)` -- carriage-return separated
- `STRING_functional_enrichment(identifiers=..., species=9606)` -- GO/KEGG enrichment

### Phase 3: Druggability

- `DGIdb_get_drug_gene_interactions(genes=["EGFR","BRAF"])` -- drug-gene interactions
- `DGIdb_get_gene_druggability(genes=[...])` -- categories (kinase, GPCR, etc.)
- For high-priority hits, also search `search_clinical_trials` and PubMed for novel inhibitors not yet in DGIdb.

### Phase 4: Clinical Evidence

- `civic_search_evidence_items(molecular_profile=gene)` -- NOT `query`
- `COSMIC_get_mutations_by_gene(gene_name=...)` -- somatic mutation frequency

### Phase 6: Prioritized Report

**Scoring (0-18)**:

| Criterion | Score 3 | Score 0 |
|-----------|---------|---------|
| Selective essentiality | <-0.5 in disease AND >-0.2 elsewhere | >-0.2 (not essential) |
| Pathway convergence | 3+ hits same pathway | Isolated hit |
| Druggability | Approved drug exists | Not druggable |
| Clinical evidence | CIViC therapeutic | No clinical data |
| Constraint | pLI >0.9 | No data |
| Literature | Multiple validation studies | No publications |

**Tiers**: T1 (15-18) high-confidence, T2 (10-14) promising, T3 (5-9) speculative, T4 (<5) likely false positive.

---

## Edge Cases

- **gnomAD overloaded**: Retry once, proceed without, note gap
- **Gene not in DepMap**: Fall back to gnomAD + UniProt
- **Large hit lists (>500)**: Pathway enrichment on full list; per-gene analysis on top 50
- **Non-cancer screens**: DepMap less informative; weight constraint/pathway more
- **shRNA vs CRISPR**: Higher validation bar for shRNA (off-target effects)

## Limitations

- DepMap is cancer-centric (~1000 cancer lines)
- No raw screen analysis (use MAGeCK/BAGEL upstream)
- STRING interactions are associations, not causal
setup-tooluniverseSkill

Install and configure ToolUniverse for any use case — MCP server (chat-based), CLI (command line with 9 subcommands), or Python SDK (Coding API with 3 calling patterns). Covers uv/uvx setup, MCP configuration for 12+ AI clients (Cursor, Claude Desktop, Windsurf, VS Code, Codex, Gemini CLI, Trae, Cline, etc.), full CLI reference (tu list/grep/find/info/run/test/status/build/serve), Coding API quickstart, agentic tools, code executor, API key walkthrough, skill installation, and upgrading. Use when user asks how to set up ToolUniverse, which access mode to use (MCP vs CLI vs SDK), configuring MCP servers, using the CLI, troubleshooting installation, upgrading, or mentions installing ToolUniverse or setting up scientific tools. Also triggers for "how do I use ToolUniverse", "what's the best way to access tools", "command line", "tu command", "coding API", "tu build".

tooluniverse-acmg-variant-classificationSkill

Systematic ACMG/AMP germline variant classification with all 28 criteria (PVS1, PS1-4, PM1-6, PP1-5, BA1, BS1-4, BP1-7) for clinical significance. Produces 5-tier verdict (Pathogenic / Likely Pathogenic / VUS / Likely Benign / Benign) with cited evidence per criterion. Use for variant interpretation, VUS resolution, and pathogenicity assessment. Combines ClinVar, gnomAD, computational predictors, and gene-mechanism context.

tooluniverse-admet-predictionSkill

Comprehensive ADMET (Absorption, Distribution, Metabolism, Excretion, Toxicity) profiling for drug candidates. Integrates ADMET-AI predictions, SwissADME drug-likeness, PubChemTox experimental toxicity, ChEMBL clinical data, Lipinski rule-of-five, and CYP interaction data. Use for drug-likeness assessment, BBB penetration, bioavailability, hepatotoxicity prediction, ADME/PK profiling, or screening compound libraries before lab testing.

tooluniverse-adverse-event-detectionSkill

Detect and analyze adverse drug event signals using FDA FAERS reports, drug labels, and disproportionality statistics (PRR, ROR, IC). Generates quantitative safety signal scores (0-100) with evidence grading. Use for post-market surveillance, pharmacovigilance, drug safety assessment, regulatory submissions, and detecting rare AE signals not visible in clinical trials.

tooluniverse-adverse-outcome-pathwaySkill

Map environmental and industrial chemicals to adverse outcome pathways (AOPs) — molecular initiating event to organ-level toxicity. Uses AOPWiki, GHS classification, IARC carcinogen status, and LD50 data. Use for environmental/industrial chemical risk assessment, regulatory-grade hazard characterization, and AOP stressor mapping. Distinct from drug-safety analysis (use tooluniverse-pharmacovigilance for drugs).

tooluniverse-aging-senescenceSkill

Aging biology, cellular senescence, and longevity research. Covers senescence markers (p16/CDKN2A, SASP, SA-beta-gal), aging hallmarks, senolytic drug discovery (dasatinib+quercetin, fisetin, navitoclax), epigenetic clocks, telomere biology, and longevity GWAS. Use for senescence-pathway analysis, age-related disease genetics, senolytic-target discovery, and centenarian-genetics queries. Distinguishes correlative vs causal evidence (knockout, intervention).

tooluniverse-antibody-engineeringSkill

Therapeutic antibody engineering and optimization, lead-to-clinical-candidate. Covers sequence humanization (germline alignment, framework retention), affinity maturation, developability (aggregation, stability, PTMs), structure modeling (AlphaFold/PDB CDR analysis), immunogenicity prediction, and manufacturing feasibility. Use for biologic-drug optimization, mAb design review, biosimilar engineering, and clinical-precedent comparison.

tooluniverse-binder-discoverySkill

Discover novel small-molecule binders for protein targets using structure-based and ligand-based screening. Covers druggability assessment, known-ligand mining (ChEMBL, BindingDB), similarity expansion, ADMET filtering, and synthesis feasibility. Use for hit identification, virtual screening, target-to-compounds workflows, and lead-finding before commit-to-medchem.