predictor-hand-skill
**predictor-hand-skill** This Claude Code skill provides structured guidance for probabilistic forecasting and prediction accuracy. It teaches superforecasting principles from academic research, including decomposition of complex questions into researchable components, signal taxonomy for evaluating evidence strength, confidence calibration techniques, and post-analysis methods. Use this skill when making forecasts about future events, assessing prediction confidence levels, evaluating conflicting indicators, or improving systematic reasoning about uncertain outcomes.
git clone --depth 1 https://github.com/RightNow-AI/openfang /tmp/predictor-hand-skill && cp -r /tmp/predictor-hand-skill/crates/openfang-hands/bundled/predictor ~/.claude/skills/predictor-hand-skillSKILL.md
# Forecasting Expert Knowledge ## Superforecasting Principles Based on research by Philip Tetlock and the Good Judgment Project: 1. **Triage**: Focus on questions that are hard enough to be interesting but not so hard they're unknowable 2. **Break problems apart**: Decompose big questions into smaller, researchable sub-questions (Fermi estimation) 3. **Balance inside and outside views**: Use both specific evidence AND base rates from reference classes 4. **Update incrementally**: Adjust predictions in small steps as new evidence arrives (Bayesian updating) 5. **Look for clashing forces**: Identify factors pulling in opposite directions 6. **Distinguish signal from noise**: Weight signals by their reliability and relevance 7. **Calibrate**: Your 70% predictions should come true ~70% of the time 8. **Post-mortem**: Analyze why predictions went wrong, not just celebrate the right ones 9. **Avoid the narrative trap**: A compelling story is not the same as a likely outcome 10. **Collaborate**: Aggregate views from diverse perspectives --- ## Signal Taxonomy ### Signal Types | Type | Description | Weight | Example | |------|-----------|--------|---------| | Leading indicator | Predicts future movement | High | Job postings surge → company expanding | | Lagging indicator | Confirms past movement | Medium | Quarterly earnings → business health | | Base rate | Historical frequency | High | "80% of startups fail within 5 years" | | Expert opinion | Informed prediction | Medium | Analyst forecast, CEO statement | | Data point | Factual measurement | High | Revenue figure, user count, benchmark | | Anomaly | Deviation from pattern | High | Unusual trading volume, sudden hiring freeze | | Structural change | Systemic shift | Very High | New regulation, technology breakthrough | | Sentiment shift | Collective mood change | Medium | Media tone change, social media trend | ### Signal Strength Assessment ``` STRONG signal (high predictive value): - Multiple independent sources confirm - Quantitative data (not just opinions) - Leading indicator with historical track record - Structural change with clear causal mechanism MODERATE signal (some predictive value): - Single authoritative source - Expert opinion from domain specialist - Historical pattern that may or may not repeat - Lagging indicator (confirms direction) WEAK signal (limited predictive value): - Social media buzz without substance - Single anecdote or case study - Rumor or unconfirmed report - Opinion from non-specialist ``` --- ## Confidence Calibration ### Probability Scale ``` 95% — Almost certain (would bet 19:1) 90% — Very likely (would bet 9:1) 80% — Likely (would bet 4:1) 70% — Probable (would bet 7:3) 60% — Slightly more likely than not 50% — Toss-up (genuine uncertainty) 40% — Slightly less likely than not 30% — Unlikely (but plausible) 20% — Very unlikely (but possible) 10% — Extremely unlikely 5% — Almost impossible (but not zero) ``` ### Calibration Rules 1. NEVER use 0% or 100% — nothing is absolutely certain 2. If you haven't done research, default to the base rate (outside view) 3. Your first estimate should be the reference class base rate 4. Adjust from the base rate using specific evidence (inside view) 5. Typical adjustment: ±5-15% per strong signal, ±2-5% per moderate signal 6. If your gut says 80% but your analysis says 55%, trust the analysis ### Brier Score The gold standard for measuring prediction accuracy: ``` Brier Score = (predicted_probability - actual_outcome)^2 actual_outcome = 1 if prediction came true, 0 if not Perfect score: 0.0 (you're always right with perfect confidence) Coin flip: 0.25 (saying 50% on everything) Terrible: 1.0 (100% confident, always wrong) Good forecaster: < 0.15 Average forecaster: 0.20-0.30 Bad forecaster: > 0.35 ``` --- ## Domain-Specific Source Guide ### Technology Predictions | Source Type | Examples | Use For | |-------------|---------|---------| | Product roadmaps | GitHub issues, release notes, blog posts | Feature predictions | | Adoption data | Stack Overflow surveys, NPM downloads, DB-Engines | Technology trends | | Funding data | Crunchbase, PitchBook, TechCrunch | Startup success/failure | | Patent filings | Google Patents, USPTO | Innovation direction | | Job postings | LinkedIn, Indeed, Levels.fyi | Technology demand | | Benchmark data | TechEmpower, MLPerf, Geekbench | Performance trends | ### Finance Predictions | Source Type | Examples | Use For | |-------------|---------|---------| | Economic data | FRED, BLS, Census | Macro trends | | Earnings | SEC filings, earnings calls | Company performance | | Analyst reports | Bloomberg, Reuters, S&P | Market consensus | | Central bank | Fed minutes, ECB statements | Interest rates, policy | | Commodity data | EIA, OPEC reports | Energy/commodity prices | | Sentiment | VIX, put/call ratio, AAII survey | Market mood | ### Geopolitics Predictions | Source Type | Examples | Use For | |-------------|---------|---------| | Official sources | Government statements, UN reports | Policy direction | | Think tanks | RAND, Brookings, Chatham House | Analysis | | Election data | Polls, voter registration, 538 | Election outcomes | | Trade data | WTO, customs data, trade balances | Trade policy | | Military data | SIPRI, defense budgets, deployments | Conflict risk | | Diplomatic signals | Ambassador recalls, sanctions, treaties | Relations | ### Climate Predictions | Source Type | Examples | Use For | |-------------|---------|---------| | Scientific data | IPCC, NASA, NOAA | Climate trends | | Energy data | IEA, EIA, IRENA | Energy transition | | Policy data | COP agreements, national plans | Regulation | | Corporate data | CDP disclosures, sustainability reports | Corporate action | | Technology data | BloombergNEF, patent filings | Clean tech trends | | Investment data | Green bond issuance, ESG flows | Capital allocation | --- ## Reasoning Chain Construction ### Template ``` PREDICTION: [
Playwright-based browser automation patterns for autonomous web interaction
Expert knowledge for AI video clipping — yt-dlp downloading, whisper transcription, SRT generation, and ffmpeg processing
Expert knowledge for AI intelligence collection — OSINT methodology, entity extraction, knowledge graphs, change detection, and sentiment analysis
Expert knowledge for the Infisical Sync Hand — Infisical API reference, vault operations, error patterns, security guidance
Expert knowledge for AI lead generation — web research, enrichment, scoring, deduplication, and report generation
Expert knowledge for AI deep research — methodology, source evaluation, search optimization, cross-referencing, synthesis, and citation formats
Expert knowledge for autonomous market intelligence and trading — technical analysis, risk management, Alpaca API, financial data sources
Expert knowledge for AI Twitter/X management — API v2 reference, content strategy, engagement playbook, safety, and performance tracking