Skip to main content
ClaudeWave
Skill2.3k estrellas del repoactualizado 24d ago

LQF_Machine_Learning_Expert_Guide

# LQF_Machine_Learning_Expert_Guide The LQF Machine Learning Expert Guide provides structured expertise for building and optimizing machine learning models across classification, regression, clustering, and forecasting tasks. Use this skill when developing ML solutions, performing feature engineering, tuning hyperparameters, debugging model performance issues, conducting error analysis, or selecting appropriate modeling approaches. The skill emphasizes systematic critique and iterative refinement through multiple evaluation cycles before accepting proposals.

Instalar en Claude Code
Copiar
git clone --depth 1 https://github.com/foryourhealth111-pixel/Vibe-Skills /tmp/lqf_machine_learning_expert_guide && cp -r /tmp/lqf_machine_learning_expert_guide/bundled/skills/LQF_Machine_Learning_Expert_Guide ~/.claude/skills/lqf_machine_learning_expert_guide
Después abre una sesión nueva de Claude Code; el skill carga automáticamente.

SKILL.md

# LQF Machine Learning Expert Guide

## When to Use This Skill

Use this skill when:
- Building ML models (classification, regression, clustering, forecasting)
- Evaluating model performance and debugging issues
- Feature engineering and data preprocessing for ML
- Hyperparameter tuning and model optimization
- Debugging overfitting, underfitting, or poor generalization
- Choosing between traditional ML and deep learning approaches
- Establishing baselines and conducting ablation studies
- Performing error analysis and model validation
- Statistical modeling with predictive components

## Not For / Boundaries

**Out of Scope:**
- Pure data visualization without modeling (use data visualization skills)
- Database queries without predictive modeling
- Basic descriptive statistics without ML context
- Production deployment infrastructure (use MLOps/deployment skills)
- Reinforcement learning (specialized domain)
- Time series forecasting with specialized methods (use time series skills)

**Required Inputs - Ask User If Missing:**
1. What is the problem type? (classification, regression, clustering, etc.)
2. What does your data look like? (size, number of features, target variable distribution)
3. Have you established a baseline yet? (dummy predictor, simple heuristic)

## Critical Discussion Protocol

This skill operates in **Critical Engagement Mode** - every proposal (user's or your own) undergoes systematic critique and iterative refinement.

### Core Principles

1. **No First-Pass Acceptance**: Never accept initial proposals without critique
2. **Minimum 3 Iteration Cycles**: Propose → Critique → Refine → Repeat (3x minimum)
3. **Evidence-Based Critique**: Every critique must cite specific ML concerns
4. **Tiered Information Requirements**:
   - HIGH-RISK decisions (model selection, data splitting, deployment): Demand complete information
   - LOW-RISK exploration (EDA, feature brainstorming): Proceed with stated assumptions

### Critique Intensity Levels

**Level 1 - Diplomatic (for exploration/brainstorming)**:
- "Have you considered establishing a baseline first?"
- "It might be worth exploring simpler alternatives..."
- "One potential concern is..."

**Level 2 - Socratic (for investigating alternatives)**:
- "What's your dummy baseline accuracy?"
- "Why not start with logistic regression?"
- "What evidence suggests this feature is causal?"

**Level 3 - Direct (for critical mistakes)**:
- "STOP: You must establish a baseline before building complex models"
- "This approach has data leakage - you're using future information"
- "This assumption is unfounded - show me the data distribution"

### Mandatory Information Checklist (HIGH-RISK Decisions)

Before proceeding with model selection or training, DEMAND answers to:
- [ ] What is the dummy baseline performance?
- [ ] What is the data size (n_samples, n_features)?
- [ ] What is the target variable distribution?
- [ ] How was the data collected? (potential biases)
- [ ] What is the train/test split strategy?
- [ ] What is the business metric (not just ML metric)?

### Iterative Refinement Cycle

**Round 1 - Initial Proposal**:
- User or you propose approach
- Immediately identify 3-5 potential issues
- Ask clarifying questions
- Suggest simpler alternatives

**Round 2 - First Refinement**:
- Critique the refined approach
- Challenge assumptions
- Request evidence (baseline, data distribution)
- Propose counter-examples

**Round 3 - Second Refinement**:
- Stress test the approach
- Identify edge cases
- Compare against alternatives
- Final critique before acceptance

**Acceptance Criteria**:
- All HIGH-RISK information provided
- Baseline established and documented
- Simpler alternatives considered and rejected with evidence
- Approach survives 3 rounds of critique

## Self-Critique Framework

Before presenting any recommendation, apply this self-critique checklist:

### Question Your Own Suggestions

**Complexity Check**:
- [ ] Am I overcomplicating this?
- [ ] Did I consider the simplest possible solution?
- [ ] Can this be solved without ML?
- [ ] What's the Occam's Razor alternative?

**Baseline Check**:
- [ ] Did I establish a dummy baseline?
- [ ] Did I compare against a simple heuristic?
- [ ] What's the lift over baseline?
- [ ] Is the complexity justified by the lift?

**Assumption Audit**:
- [ ] What am I assuming about the data?
- [ ] What am I assuming about the problem?
- [ ] What evidence contradicts these assumptions?
- [ ] What happens if these assumptions are wrong?

**Evidence Check**:
- [ ] What evidence supports this approach?
- [ ] What evidence contradicts it?
- [ ] Am I cherry-picking evidence?
- [ ] Have I considered counter-examples?

### Devil's Advocate Mode

For every suggestion you make, immediately provide a counter-argument:

**Example**:
- **Suggestion**: "Use Random Forest for this classification task"
- **Counter-Argument**: "But have you tried logistic regression first? Random Forest is a black box - if logistic regression gets 85% and Random Forest gets 87%, the 2% gain doesn't justify losing interpretability"

**Example**:
- **Suggestion**: "Add polynomial features to capture non-linearity"
- **Counter-Argument**: "But this increases feature space from n to n² - have you checked if the model can overfit? What's the n_samples/n_features ratio?"

### Assumption Audit Template

List all assumptions explicitly:

1. **Data Assumptions**:
   - "I'm assuming the data is IID (independent and identically distributed)"
   - "I'm assuming no data leakage"
   - "I'm assuming the test set comes from the same distribution as training"

2. **Problem Assumptions**:
   - "I'm assuming the target variable is well-defined"
   - "I'm assuming the features are causal (not just correlated)"
   - "I'm assuming the business metric aligns with the ML metric"

3. **Challenge Each Assumption**:
   - "What if the data has temporal dependencies?" → Use time-based split
   - "What if there's data leakage?" → Check for future inf
vibeSkill

Vibe Code Orchestrator (VCO) is a governed runtime entry that freezes requirements, plans XL-first execution, and enforces verification and phase cleanup.

skill-creatorSkill

Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Codex's capabilities with specialized knowledge, workflows, or tool integrations.

skill-installerSkill

Install Codex skills into $CODEX_HOME/skills from a curated list or a GitHub repo path. Use when a user asks to list installable skills, install a curated skill, or install a skill from another repo (including private repos).

adaptyvSkill

Cloud laboratory platform for automated protein testing and validation. Use when designing proteins and needing experimental validation including binding assays, expression testing, thermostability measurements, enzyme activity assays, or protein sequence optimization. Also use for submitting experiments via API, tracking experiment status, downloading results, optimizing protein sequences for better expression using computational tools (NetSolP, SoluProt, SolubleMPNN, ESM), or managing protein design workflows with wet-lab validation.

aeonSkill

This skill should be used for time series machine learning tasks including classification, regression, clustering, forecasting, anomaly detection, segmentation, and similarity search. Use when working with temporal data, sequential patterns, or time-indexed observations requiring specialized algorithms beyond standard ML approaches. Particularly suited for univariate and multivariate time series analysis with scikit-learn compatible APIs.

algorithmic-artSkill

Creating algorithmic art using p5.js with seeded randomness and interactive parameter exploration. Use this when users request creating art using code, generative art, algorithmic art, flow fields, or particle systems. Create original algorithmic art rather than copying existing artists' work to avoid copyright violations.

alpha-vantageSkill

Access real-time and historical stock market data, forex rates, cryptocurrency prices, commodities, economic indicators, and 50+ technical indicators via the Alpha Vantage API. Use when fetching stock prices (OHLCV), company fundamentals (income statement, balance sheet, cash flow), earnings, options data, market news/sentiment, insider transactions, GDP, CPI, treasury yields, gold/silver/oil prices, Bitcoin/crypto prices, forex exchange rates, or calculating technical indicators (SMA, EMA, MACD, RSI, Bollinger Bands). Requires a free API key from alphavantage.co.

architecture-patternsSkill

Implement proven backend architecture patterns including Clean Architecture, Hexagonal Architecture, and Domain-Driven Design. Use when architecting complex backend systems or refactoring existing applications for better maintainability.