Skill · 63 repo stars · updated 2 months ago

ocr-document-processor

Extract text and structure from scans, images, and scanned PDFs. Use for OCR, searchable PDFs, table extraction, receipt parsing, and business card parsing.

View source · Repository: chatgpt-skills

Install in Claude Code

git clone --depth 1 https://github.com/dkyazzentwatwa/chatgpt-skills /tmp/ocr-document-processor && cp -r /tmp/ocr-document-processor/ocr-document-processor ~/.claude/skills/ocr-document-processor

Then open a new Claude Code session; the skill loads automatically.

Definition

SKILL.md

# OCR Document Processor

Handle OCR-heavy inputs where text must be recovered from images or scanned pages.

## Use This For

- OCR on images and scanned PDFs
- Searchable PDF export
- Structured extraction to text, markdown, JSON, or HTML
- Table extraction from scanned material
- Receipt parsing and business card parsing

## Workflow

1. Decide whether plain OCR, structured extraction, or document-specific parsing is needed.
2. Preprocess noisy inputs before extraction when skew, blur, or shadows are present.
3. Use `scripts/ocr_processor.py` for core OCR tasks.
4. Use the focused helpers when the input is specialized:
   - `scripts/business_card_scanner.py`
   - `scripts/receipt_scanner.py`
5. Return confidence caveats when the source is low quality, rotated, handwritten, or multilingual.
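
The decision steps above can be sketched as a small planning function. This is a minimal illustration, not the skill's actual logic: the script paths come from the workflow, but `plan_ocr`, `Plan`, the `doc_kind` labels, and the flag names are hypothetical, and how each script is invoked is not specified here.

```python
from dataclasses import dataclass, field

# Script paths named in the workflow; their command-line interfaces are not shown.
CORE_SCRIPT = "scripts/ocr_processor.py"
SPECIALIZED_SCRIPTS = {
    "business_card": "scripts/business_card_scanner.py",
    "receipt": "scripts/receipt_scanner.py",
}

# Hypothetical flag names describing the input; chosen for illustration only.
PREPROCESS_FLAGS = {"skew", "blur", "shadows"}
CAVEAT_FLAGS = {"low_quality", "rotated", "handwritten", "multilingual"}


@dataclass
class Plan:
    script: str                 # which script to run (steps 3 and 4)
    preprocess: bool            # whether to clean the input first (step 2)
    caveats: list[str] = field(default_factory=list)  # what to warn about (step 5)


def plan_ocr(doc_kind: str, flags: set[str]) -> Plan:
    """Map an input description to a script, a preprocessing decision, and caveats."""
    script = SPECIALIZED_SCRIPTS.get(doc_kind, CORE_SCRIPT)
    preprocess = bool(flags & PREPROCESS_FLAGS)
    caveats = sorted(flags & CAVEAT_FLAGS)
    return Plan(script, preprocess, caveats)
```

For example, `plan_ocr("receipt", {"blur", "handwritten"})` selects the receipt helper, requests preprocessing because of the blur, and carries a `handwritten` caveat forward to the final answer.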

## Guardrails

- Prefer explicit language selection when accuracy matters.
- Do not claim fields are exact when OCR confidence is weak.
- By default, route non-scanned (born-digital) PDFs to `document-converter-suite` instead of running OCR.
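
A rough sketch of how the last two guardrails might be applied in practice. Everything here is an assumption for illustration: the `route_pdf` and `annotate_fields` helpers, the `(value, confidence)` field shape, and the 0.80 cutoff are hypothetical and do not describe the skill's scripts or any particular OCR engine.

```python
# Hypothetical cutoff; tune it for the OCR engine actually in use.
LOW_CONFIDENCE = 0.80


def route_pdf(has_text_layer: bool) -> str:
    """Pick a skill for a PDF: files that already carry text skip OCR by default."""
    if has_text_layer:
        return "document-converter-suite"
    return "ocr-document-processor"


def annotate_fields(fields, threshold=LOW_CONFIDENCE):
    """Mark each (value, confidence) pair that falls below the threshold for review."""
    return {
        name: {"value": value, "confidence": conf, "verify": conf < threshold}
        for name, (value, conf) in fields.items()
    }
```

Reporting a `verify` flag next to each value keeps weakly recognized fields from being presented as exact, while leaving the raw value available for a human check.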

From the same repository

crypto-ta-analyzer (Skill)

Run multi-indicator technical analysis on crypto or market OHLCV data. Use for deterministic trend, momentum, volume, and divergence analysis.

data-storyteller (Skill)

Analyze datasets and turn them into narrative reports with charts, audits, comparisons, and statistical summaries. Use for exploratory analysis and executive-ready outputs.

document-converter-suite (Skill)

Convert PDFs, Office docs, markdown, HTML, and tables between editable formats. Use for best-effort document conversion, PDF utilities, form filling, and table extraction.

financial-calculator (Skill)

Run loan, investment, NPV, retirement, savings, and risk calculations with schedules and charts. Use for deterministic financial modeling tasks.

geo-toolkit (Skill)

Parse, convert, geocode, visualize, and measure geographic data. Use for address cleanup, geo file conversion, mapping, and distance workflows.

image-enhancement-suite (Skill)

Process images for cleanup, conversion, metadata, comparison, icons, palettes, collages, and sprite sheets. Use for single-image or batch image workflows.

mcp-builder (Skill)

Plan and build MCP servers with agent-friendly tools, schemas, error handling, and evaluation. Use when creating or refactoring MCP integrations.

media-toolkit (Skill)

Process audio and video with clipping, conversion, analysis, captions, thumbnails, GIFs, and batch utilities. Use for practical media manipulation workflows.