Skill1.1k estrellas del repoactualizado 5d ago

blog-feed-monitor

Blog Feed Monitor scrapes blog posts from RSS and Atom feeds with automatic feed discovery, optional keyword and date filtering, and a fallback to Apify for JavaScript-heavy sites. Use this skill to monitor multiple blogs for recent content without API costs, or when you need structured post data from sites that don't expose standard feed formats.

Ver fuente Repositorio: goose-skills

Instalar en Claude Code

Copiar

git clone --depth 1 https://github.com/gooseworks-ai/goose-skills /tmp/blog-feed-monitor && cp -r /tmp/blog-feed-monitor/skills/capabilities/blog-feed-monitor ~/.claude/skills/blog-feed-monitor

Después abre una sesión nueva de Claude Code; el skill carga automáticamente.

Definición

SKILL.md

# Blog Feed Monitor

Scrape blog posts via RSS/Atom feeds (free) with optional Apify fallback for JS-heavy sites.

## Quick Start

No API key needed for RSS mode.

```bash
# Scrape a blog's RSS feed
python3 skills/blog-feed-monitor/scripts/scrape_blogs.py \
  --urls "https://example.com/blog" --days 30

# Multiple blogs with keyword filter
python3 skills/blog-feed-monitor/scripts/scrape_blogs.py \
  --urls "https://blog1.com,https://blog2.com" --keywords "AI,marketing" --output summary

# Force Apify for JS-heavy sites
python3 skills/blog-feed-monitor/scripts/scrape_blogs.py \
  --urls "https://example.com" --mode apify
```

## How It Works

### Auto Mode (default)
1. For each URL, tries to discover an RSS/Atom feed:
   - Checks HTML `<link rel="alternate">` tags
   - Probes common paths: `/feed`, `/rss`, `/atom.xml`, `/feed.xml`, `/rss.xml`, `/blog/feed`, `/index.xml`
2. Parses discovered feeds (supports RSS 2.0 and Atom)
3. If any URLs fail, falls back to Apify `jupri/rss-xml-scraper` (if token available)
4. Applies date and keyword filtering client-side

> **Note:** The Apify fallback actor `jupri/rss-xml-scraper` may need updating -- it has not been verified recently. RSS mode works reliably without it.

### RSS Mode
Only tries RSS feeds, no Apify fallback.

### Apify Mode
Uses Apify actor directly, skipping RSS discovery.

## CLI Reference

| Flag | Default | Description |
|------|---------|-------------|
| `--urls` | *required* | Blog URL(s), comma-separated |
| `--keywords` | none | Keywords to filter (comma-separated, OR logic) |
| `--days` | 30 | Only include posts from last N days |
| `--max-posts` | 50 | Max posts to return |
| `--mode` | auto | `auto` (RSS + fallback), `rss` (RSS only), `apify` (Apify only) |
| `--output` | json | Output format: `json` or `summary` |
| `--token` | env var | Apify token (only needed for Apify mode/fallback) |
| `--timeout` | 300 | Max seconds for Apify run |

## Cost

- **RSS mode:** Free (no API, no tokens)
- **Apify mode:** Uses `jupri/rss-xml-scraper` -- minimal Apify credits

Del mismo repositorio

aeoSkill

ai-video-calls-tavusSkill

AI video conversations - create real-time video calls with AI personas

ai-web-scraping-scrapegraphSkill

AI-powered web scraping - extract data using natural language prompts

amazon-searchSkill

Search Amazon products - find items, compare prices, read reviews

api-testerSkill

Test and document API endpoints - validate responses, check status, generate examples

apollo-lead-finderSkill

brand-intel-branddevSkill

Brand intelligence - logos, colors, fonts, styleguides, and company data from any domain

brand-voice-extractorSkill