Skip to main content
ClaudeWave
Skill578 repo starsupdated yesterday

nano-pdf

nano-pdf is a CLI tool that creates or edits PDF files using natural-language instructions without requiring manual formatting knowledge. Use it when users request PDF creation, page editing, text modifications, content conversion to PDF format, or multi-page document assembly, running separate commands for each page as needed.

Install in Claude Code
Copy
git clone --depth 1 https://github.com/swarmclawai/swarmclaw /tmp/nano-pdf && cp -r /tmp/nano-pdf/skills/nano-pdf ~/.claude/skills/nano-pdf
Then start a new Claude Code session; the skill loads automatically.

SKILL.md

# nano-pdf

Use `nano-pdf` to apply edits to a specific page in a PDF using a natural-language instruction.

## Quick Start

```bash
nano-pdf edit deck.pdf 1 "Change the title to 'Q3 Results' and fix the typo in the subtitle"
```

## Creating a New PDF

```bash
nano-pdf create output.pdf "Create a one-page summary of quarterly results with a header, bullet points, and a footer"
```

## Usage in SwarmClaw

When a user asks to create or edit a PDF:

1. Check if `nano-pdf` is installed: `which nano-pdf`
2. If not installed, install via `uv tool install nano-pdf` or `pip install nano-pdf`
3. Run the appropriate command
4. Report the output file path to the user

## Notes

- Page numbers are 0-based or 1-based depending on the tool's version; if the result looks off by one, retry with the other.
- Always sanity-check the output PDF before reporting success.
- For multi-page edits, run separate commands per page.
coding-agentSkill

Delegate coding tasks to external coding agents (Claude Code, Codex, Pi, OpenCode) via shell. Use when: (1) building new features or apps in a separate project, (2) reviewing PRs, (3) refactoring large codebases, (4) iterative coding that needs file exploration. NOT for: simple one-liner fixes (just edit directly), reading code (use read/file tools), or work inside the SwarmClaw workspace itself.

githubSkill

GitHub operations via `gh` CLI: issues, PRs, CI runs, code review, API queries. Use when: (1) checking PR status or CI, (2) creating/commenting on issues, (3) listing/filtering PRs or issues, (4) viewing run logs. NOT for: local git operations (use git directly), non-GitHub repos, or cloning (use git clone).

google-workspaceSkill

Use Google Workspace CLI (`gws`) for Drive, Docs, Sheets, Gmail, Calendar, Chat, and related Workspace API tasks.

nano-banana-proSkill

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro). Use when asked to create, generate, or edit images and a Gemini API key is available. Supports text-to-image generation, single-image editing, and multi-image composition (up to 14 images).

openai-image-genSkill

Generate images via OpenAI Images API (GPT Image, DALL-E 3, DALL-E 2). Supports batch generation with random prompt sampler and HTML gallery output. Use when asked to generate images with OpenAI and an OPENAI_API_KEY is available.

resourceful-problem-solvingSkill

Always-on guidance for solving tasks resourcefully. Teaches agents to escalate through skills, CLI tools, and custom scripts instead of refusing. Applies to any request where the agent lacks a dedicated tool.

skill-creatorSkill

Create, edit, improve, or audit skills for SwarmClaw agents. Use when creating a new skill from scratch or when asked to improve, review, audit, tidy up, or clean up an existing skill or SKILL.md file. Also use when editing or restructuring a skill directory. Triggers on phrases like "create a skill", "author a skill", "tidy up a skill", "improve this skill", "review the skill", "clean up the skill", "audit the skill".

summarizeSkill

Summarize or extract text/transcripts from URLs, podcasts, YouTube videos, and local files using the summarize CLI. Use when asked to summarize a link, article, video, or file, or to transcribe a YouTube video.