nano-pdf
nano-pdf is a CLI tool that creates or edits PDF files using natural-language instructions without requiring manual formatting knowledge. Use it when users request PDF creation, page editing, text modifications, content conversion to PDF format, or multi-page document assembly, running separate commands for each page as needed.
git clone --depth 1 https://github.com/swarmclawai/swarmclaw /tmp/nano-pdf && cp -r /tmp/nano-pdf/skills/nano-pdf ~/.claude/skills/nano-pdf
SKILL.md
# nano-pdf

Use `nano-pdf` to apply edits to a specific page in a PDF using a natural-language instruction.

## Quick Start

```bash
nano-pdf edit deck.pdf 1 "Change the title to 'Q3 Results' and fix the typo in the subtitle"
```

## Creating a New PDF

```bash
nano-pdf create output.pdf "Create a one-page summary of quarterly results with a header, bullet points, and a footer"
```

## Usage in SwarmClaw

When a user asks to create or edit a PDF:

1. Check if `nano-pdf` is installed: `which nano-pdf`
2. If not installed, install via `uv tool install nano-pdf` or `pip install nano-pdf`
3. Run the appropriate command
4. Report the output file path to the user

## Notes

- Page numbers are 0-based or 1-based depending on the tool's version; if the result looks off by one, retry with the other.
- Always sanity-check the output PDF before reporting success.
- For multi-page edits, run separate commands per page.
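The workflow and notes above can be sketched as a few shell helpers. This is a minimal illustration, not part of the skill itself: the function names `ensure_nano_pdf`, `looks_like_pdf`, and `edit_pages` are hypothetical, and the only `nano-pdf` invocation used is the `nano-pdf edit <file> <page> <instruction>` form shown in the Quick Start. The sanity check relies only on the fact that a PDF file begins with the `%PDF-` header; it makes no assumption about where `nano-pdf` writes its output.

```bash
#!/usr/bin/env bash

# Steps 1-2: make sure nano-pdf is available, installing it if it is missing.
# Prefers uv when present, otherwise falls back to pip.
ensure_nano_pdf() {
  command -v nano-pdf >/dev/null 2>&1 && return 0
  if command -v uv >/dev/null 2>&1; then
    uv tool install nano-pdf
  else
    pip install nano-pdf
  fi
}

# Sanity check (hypothetical helper): the file exists, is non-empty,
# and starts with the "%PDF-" header.
looks_like_pdf() {
  local path="$1"
  [ -s "$path" ] && [ "$(head -c 5 "$path")" = "%PDF-" ]
}

# Multi-page edits: one nano-pdf invocation per page.
# Arguments: the PDF path, then alternating page numbers and instructions.
# Stops at the first failing edit.
edit_pages() {
  local pdf="$1"
  shift
  while [ "$#" -ge 2 ]; do
    nano-pdf edit "$pdf" "$1" "$2" || return 1
    shift 2
  done
}
```

A call might look like `ensure_nano_pdf && edit_pages deck.pdf 1 "Fix the title" 2 "Update the chart caption"`, followed by `looks_like_pdf` on whichever path the tool reports as its output before that path is passed on to the user.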
Delegate coding tasks to external coding agents (Claude Code, Codex, Pi, OpenCode) via shell. Use when: (1) building new features or apps in a separate project, (2) reviewing PRs, (3) refactoring large codebases, (4) iterative coding that needs file exploration. NOT for: simple one-liner fixes (just edit directly), reading code (use read/file tools), or work inside the SwarmClaw workspace itself.
GitHub operations via `gh` CLI: issues, PRs, CI runs, code review, API queries. Use when: (1) checking PR status or CI, (2) creating/commenting on issues, (3) listing/filtering PRs or issues, (4) viewing run logs. NOT for: local git operations (use git directly), non-GitHub repos, or cloning (use git clone).
Use Google Workspace CLI (`gws`) for Drive, Docs, Sheets, Gmail, Calendar, Chat, and related Workspace API tasks.
Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro). Use when asked to create, generate, or edit images and a Gemini API key is available. Supports text-to-image generation, single-image editing, and multi-image composition (up to 14 images).
Generate images via OpenAI Images API (GPT Image, DALL-E 3, DALL-E 2). Supports batch generation with random prompt sampler and HTML gallery output. Use when asked to generate images with OpenAI and an OPENAI_API_KEY is available.
Always-on guidance for solving tasks resourcefully. Teaches agents to escalate through skills, CLI tools, and custom scripts instead of refusing. Applies to any request where the agent lacks a dedicated tool.
Create, edit, improve, or audit skills for SwarmClaw agents. Use when creating a new skill from scratch or when asked to improve, review, audit, tidy up, or clean up an existing skill or SKILL.md file. Also use when editing or restructuring a skill directory. Triggers on phrases like "create a skill", "author a skill", "tidy up a skill", "improve this skill", "review the skill", "clean up the skill", "audit the skill".
Summarize or extract text/transcripts from URLs, podcasts, YouTube videos, and local files using the summarize CLI. Use when asked to summarize a link, article, video, or file, or to transcribe a YouTube video.