Skip to main content
ClaudeWave
Skill578 estrellas del repoactualizado 2d ago

openai-image-gen

This Claude Code skill generates images using OpenAI's image generation APIs, supporting GPT Image, DALL-E 3, and DALL-E 2 models with configurable parameters like size, quality, and style. Use it when you need to create images via OpenAI and have an OPENAI_API_KEY available, including batch generation with HTML gallery output for reviewing results.

Instalar en Claude Code
Copiar
git clone --depth 1 https://github.com/swarmclawai/swarmclaw /tmp/openai-image-gen && cp -r /tmp/openai-image-gen/skills/openai-image-gen ~/.claude/skills/openai-image-gen
Después abre una sesión nueva de Claude Code; el skill carga automáticamente.

SKILL.md

# OpenAI Image Gen

Generate images via the OpenAI Images API with an HTML gallery viewer.

## Run

Note: Image generation can take longer than typical timeouts. Set a higher timeout when running via shell (e.g., 300 seconds).

```bash
python3 {baseDir}/scripts/gen.py
```

## Useful Flags

```bash
# GPT image models with various options
python3 {baseDir}/scripts/gen.py --count 16 --model gpt-image-1
python3 {baseDir}/scripts/gen.py --prompt "ultra-detailed studio photo of a lobster astronaut" --count 4
python3 {baseDir}/scripts/gen.py --size 1536x1024 --quality high --out-dir ./out/images
python3 {baseDir}/scripts/gen.py --model gpt-image-1.5 --background transparent --output-format webp

# DALL-E 3 (note: count is automatically limited to 1)
python3 {baseDir}/scripts/gen.py --model dall-e-3 --quality hd --size 1792x1024 --style vivid
python3 {baseDir}/scripts/gen.py --model dall-e-3 --style natural --prompt "serene mountain landscape"

# DALL-E 2
python3 {baseDir}/scripts/gen.py --model dall-e-2 --size 512x512 --count 4
```

## Model-Specific Parameters

### Size

- **GPT image models** (`gpt-image-1`, `gpt-image-1-mini`, `gpt-image-1.5`): `1024x1024`, `1536x1024` (landscape), `1024x1536` (portrait), or `auto`. Default: `1024x1024`
- **dall-e-3**: `1024x1024`, `1792x1024`, or `1024x1792`. Default: `1024x1024`
- **dall-e-2**: `256x256`, `512x512`, or `1024x1024`. Default: `1024x1024`

### Quality

- **GPT image models**: `auto`, `high`, `medium`, or `low`. Default: `high`
- **dall-e-3**: `hd` or `standard`. Default: `standard`
- **dall-e-2**: `standard` only

### Other Parameters

- **GPT image models** support `--background` (`transparent`, `opaque`, `auto`) and `--output-format` (`png`, `jpeg`, `webp`)
- **dall-e-3** supports `--style` (`vivid` for hyper-real, `natural` for more natural looking)
- **dall-e-3** only supports `n=1`; the script automatically limits count to 1

## Output

- Image files (`*.png`, `*.jpeg`, or `*.webp` depending on model and format)
- `prompts.json` (prompt-to-file mapping)
- `index.html` (thumbnail gallery — open in browser to review)
coding-agentSkill

Delegate coding tasks to external coding agents (Claude Code, Codex, Pi, OpenCode) via shell. Use when: (1) building new features or apps in a separate project, (2) reviewing PRs, (3) refactoring large codebases, (4) iterative coding that needs file exploration. NOT for: simple one-liner fixes (just edit directly), reading code (use read/file tools), or work inside the SwarmClaw workspace itself.

githubSkill

GitHub operations via `gh` CLI: issues, PRs, CI runs, code review, API queries. Use when: (1) checking PR status or CI, (2) creating/commenting on issues, (3) listing/filtering PRs or issues, (4) viewing run logs. NOT for: local git operations (use git directly), non-GitHub repos, or cloning (use git clone).

google-workspaceSkill

Use Google Workspace CLI (`gws`) for Drive, Docs, Sheets, Gmail, Calendar, Chat, and related Workspace API tasks.

nano-banana-proSkill

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro). Use when asked to create, generate, or edit images and a Gemini API key is available. Supports text-to-image generation, single-image editing, and multi-image composition (up to 14 images).

nano-pdfSkill

Edit or create PDFs with natural-language instructions using the nano-pdf CLI. Use when asked to make a PDF, edit a PDF, add pages, change text in a PDF, or convert content to PDF format.

resourceful-problem-solvingSkill

Always-on guidance for solving tasks resourcefully. Teaches agents to escalate through skills, CLI tools, and custom scripts instead of refusing. Applies to any request where the agent lacks a dedicated tool.

skill-creatorSkill

Create, edit, improve, or audit skills for SwarmClaw agents. Use when creating a new skill from scratch or when asked to improve, review, audit, tidy up, or clean up an existing skill or SKILL.md file. Also use when editing or restructuring a skill directory. Triggers on phrases like "create a skill", "author a skill", "tidy up a skill", "improve this skill", "review the skill", "clean up the skill", "audit the skill".

summarizeSkill

Summarize or extract text/transcripts from URLs, podcasts, YouTube videos, and local files using the summarize CLI. Use when asked to summarize a link, article, video, or file, or to transcribe a YouTube video.