openai-whisper-api
This Claude Code skill transcribes audio files using OpenAI's Whisper API via curl commands. Use it when you need to convert speech in audio files (m4a, ogg, and other formats) into text transcripts, with optional language hints or speaker name prompts to improve accuracy.
git clone --depth 1 https://github.com/the-open-agent/openagent /tmp/openai-whisper-api && cp -r /tmp/openai-whisper-api/skills/openai-whisper-api ~/.claude/skills/openai-whisper-apiSKILL.md
# OpenAI Whisper API (curl)
Transcribe an audio file via OpenAI's `/v1/audio/transcriptions` endpoint. Set `OPENAI_BASE_URL` to use an OpenAI-compatible proxy or local gateway.
## Quick start
```bash
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-F "file=@/path/to/audio.m4a" \
-F "model=whisper-1" \
-F "response_format=text" \
> transcript.txt
```
Defaults:
- Model: `whisper-1`
- Output format: `text`
## Options
```bash
# With language hint
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-F "file=@audio.ogg" \
-F "model=whisper-1" \
-F "response_format=text" \
-F "language=en" \
> transcript.txt
# With speaker hint (prompt)
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-F "file=@audio.m4a" \
-F "model=whisper-1" \
-F "response_format=text" \
-F "prompt=Speaker names: Peter, Daniel" \
> transcript.txt
# JSON output
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-F "file=@audio.m4a" \
-F "model=whisper-1" \
-F "response_format=json" \
> transcript.json
```
## Custom base URL
Set `OPENAI_BASE_URL` to use an OpenAI-compatible proxy or local gateway:
```bash
API_BASE="${OPENAI_BASE_URL:-https://api.openai.com/v1}"
curl -sS "${API_BASE}/audio/transcriptions" \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-F "file=@audio.m4a" \
-F "model=whisper-1" \
-F "response_format=text" \
> transcript.txt
```
## API key
Set `OPENAI_API_KEY` environment variable before running commands.Set up and use 1Password CLI (op). Use when installing the CLI, enabling desktop app integration, signing in (single or multi-account), or reading/injecting/running secrets via op.
Manage Apple Notes via the `memo` CLI on macOS (create, view, edit, delete, search, move, and export notes). Use when a user asks OpenClaw to add a note, list notes, search notes, or manage note folders.
Manage Apple Reminders via remindctl CLI (list, add, edit, complete, delete). Supports lists, date filters, and JSON/plain output.
Create, search, and manage Bear notes via grizzly CLI.
Monitor blogs and RSS/Atom feeds for updates using the blogwatcher CLI.
BluOS CLI (blu) for discovery, playback, grouping, and volume.
Use when you need to send or manage iMessages via BlueBubbles (recommended iMessage integration). Calls go through the generic message tool with channel="bluebubbles".
Capture frames or clips from RTSP/ONVIF cameras.