Skill5.5k estrellas del repoactualizado 3d ago

openai-whisper-api

This Claude Code skill transcribes audio files using OpenAI's Whisper API via curl commands. Use it when you need to convert speech in audio files (m4a, ogg, and other formats) into text transcripts, with optional language hints or speaker name prompts to improve accuracy.

Ver fuente Repositorio: openagent

Instalar en Claude Code

Copiar

git clone --depth 1 https://github.com/the-open-agent/openagent /tmp/openai-whisper-api && cp -r /tmp/openai-whisper-api/skills/openai-whisper-api ~/.claude/skills/openai-whisper-api

Después abre una sesión nueva de Claude Code; el skill carga automáticamente.

Definición

SKILL.md

# OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI's `/v1/audio/transcriptions` endpoint. Set `OPENAI_BASE_URL` to use an OpenAI-compatible proxy or local gateway.

## Quick start

```bash
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -F "file=@/path/to/audio.m4a" \
  -F "model=whisper-1" \
  -F "response_format=text" \
  > transcript.txt
```

Defaults:

- Model: `whisper-1`
- Output format: `text`

## Options

```bash
# With language hint
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -F "file=@audio.ogg" \
  -F "model=whisper-1" \
  -F "response_format=text" \
  -F "language=en" \
  > transcript.txt

# With speaker hint (prompt)
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -F "file=@audio.m4a" \
  -F "model=whisper-1" \
  -F "response_format=text" \
  -F "prompt=Speaker names: Peter, Daniel" \
  > transcript.txt

# JSON output
curl -sS "https://api.openai.com/v1/audio/transcriptions" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -F "file=@audio.m4a" \
  -F "model=whisper-1" \
  -F "response_format=json" \
  > transcript.json
```

## Custom base URL

Set `OPENAI_BASE_URL` to use an OpenAI-compatible proxy or local gateway:

```bash
API_BASE="${OPENAI_BASE_URL:-https://api.openai.com/v1}"
curl -sS "${API_BASE}/audio/transcriptions" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -F "file=@audio.m4a" \
  -F "model=whisper-1" \
  -F "response_format=text" \
  > transcript.txt
```

## API key

Set `OPENAI_API_KEY` environment variable before running commands.

Del mismo repositorio

1passwordSkill

Set up and use 1Password CLI (op). Use when installing the CLI, enabling desktop app integration, signing in (single or multi-account), or reading/injecting/running secrets via op.

apple-notesSkill

Manage Apple Notes via the `memo` CLI on macOS (create, view, edit, delete, search, move, and export notes). Use when a user asks OpenClaw to add a note, list notes, search notes, or manage note folders.

apple-remindersSkill

Manage Apple Reminders via remindctl CLI (list, add, edit, complete, delete). Supports lists, date filters, and JSON/plain output.

bear-notesSkill

Create, search, and manage Bear notes via grizzly CLI.

blogwatcherSkill

Monitor blogs and RSS/Atom feeds for updates using the blogwatcher CLI.

blucliSkill

BluOS CLI (blu) for discovery, playback, grouping, and volume.

bluebubblesSkill

Use when you need to send or manage iMessages via BlueBubbles (recommended iMessage integration). Calls go through the generic message tool with channel="bluebubbles".

camsnapSkill

Capture frames or clips from RTSP/ONVIF cameras.