Skip to main content
ClaudeWave
Skill1.3k repo starsupdated today

openai-whisper-api

This skill transcribes audio files using OpenAI's Whisper API through a bash wrapper script. Use it when you need to convert speech in audio formats like M4A or OGG to text, with options to specify language, speaker names, or output format including JSON.

Install in Claude Code
Copy
git clone --depth 1 https://github.com/trpc-group/trpc-agent-go /tmp/openai-whisper-api && cp -r /tmp/openai-whisper-api/openclaw/skills/openai-whisper-api ~/.claude/skills/openai-whisper-api
Then start a new Claude Code session; the skill loads automatically.

SKILL.md

# OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s `/v1/audio/transcriptions` endpoint.

## Quick start

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
```

Defaults:

- Model: `whisper-1`
- Output: `<input>.txt`

## Useful flags

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
```

## API key

Set `OPENAI_API_KEY`, or configure it in `~/.openclaw/openclaw.json`:

```json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}
```