Skill1.6k repo starsupdated today

openai-whisper-api

This skill transcribes audio files using OpenAI's Whisper API through a bash wrapper script. Use it when you need to convert speech in audio formats like M4A or OGG to text, with options to specify language, speaker names, or output format including JSON.

View source Repository: trpc-agent-go

Install in Claude Code

Copy

git clone --depth 1 https://github.com/trpc-group/trpc-agent-go /tmp/openai-whisper-api && cp -r /tmp/openai-whisper-api/openclaw/skills/openai-whisper-api ~/.claude/skills/openai-whisper-api

Then start a new Claude Code session; the skill loads automatically.

Definition

SKILL.md

# OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s `/v1/audio/transcriptions` endpoint.

## Quick start

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
```

Defaults:

- Model: `whisper-1`
- Output: `<input>.txt`

## Useful flags

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
```

## API key

Set `OPENAI_API_KEY`, or configure it in `~/.openclaw/openclaw.json`:

```json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}
```