ClaudeWave
Skill · 2.7k repo stars · updated 7d ago

openai-whisper-api

This Claude Code skill provides a bash wrapper around OpenAI's Whisper audio transcription API, converting audio files in formats like M4A and OGG to text transcripts. Use it when you need to transcribe audio recordings via OpenAI's service, with support for language specification, custom prompts, and JSON output formatting.

Install in Claude Code
git clone --depth 1 https://github.com/moltis-org/moltis /tmp/openai-whisper-api && cp -r /tmp/openai-whisper-api/crates/skills/src/assets/audio/openai-whisper-api ~/.claude/skills/openai-whisper-api
Then start a new Claude Code session; the skill loads automatically.

SKILL.md

# OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s `/v1/audio/transcriptions` endpoint. Set `OPENAI_BASE_URL` to use an OpenAI-compatible proxy or local gateway.
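
As a rough illustration of how the endpoint could be derived from `OPENAI_BASE_URL`, here is a minimal sketch. The function name `transcriptions_url` is hypothetical, and the fallback to `https://api.openai.com/v1` when the variable is unset is an assumption about the script, not something this document confirms. It also assumes the base URL already includes the `/v1` segment.

```bash
# Hypothetical helper: print the transcription endpoint for the current environment.
transcriptions_url() {
  # Assumed default when OPENAI_BASE_URL is unset or empty.
  local base="${OPENAI_BASE_URL:-https://api.openai.com/v1}"
  # Drop one trailing slash so the joined URL does not contain "//".
  base="${base%/}"
  printf '%s\n' "${base}/audio/transcriptions"
}
```

Calling `transcriptions_url` with `OPENAI_BASE_URL` pointing at a local gateway would then yield that gateway's `/audio/transcriptions` path instead of OpenAI's.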

## Quick start

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
```

Defaults:

- Model: `whisper-1`
- Output: `<input>.txt`

## Useful flags

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
```
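
The flags above plausibly map onto the multipart form fields that OpenAI documents for this endpoint (`file`, `model`, `language`, `prompt`, `response_format`). The sketch below shows one way such a request could be assembled with curl. The function `whisper_request`, the `DRY_RUN` switch, and the `text` default for `response_format` are all assumptions made for illustration; the actual `transcribe.sh` may differ.

```bash
# Hypothetical helper, not the skill's actual script.
# Usage: whisper_request <audio> <out> [model] [format] [language] [prompt]
whisper_request() {
  local file="$1" out="$2" model="${3:-whisper-1}" format="${4:-text}"
  local language="${5:-}" prompt="${6:-}"

  # Reuse the positional parameters as the curl argument list.
  set -- -F "file=@${file}" -F "model=${model}" -F "response_format=${format}"
  # --form-string sends the value literally, so ';' or '@' in a prompt is not interpreted.
  if [ -n "$language" ]; then set -- "$@" --form-string "language=${language}"; fi
  if [ -n "$prompt" ]; then set -- "$@" --form-string "prompt=${prompt}"; fi

  # With DRY_RUN set, print one curl argument per line instead of calling the API.
  if [ -n "${DRY_RUN:-}" ]; then printf '%s\n' "$@"; return 0; fi

  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set" >&2
    return 1
  fi

  # Assumed default base URL; OPENAI_BASE_URL overrides it.
  curl -sS --fail "${OPENAI_BASE_URL:-https://api.openai.com/v1}/audio/transcriptions" \
    -H "Authorization: Bearer ${OPENAI_API_KEY}" \
    "$@" -o "$out"
}
```

Running `DRY_RUN=1 whisper_request /path/to/audio.m4a /tmp/transcript.json "" json en` prints the form fields without contacting the API, which is a convenient way to check what would be sent.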

## API key

Set `OPENAI_API_KEY` in the environment. Optionally set `OPENAI_BASE_URL` (for example `http://127.0.0.1:51805/v1`) to route requests through an OpenAI-compatible proxy or local gateway. Alternatively, configure the key in the active Moltis config file (`$OPENCLAW_CONFIG_PATH`, default `~/.moltis/Moltis.json`):

```json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}
```