Skip to main content
ClaudeWave
Skill2.7k estrellas del repoactualizado 7d ago

openai-whisper-api

This Claude Code skill provides a bash wrapper around OpenAI's Whisper audio transcription API, converting audio files in formats like M4A and OGG to text transcripts. Use it when you need to transcribe audio recordings via OpenAI's service, with support for language specification, custom prompts, and JSON output formatting.

Instalar en Claude Code
Copiar
git clone --depth 1 https://github.com/moltis-org/moltis /tmp/openai-whisper-api && cp -r /tmp/openai-whisper-api/crates/skills/src/assets/audio/openai-whisper-api ~/.claude/skills/openai-whisper-api
Después abre una sesión nueva de Claude Code; el skill carga automáticamente.

SKILL.md

# OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s `/v1/audio/transcriptions` endpoint. Set `OPENAI_BASE_URL` to use an OpenAI-compatible proxy or local gateway.

## Quick start

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
```

Defaults:

- Model: `whisper-1`
- Output: `<input>.txt`

## Useful flags

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
```

## API key

Set `OPENAI_API_KEY`, or configure it in the active Moltis config file (`$OPENCLAW_CONFIG_PATH`, default `~/.moltis/Moltis.json`). Optionally set `OPENAI_BASE_URL` (for example `http://127.0.0.1:51805/v1`) to use an OpenAI-compatible proxy or local gateway:

```json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}
```