Skill2.8k estrellas del repoactualizado yesterday

sag

`sag` is a command-line tool that integrates ElevenLabs text-to-speech with local audio playback, supporting multiple voice models and SSML-style audio tags for expressive synthesis. Use it when you need to generate natural-sounding speech with character control, such as creating voice responses with specific emotional delivery or pronunciation adjustments, particularly with the v3 model's audio tags like whispers, shouts, and laughter.

Ver fuente Repositorio: moltis

Instalar en Claude Code

Copiar

git clone --depth 1 https://github.com/moltis-org/moltis /tmp/sag && cp -r /tmp/sag/crates/skills/src/assets/audio/sag ~/.claude/skills/sag

Después abre una sesión nueva de Claude Code; el skill carga automáticamente.

Definición

SKILL.md

# sag

Use `sag` for ElevenLabs TTS with local playback.

API key (required)

- `ELEVENLABS_API_KEY` (preferred)
- `SAG_API_KEY` also supported by the CLI

Quick start

- `sag "Hello there"`
- `sag speak -v "Roger" "Hello"`
- `sag voices`
- `sag prompting` (model-specific tips)

Model notes

- Default: `eleven_v3` (expressive)
- Stable: `eleven_multilingual_v2`
- Fast: `eleven_flash_v2_5`

Pronunciation + delivery rules

- First fix: respell (e.g. "key-note"), add hyphens, adjust casing.
- Numbers/units/URLs: `--normalize auto` (or `off` if it harms names).
- Language bias: `--lang en|de|fr|...` to guide normalization.
- v3: SSML `<break>` not supported; use `[pause]`, `[short pause]`, `[long pause]`.
- v2/v2.5: SSML `<break time="1.5s" />` supported; `<phoneme>` not exposed in `sag`.

v3 audio tags (put at the entrance of a line)

- `[whispers]`, `[shouts]`, `[sings]`
- `[laughs]`, `[starts laughing]`, `[sighs]`, `[exhales]`
- `[sarcastic]`, `[curious]`, `[excited]`, `[crying]`, `[mischievously]`
- Example: `sag "[whispers] keep this quiet. [short pause] ok?"`

Voice defaults

- `ELEVENLABS_VOICE_ID` or `SAG_VOICE_ID`

Confirm voice + speaker before long output.

## Chat voice responses

When the user asks for a "voice" reply (e.g., "crazy scientist voice", "explain in voice"), generate audio and send it:

```bash
# Generate audio file
sag -v Clawd -o /tmp/voice-reply.mp3 "Your message here"

# Then include in reply:
# MEDIA:/tmp/voice-reply.mp3
```

Voice character tips:

- Crazy scientist: Use `[excited]` tags, dramatic pauses `[short pause]`, vary intensity
- Calm: Use `[whispers]` or slower pacing
- Dramatic: Use `[sings]` or `[shouts]` sparingly

Default voice for Clawd: `lj2rcrvANS3gaWWnczSX` (or just `-v Clawd`)

Del mismo repositorio

shipSlash Command

Commit all changes, push branch, create/update PR, and run local validation

apple-notesSkill

Manage Apple Notes via the memo CLI on macOS (create, view, search, edit).

apple-remindersSkill

Manage Apple Reminders via remindctl CLI (list, add, complete, delete).

findmySkill

Track Apple devices and AirTags via FindMy.app on macOS using AppleScript and screen capture.

imessageSkill

Send and receive iMessages/SMS via the imsg CLI on macOS.

openai-whisper-apiSkill