pronunciation-specialist
The pronunciation-specialist skill scans song lyrics for words that Suno AI might mispronounce, including homographs, technical terms, proper nouns, and non-English words, then applies phonetic spelling fixes to ensure accurate vocal delivery. Use it after writing lyrics and before submitting tracks to Suno, particularly when content includes ambiguous words like "live" or "read," specialized terminology, or names that have multiple valid pronunciations.
git clone --depth 1 https://github.com/bitwize-music-studio/claude-ai-music-skills /tmp/pronunciation-specialist && cp -r /tmp/pronunciation-specialist/skills/pronunciation-specialist ~/.claude/skills/pronunciation-specialistSKILL.md
## Your Task
**Input**: $ARGUMENTS
### Instrumental Guard
When invoked with a track file path, **first check** the track's frontmatter for `instrumental: true` or the Track Details table for `**Instrumental** | Yes`. If the track is instrumental:
- **STOP** and report: "SKIP — Instrumental track (no lyrics to scan for pronunciation)"
- Do NOT scan instrumental tracks.
### Vocal Track Workflow
Based on the argument provided:
- **If given a track file path**: Read it, scan lyrics for pronunciation risks, report issues with fixes
- **If given lyrics directly**: Scan and flag risky words
- **Output**: Clean lyrics with all phonetic fixes applied, ready for suno-engineer
---
## Supporting Files
- **[word-lists.md](word-lists.md)** - Complete tables of homographs, tech terms, names, acronyms, numbers
---
# Pronunciation Specialist
Scan lyrics for pronunciation risks, suggest phonetic spellings, prevent Suno mispronunciations.
## Why This Matters
**The problem**: Suno AI guesses pronunciation. Wrong guess = wrong song = wasted generation.
**One wrong word ruins the take.**
## When to Invoke
**Always invoke between lyric-writer and lyric-reviewer:**
```
lyric-writer (WRITES + SUNO PROMPT) → pronunciation-specialist (RESOLVES) → lyric-reviewer (VERIFIES) → pre-generation-check
|
Scan, resolve, fix risky words
```
**Your role — RESOLVE:**
- The lyric-writer flags potential pronunciation risks and asks about homographs
- You do the deep scan, resolve ambiguities with the user, and apply all phonetic fixes
- The lyric-reviewer then verifies all resolutions were correctly applied
---
## High-Risk Word Categories
See [word-lists.md](word-lists.md) for complete tables. Summary:
### 1. Homographs (CRITICAL)
Same spelling, different pronunciation. **ALWAYS require clarification.**
*(Canonical reference: `${CLAUDE_PLUGIN_ROOT}/reference/suno/pronunciation-guide.md`. Keep this summary in sync.)*
| Word | Options | Fix |
|------|---------|-----|
| live | LYVE (verb) / LIV (adjective) | "lyve" or "liv" |
| read | REED (present) / RED (past) | "reed" or "red" |
| lead | LEED (guide) / LED (metal) | "leed" or "led" |
| wind | WYND (air) / WINED (coil) | "wynd" or "wined" |
| tear | TEER (cry) / TARE (rip) | "teer" or "tare" |
| bass | BAYSS (music) / BASS (fish) | "bayss" or "bass" |
### 2. Tech Terms
Suno often mispronounces tech words:
- Linux → "Lin-ucks" (not "Line-ucks")
- SQL → "S-Q-L" or "sequel"
- API, CLI, SSH → spell out with hyphens
### 3. Names & Proper Nouns
Non-English names need phonetic spelling:
- Jose → "Ho-zay"
- Ramos → "Rah-mohs"
- Sinaloa → "Sin-ah-lo-ah"
### 4. Acronyms
3-letter acronyms → spell out with hyphens (FBI → F-B-I)
Word-like acronyms → phonetic (RICO → Ree-koh, NASA → Nah-sah)
### 5. Numbers
- Years: Use apostrophes ('93) or words (nineteen ninety-three)
- Digits: Write out (four-oh-four, not 404)
---
## Pronunciation Guides
You reference TWO pronunciation guides:
### Base Guide (Plugin-Maintained)
- **Location**: `${CLAUDE_PLUGIN_ROOT}/reference/suno/pronunciation-guide.md`
- **Contains**: Universal pronunciation rules, common homographs, tech terms
- **Updated**: By plugin maintainers when new issues are discovered
## Override Support
Check for custom pronunciation entries:
### Loading Override
1. Call `load_override("pronunciation-guide.md")` — returns override content if found (auto-resolves path from config)
2. If found: load and merge with base guide (override entries take precedence)
3. If not found: use base guide only (skip silently)
### Override File Format
**`{overrides}/pronunciation-guide.md`:**
```markdown
# Pronunciation Guide (Override)
## Artist Names
| Name | Pronunciation | Notes |
|------|---------------|-------|
| Ramos | Rah-mohs | Character name |
## Album-Specific Terms
| Term | Pronunciation | Notes |
|------|---------------|-------|
| Sinaloa | Sin-ah-lo-ah | Location |
```
### How to Use Override
- Add artist names, album-specific terms, and genre-specific jargon
- Override entries take precedence over base guide entries for the same word
- Base guide updates via plugin updates without conflicts
- Override guide is version-controlled with your music content
---
## Scanning Workflow
### Step 1: Automated Scan via MCP
1. Extract lyrics: `extract_section(album_slug, track_slug, "lyrics")`
2. Homograph scan: `check_homographs(lyrics_text)` — returns found homographs with line numbers, pronunciation options
3. Additional manual scan for tech terms, acronyms, numbers, and names (not covered by MCP homograph list) — cross-reference [word-lists.md](word-lists.md)
4. If style prompt exists: `scan_artist_names(style_text)` — catch blocklisted names
After fixes are applied:
5. Verify: `check_pronunciation_enforcement(album_slug, track_slug)` — confirms all pronunciation table entries appear in lyrics
### Step 2: Review Results
From MCP results and manual scan:
- Which words were flagged?
- What's the recommended fix for each?
### Step 3: Generate Report
For each flagged word, provide:
1. Line number and context
2. Why it's risky (ambiguity type)
3. Suggested phonetic spelling
4. Alternative if multiple pronunciations exist
**Example output**:
```
PRONUNCIATION RISKS FOUND (3):
Line V1:3 -> "We live in darknet spaces"
Risk: "live" is homograph
Options: "lyve" (verb) or "liv" (adjective)
-> Needs clarification
Line C:1 -> "SQL injection in the code"
Risk: "SQL" is tech acronym
Fix: "S-Q-L" or "sequel"
-> Auto-fix: "S-Q-L injection in the code"
Line V2:5 -> "Reading Linux logs at 3AM"
Risk: "Linux" commonly mispronounced
Fix: "Lin-ucks"
-> Auto-fix: "Reading Lin-ucks logs at 3 A-M"
```
### Step 4: User Confirmation
**For ambiguous words (like "live")**: Ask user which pronunciation
**For clear fixes (tech terms)**: Auto-fix
---
## Auto-Fix Rules
### Always Auto-Fix
- Tech terms (SQL → S-Q-L, Linux → Lin-Provides information about the bitwize-music plugin, its version, and its creator. Use when the user asks about the plugin, its purpose, version, or capabilities.
Creates visual concepts for album artwork and generates AI art prompts. Use during planning for concept discussion, or after all tracks are Final for actual artwork generation.
Designs album concepts, tracklist architecture, and thematic planning through 7 structured phases. Use when planning a new album or reworking an existing album concept.
Shows a structured progress dashboard for an album with percentage complete per phase, blocking items, and status breakdown. Use for a quick visual overview of album progress.
Tracks and manages album ideas including brainstorming, planning, and status updates. Use when the user wants to add, review, or organize their album idea backlog.
Copies track content (lyrics, style prompts, streaming lyrics) to the system clipboard. Use when the user needs to paste lyrics or style prompts into Suno or other external tools.
Uploads promo videos and content to Cloudflare R2 or AWS S3. Use when the user wants to host promo content for social media or distribution.
Sets up or edits the plugin configuration file interactively. Use on first-time setup, when config is missing, or when the user wants to change settings.