Skip to main content
ClaudeWave
Skill452 repo starsupdated 6d ago

25-voice-clone-podcast

This Claude Code skill provides tools and workflows for creating AI-generated voiceovers and podcasts using voice cloning technology from platforms like ElevenLabs, Vbee, and HeyGen Voice. Use it when producing personal brand audio content such as podcasts, audiobooks, TikTok voiceovers, or repurposing a single podcast episode into multiple short clips, particularly when creating content at scale without appearing on camera.

Install in Claude Code
Copy
git clone --depth 1 https://github.com/minhnv0807/ai-business-skills /tmp/25-voice-clone-podcast && cp -r /tmp/25-voice-clone-podcast/modules/personal-branding/vi/25-voice-clone-podcast ~/.claude/skills/25-voice-clone-podcast
Then start a new Claude Code session; the skill loads automatically.

SKILL.md

# Voice Clone & Podcast — Audio AI cho Personal Brand

> **Skill nay tap trung vao audio AI** — voice clone, podcast, audiobook, voiceover.
> Bo sung cho `24-ai-avatar-production` (video) — ket hop ca 2 de phu het content stack.

---

## 1. Cho nguoi moi (Newbie Guide)

### Audio AI la gi va khac voi video AI?

Audio AI la cong nghe tao ra giong noi nhan tao gan giong nguoi that — tu sample
giong cua ban, AI hoc va tao ra giong nhan ban (voice clone). Ban viet text →
AI doc thay (Text-to-Speech).

**Khac biet voi video AI:**
- Video AI (skill 24): Tao video co hinh + giong → lam talking head, social video
- Audio AI (skill nay): Chi tao giong → lam podcast, audiobook, voiceover, narration

### Khi nao dung audio AI thay vi quay video?

| Tinh huong | Chon audio AI | Chon video AI |
|-----------|---------------|---------------|
| Noi dung dai (>10 phut) | YES — podcast format | NO — too long for video |
| Khong muon len hinh | YES | NO |
| Can tao volume content nhanh | YES — 1 podcast = 10 short | YES nhung ton hon |
| Audience nghe khi lai xe / tap gym | YES | NO |
| Can visual de demo | NO | YES |
| Personal brand thought leader | YES — podcast = authority | YES — neu da co face brand |

### Tools chinh

- **ElevenLabs:** Tot nhat the gioi cho voice clone — VN voice tot, EN voice xuat sac
- **Vbee:** Tot nhat tieng Viet — natural intonation, da giong vung mien
- **HeyGen Voice:** Combo voi avatar HeyGen — workflow lien tuc voice + video
- **Descript:** AI editing — cat audio bang text, voice clone (Overdub)
- **Riverside:** Podcast recording — chat luong studio, AI Magic Clips repurpose

### Mat bao lau / chi phi?

| Cong viec | Thoi gian | Chi phi (USD/thang) |
|-----------|-----------|----------------------|
| Voice clone setup | 30-60 phut | $5-22 (ElevenLabs Starter/Pro) |
| Voiceover 60s (TikTok) | 5-10 phut | $5-22 |
| Podcast 30 phut (solo) | 1-2 gio | $22-99 (ElevenLabs + Riverside) |
| Audiobook 1 chuong (15 phut) | 30-45 phut | $22-99 |
| Repurpose 1 podcast → 10 clip | 1-2 gio | $0-30 (Descript/Opus) |

### 5 loi thuong gap

1. **Giong AI nghe robot:** Sample qua ngan hoac don dieu. Fix: thu lai 3-5 phut, doc nhieu cam xuc khac nhau (vui, buon, nghiem tuc).
2. **Phat am sai tu tieng Viet:** ElevenLabs van con yeu vai tu Han-Viet. Fix: dung Vbee cho VN content, hoac sua tu bang phonetic spelling.
3. **Audio bi clipping (vo tieng):** Levels qua cao. Fix: target -3dB peak, -16 LUFS loudness.
4. **Bi noise/echo:** Phong khong tieu am. Fix: thu am phong nho co rem, treo chan, hoac dung NVIDIA Broadcast / Krisp khu noise.
5. **Podcast nghe chan:** Khong co edit, qua nhieu "um a". Fix: Descript auto-remove filler words, them background music nhe (-20dB).

---

## 2. Thu thap thong tin

Hoi toi da 4 cau truoc khi bat dau:

1. **Use case chinh?** Voiceover ngan (TikTok/Reels) / Podcast 30-60 phut / Audiobook?
2. **Ngon ngu?** Tieng Viet / Tieng Anh / Song ngu (VN-EN)?
3. **Thoi luong tong?** <60s / 5-30 phut / 30-60 phut / >60 phut (audiobook)?
4. **Ngan sach tier?** Free ($0) / Starter ($5-22) / Pro ($22-99) / Business ($99+)?

> Dua tren 4 cau tra loi, chon use case + tool stack phu hop.

---

## 3. Voice clone setup

### Yeu cau sample

| Tieu chi | Yeu cau toi thieu | Toi uu |
|----------|---------------------|--------|
| Thoi luong | 1 phut (Free tier) | 3-5 phut (Pro tier) |
| Phong | Yen tinh, khong vang | Treo chan, rem, sach hap thu am |
| Mic | iPhone + tai nghe co mic | Condenser mic (AT2020, $80-100) |
| Distance | 20-30cm | 15-20cm voi pop filter |
| Format | MP3 128kbps | WAV 44.1kHz |
| Noi dung | 1 doan van da chuan bi | 3 doan van: business / casual / emotional |

> **Reference day du:** `references/voice-clone-prompts-vn.md` — 3 sample script
> theo vung giong (Bac/Trung/Nam) va 3 topic (business/lifestyle/educational).

### Tool comparison

| Tool | VN voice clone | Gia/thang | Setup time | Best for |
|------|----------------|-----------|------------|----------|
| **ElevenLabs Pro** | Tot (8/10) | $22 | 30 phut | Multi-language, content creator |
| **HeyGen Voice** | Trung binh (6/10) | Bundle voi avatar | 15 phut | Combo voi video AI |
| **Vbee Pro** | Xuat sac (9.5/10) | 199K-499K VND | 45 phut | VN-only, broadcast TTS |
| **Descript Overdub** | Trung binh (6/10) | $24 (Hobbyist) | 30 phut | Podcast editing |
| **Resemble.ai** | Trung binh (7/10) | $30 | 1 gio | API integration, custom |

**Khuyen nghi:**
- **VN-only content:** Vbee Pro (tot nhat phat am tieng Viet)
- **Multi-lang (VN + EN):** ElevenLabs Pro
- **Combo voi video:** HeyGen (1 platform — voice + avatar)

### Consent form template

```
THOA THUAN SU DUNG VOICE CLONE

Toi, [Ho ten], CMND/CCCD: [so], dong y cho [Brand/Cong ty]:
1. Su dung sample giong noi cua toi de tao voice clone AI
2. Su dung voice clone trong [pham vi: noi bo / quang cao / podcast / etc.]
3. Thoi han: [tu DD/MM/YYYY den DD/MM/YYYY]
4. Quyen rut lai: Toi co quyen yeu cau xoa voice clone bat ky luc nao
   bang van ban, brand co 7 ngay de xoa hoan toan.
5. Cong khai: Brand cam ket disclose "AI voice" theo quy tac VN.

Ky ten: ____________  Ngay: ____________
```

---

## 4. 3 use case rieng

### Use case A: Voiceover ngan TikTok/Reels (Energetic)

**Spec:**
- Thoi luong: 15-60s
- Pace: Fast (180-220 words/phut) — gioi tre VN
- Tone: Energetic, high-pitch, exciting
- Audio levels: -14 LUFS (TikTok loudness), peak -1dB
- CTA: Ro rang trong 5s cuoi

**Script template (30s):**
```
[HOOK 0-3s] "Ban co biet [stat shocking]?"
[PROBLEM 3-10s] "Hau het moi nguoi van dang [vong xoay sai]"
[SOLUTION 10-22s] "Toi da thu [phuong phap], va day la 3 dieu..."
[PAYOFF 22-27s] "Ket qua: [so cu the]"
[CTA 27-30s] "Comment 'YES' de minh gui chi tiet"
```

**Voice settings (ElevenLabs):**
- Stability: 35-45 (low — cho phep variation)
- Similarity: 75-85
- Style: 50-65 (boost expressiveness)
- Speaker Boost: ON

### Use case B: Podcast 30-60 phut (Conversational)

**Cau truc:**
- **Intro (1-2 phut):** Hook + introduce topic + welcome listeners
- **Body (2
PULL_REQUEST_TEMPLATESkill
channel-operatorSubagent

Agent van hanh kenh — thiet lap kenh, brief landing page, email marketing, social listening

content-producerSubagent

Agent san xuat noi dung — viet script, copy, brief creator, lap lich noi dung

mkt-strategistSubagent

Agent chien luoc marketing — lap ke hoach, nghien cuu thi truong, phan tich doi thu, xay dung chien luoc thuong hieu

performance-analystSubagent

Agent phan tich hieu suat — doc data, danh gia chien dich, tinh KPI, bao cao

personal-brand-builderSubagent

Agent xay dung thuong hieu ca nhan voi AI Avatar — chien luoc, content engine, monetization, community cho founder/coach/creator

29-dropshipping-mastery-globalSkill

Full dropshipping pipeline for US/EU/global markets — product research (winning criteria, Minea, PiPiAds), supplier sourcing (AliExpress, CJ Dropshipping, Spocket, Zendrop), Shopify store setup (themes, apps), ad creative pipeline (10 ads/week methodology, UGC pattern), audience targeting (interest stacking, lookalike, broad), pricing math (3-5x markup, BE-ROAS), customer service (long shipping, refunds), scaling playbook (CBO, vertical), compliance (FTC, EU CHRD). Trigger: 'dropshipping', 'shopify store', 'AliExpress', 'winning product', 'Facebook ads dropship', 'TikTok ads dropship', 'Shopify conversion'.

22-personal-brand-context-globalSkill

Foundation skill for global personal brand cluster. Creates `.agents/personal-brand-context-global.md` with region-specific personal brand context. 4 region variants (US/EU/SEA/LATAM); each covers founder/coach/creator inside. Reads BEFORE other PB skills (23-28 global). Trigger: 'global personal brand', 'international personal brand', 'US founder brand', 'EU coach brand', 'creator economy global'.