muapi-music-video
The muapi-music-video skill generates a short music video by creating multiple keyframe images based on a song theme, animating each keyframe into video clips, and producing a matching soundtrack. Use this when you need to visualize a narrative concept as a coordinated video story with synchronized music, such as creating promotional content, narrative animations, or thematic visual accompaniments to original compositions.
git clone --depth 1 https://github.com/SamurAIGPT/Generative-Media-Skills /tmp/muapi-music-video && cp -r /tmp/muapi-music-video/library/motion/music-video ~/.claude/skills/muapi-music-videoSKILL.md
# Music Video
**Build a short music video from a song theme — N keyframes, animate each, generate matching music.**
## Inputs
| Name | Type | Required | Default | Description |
|:---|:---|:---|:---|:---|
| `theme` | text | yes | — | Song / video theme (e.g. "lonely robot finds a friend, hopeful"). |
| `scenes` | int | no | 3 | Number of scenes (each becomes a 5s clip). |
| `music_style` | text | no | ambient cinematic, instrumental, slow tempo, warm | Suno-style tags for the soundtrack. |
| `visual_style` | text | no | cinematic, photoreal, soft volumetric light, 16:9 | |
## Steps
Build one the plan covering:
1. **Layer A (parallel)** — N keyframes + 1 music track all at once.
- For each scene 1..N: `muapi image generate` with a beat-specific prompt +
`{{visual_style}}`, model=nano-banana-pro (these feed video gen).
- One `muapi audio create` (kind=music) using `{{music_style}}`, duration =
N × 5 + a 2s tail.
2. **Layer B (parallel, depends on Layer A)** — animate each keyframe.
- For each scene: `muapi video from-image` with `image=$nX.url`, model=veo3.1-image-to-video,
duration=5, prompt=scene-specific motion direction.
3. Return:
- The scene keyframes (asset ids in order).
- The animation clips (asset ids in order).
- The music track asset id.
- A short summary describing the cut order.
## Notes
- Keep character continuity by repeating the character description in every
scene prompt verbatim.
- Don't auto-confirm any single video call > 50 cr — those need the user's
nod (the loop will prompt automatically).
- If a scene's `muapi video from-image` fails after failover, fall back to
`muapi video generate` (text-to-video) for that scene only.
## Trigger Keywords
`music video`, `mv`, `video story`, `song visualization`
---
## Notes for the Executing Agent
- This recipe is LLM-orchestrated: read each phase, gather any missing inputs from the user, then call `muapi` CLI commands. Use `muapi auth configure` first if `MUAPI_API_KEY` is unset.
- For model IDs without a CLI alias yet, fall back to the raw endpoint via `curl -X POST https://api.muapi.ai/api/v1/<endpoint> -H "x-api-key: $MUAPI_API_KEY" -H 'content-type: application/json' -d '{...}'` and poll with `muapi predict wait <request_id>`.
- Substitute `{{input_name}}` placeholders with the user's actual inputs before issuing each call.Edit and enhance images and videos with AI via muapi.ai — prompt-based editing, upscaling, background removal, face swap, lipsync, video effects, and more
Generate AI images, videos, music, and audio from the terminal via muapi.ai — supports 100+ models including Flux, Midjourney v7, Kling 3.0, Veo3, and Suno V5
Setup and utility scripts for muapi.ai — configure API keys, test connectivity, and poll for async generation results
Turn a long video into N viral-ready short clips with a single managed API call. Wraps muapi.ai's `/ai-clipping` endpoint, which handles transcription, highlight ranking through a virality framework (hook / emotional peak / opinion bomb / revelation / conflict / quotable / story peak / practical value), overlap dedupe, and vertical face-tracking auto-crop server-side. No local Whisper, no local LLM, no GPU.
Transform a 2D logo into a premium 3D version and animate it with professional cinematic effects.
Generate a high-cut-density action / fight scene by first composing a 16-cell storyboard image, then driving Seedance 2.0 image-to-video off that storyboard. Stacks GPT-Image-2 (character sheet + storyboard), Nano-Banana-2 (environment concept), and Seedance 2.0 i2v.
Create a hilarious and ultra-realistic video of an anthropomorphic animal acting like a human vlogger in a real-world setting.
Generate a 15-second cinematic awards-ceremony video — a host announces a winner from the stage, a spotlight finds them in the crowd, they walk up to the podium, receive the award, and the LED display reveals their name and "THE BEST ACTOR".