Skip to main content
ClaudeWave
SamurAIGPT avatar
SamurAIGPT

Generative-Media-Skills

Ver en GitHub

Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.

Skills3.5k estrellas398 forksShellMITActualizado today
Nota editorial

Generative Media Skills is a schema-driven collection of Shell scripts, SKILL.md instruction files, and an MCP server that lets AI agents running inside Claude Code, Claude Desktop, or Cursor generate and edit images, videos, and audio through the muapi-cli npm package, which proxies requests to over 100 hosted models including Midjourney v7, Flux Kontext, Kling 3.0, Seedance 2.0, and Veo3. The repository splits into core primitives covering file upload, prompt-based image editing, and auth polling, and an expert library of domain-specific skills such as Cinema Director, UI Designer, Logo Creator, and AI Clipping, which converts long videos into ranked vertical short clips with server-side transcription and face-tracked auto-crop. Running muapi mcp serve exposes all 19 tools directly to any MCP-compatible agent. Forty-one named workflow recipes, each a SKILL.md the agent reads and executes, cover end-to-end pipelines such as product photo to cinematic ad. The primary audience is developers and creative professionals building multimodal agentic workflows who want structured, LLM-readable tooling without managing raw API calls or local media processing.

ClaudeWave Trust Score
100/100
Verified
Passed
  • Open-source license (MIT)
  • Actively maintained (<30d)
  • Healthy fork ratio
  • Clear description
  • Topics declared
  • Mature repo (>1y old)
Last scanned: 6/11/2026
Install as a Claude Code skill
Method: Clone
Terminal
git clone https://github.com/SamurAIGPT/Generative-Media-Skills ~/.claude/skills/generative-media-skills
1. Clone the repository into your ~/.claude/skills directory (or copy the skill folder containing SKILL.md).
2. Start a new Claude Code session so the skill registry reloads.
3. Invoke it by name, or let Claude trigger it automatically when the task matches.
💡 If the repo bundles several skills, copy only the folders you need.

24 items en este repositorio

Edit and enhance images and videos with AI via muapi.ai — prompt-based editing, upscaling, background removal, face swap, lipsync, video effects, and more

Instalar

Generate AI images, videos, music, and audio from the terminal via muapi.ai — supports 100+ models including Flux, Midjourney v7, Kling 3.0, Veo3, and Suno V5

Instalar

Setup and utility scripts for muapi.ai — configure API keys, test connectivity, and poll for async generation results

Instalar

Turn a long video into N viral-ready short clips with a single managed API call. Wraps muapi.ai's `/ai-clipping` endpoint, which handles transcription, highlight ranking through a virality framework (hook / emotional peak / opinion bomb / revelation / conflict / quotable / story peak / practical value), overlap dedupe, and vertical face-tracking auto-crop server-side. No local Whisper, no local LLM, no GPU.

Instalar

Transform a 2D logo into a premium 3D version and animate it with professional cinematic effects.

Instalar

Generate a high-cut-density action / fight scene by first composing a 16-cell storyboard image, then driving Seedance 2.0 image-to-video off that storyboard. Stacks GPT-Image-2 (character sheet + storyboard), Nano-Banana-2 (environment concept), and Seedance 2.0 i2v.

Instalar

Create a hilarious and ultra-realistic video of an anthropomorphic animal acting like a human vlogger in a real-world setting.

Instalar

Generate a 15-second cinematic awards-ceremony video — a host announces a winner from the stage, a spotlight finds them in the crowd, they walk up to the podium, receive the award, and the LED display reveals their name and "THE BEST ACTOR".

Instalar

Convert a photo of a person into a Pixar-style 3D cartoon character, then animate it using a reference dance or motion video.

Instalar

Create a multi-part animated story video by first establishing a consistent character and then generating sequential scenes and animating them.

Instalar

Direct high-fidelity cinematic video with AI — translates creative intent into technical cinematographic directives for Veo3, Kling, and Luma video models via muapi.ai

Instalar

Generate aerial drone-perspective footage — sweeping bird's-eye views, orbit shots, and flyover sequences for landscapes, architecture, and events.

Instalar

Generate a cinematic "freeze effect" video where time stops mid-scene, the subject walks through the frozen world, then time resumes with a snap.

Instalar

Create a dramatic "Giant Product" visual where a regular item is showcased as a massive, building-sized object next to a person, then optionally animate the scene.

Instalar

Create a luxury jewelry advertisement with high-end commercial cinematography and detailed macro animation.

Instalar

Build a short music video from a song theme — N keyframes, animate each, generate matching music.

Instalar

Generate a single continuous cinematic shot video — no cuts, one seamless flowing scene with dramatic lighting and motion.

Instalar

Cinematic 5–10s product ad from a product photo + brand brief.

Instalar

Create a dynamic product showcase with explosive ingredient arrangements, followed by a realistic motion animation.

Instalar

Create a high-end cinematic product video advertisement starting from a simple product photo.

Instalar

Expert Cinema Director skill for Seedance 2.0 (ByteDance) — high-fidelity video generation across Chinese, Global, and VIP tiers. Supports text-to-video, image-to-video, first-last-frame, omni reference, character training, omni-reference training, video editing, and watermark removal.

Instalar

Turn a single photo of a person into a 15-second cinematic pasta-making (or other cuisine) tutorial video. First builds a composite reference sheet (character + kitchen + 9-step action board), then animates the full cooking sequence with audio in a single continuous shot.

Instalar

Create a viral-style video of a talking baby with custom costumes and scripts.

Instalar

Generate UGC-style (User Generated Content) lifestyle photos of a person wearing or using your product — authentic, relatable, social-media-native imagery.

Instalar
Casos de uso

Resumen de Skills

README no disponible. Visita el repo en GitHub para la documentación completa.
agent-skillsagent-toolsai-agentsai-videoclaude-codeclaude-code-skillsclaude-skillsfluxgenerative-aiimage-generationklingmcpmidjourneymuapimultimodal-aiskillssunotext-to-imagetext-to-videovideo-generation

Lo que la gente pregunta sobre Generative-Media-Skills

¿Qué es SamurAIGPT/Generative-Media-Skills?

+

SamurAIGPT/Generative-Media-Skills es skills para el ecosistema de Claude AI. Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai. Tiene 3.5k estrellas en GitHub y se actualizó por última vez today.

¿Cómo se instala Generative-Media-Skills?

+

Puedes instalar Generative-Media-Skills clonando el repositorio (https://github.com/SamurAIGPT/Generative-Media-Skills) o siguiendo las instrucciones del README en GitHub. ClaudeWave también te ofrece bloques de instalación rápida en esta misma página.

¿Es seguro usar SamurAIGPT/Generative-Media-Skills?

+

Nuestro agente de seguridad ha analizado SamurAIGPT/Generative-Media-Skills y le ha asignado un Trust Score de 100/100 (tier: Verified). Revisa el desglose completo de comprobaciones superadas y flags en esta página.

¿Quién mantiene SamurAIGPT/Generative-Media-Skills?

+

SamurAIGPT/Generative-Media-Skills es mantenido por SamurAIGPT. La última actividad registrada en GitHub es de today, con 4 issues abiertos.

¿Hay alternativas a Generative-Media-Skills?

+

Sí. En ClaudeWave puedes explorar skills similares en /categories/skills, ordenados por popularidad o actividad reciente.

Despliega Generative-Media-Skills en tu cloud

Lleva este repo a producción en minutos. Cada plataforma genera su propio entorno con variables de entorno editables.

¿Mantienes este repo? Añade un badge a tu README

Pega el badge en tu README de GitHub para mostrar que está auditado por ClaudeWave. Cada badge enlaza de vuelta a esta página y muestra el Trust Score actual.

Featured on ClaudeWave: SamurAIGPT/Generative-Media-Skills
[![Featured on ClaudeWave](https://claudewave.com/api/badge/samuraigpt-generative-media-skills)](https://claudewave.com/repo/samuraigpt-generative-media-skills)
<a href="https://claudewave.com/repo/samuraigpt-generative-media-skills"><img src="https://claudewave.com/api/badge/samuraigpt-generative-media-skills" alt="Featured on ClaudeWave: SamurAIGPT/Generative-Media-Skills" width="320" height="64" /></a>

Más Skills

farion1231
cc-switch
yesterday

A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official website: ccswitch.io

99.4k6.6kRust
Skillsai-toolsclaude-codeInstall
code-yeongyu
oh-my-openagent
today

omo/lazycodex: The coding agent for tokenmaxxers;the one and only agent harness for complex codebases. For your Codex, for your OpenCode

62k5kTypeScript
Skillsaiai-agentsInstall
Egonex-AI
Understand-Anything
yesterday

Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini CLI, and more.

58.2k4.8kTypeScript
Skillsantigravity-skillsbusiness-knowledgeInstall
K-Dense-AI
scientific-agent-skills
today

Turn any AI agent into an AI Scientist. The #1 Agent Skills library for science, used by 160,000+ scientists worldwide. 140 ready-to-use skills plus 100+ scientific databases covering biology, chemistry, medicine, and drug discovery. Compatible with Cursor, Claude Code, Codex, Antigravity, and the open Agent Skills standard.

28.1k2.9kPython
Skillsagent-skillsai-scientistInstall
VoltAgent
awesome-agent-skills
today

A curated collection of 1000+ agent skills from official dev teams and the community, compatible with Claude Code, Codex, Gemini CLI, Cursor, and more.

25.2k2.7k
Skillsagent-skillsai-agentsInstall
JimLiu
baoyu-skills
today

No description provided.

21.4k2.5kTypeScript
Skillsagent-skillsclaude-skillsInstall