SamurAIGPT/Generative-Media-Skills
Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.
Schema-driven CLI primitives delegate all operations to `muapi-cli`, enabling agents to generate across 100+ models (Midjourney, Flux, Kling, Veo3) with structured JSON outputs and semantic exit codes for agentic pipelines. The Expert Library layer adds domain-specific skills (Cinema Director for cinematography, Nano-Banana for reasoning-driven imagery, UI Designer for atomic design) that translate creative intent into technical directives. Runs as an MCP server exposing 19 typed tools to Claude Desktop, Cursor, and other MCP-compatible agents, with native support for async polling, local file upload, and direct media display via `--view`.
2,902 stars. Actively maintained with 9 commits in the last 30 days.
Stars
2,902
Forks
319
Language
Shell
License
MIT
Category
Last pushed
Mar 12, 2026
Commits (30d)
9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/SamurAIGPT/Generative-Media-Skills"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related agents
AgriciDaniel/claude-shorts
Interactive longform-to-shortform video creator — Claude Code skill with Remotion-rendered...
swimmingkiim/image-edit-tools
Deterministic image editing SDK for AI agents. Ships with MCP tools.
smixs/youtube-publisher
AI agent skill: tell Claude Code, Codex, Gemini or OpenClaw to upload your recording to YouTube...
zysilm-ai/ai-video-producer-skill
LLM Agent skill to create videos via local diffusion models
jjenkins/agent-image-skills
Image storage for AI agents. Upload, retrieve, and manage images with a single curl call — no...