smixs/youtube-publisher
AI agent skill: tell Claude Code, Codex, Gemini or OpenClaw to upload your recording to YouTube — it downloads from Drive, transcribes, generates timestamps and metadata automatically.
Implements a modular skill architecture with `SKILL.md` + standalone Python scripts that work across multiple AI agent platforms (Claude Code, Gemini CLI, OpenClaw) without external dependencies. The pipeline parallelizes transcription across 6 workers, optimizes audio to 64kbps mono 16kHz for speech processing, and automatically generates timestamps by detecting topic boundaries — integrating Google Drive/YouTube APIs, Fireworks Whisper, and Deepgram Nova-3 for speech-to-text.
Stars
4
Forks
2
Language
Python
License
MIT
Category
Last pushed
Mar 06, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/smixs/youtube-publisher"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
SamurAIGPT/Generative-Media-Skills
Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI)....
AgriciDaniel/claude-shorts
Interactive longform-to-shortform video creator — Claude Code skill with Remotion-rendered...
swimmingkiim/image-edit-tools
Deterministic image editing SDK for AI agents. Ships with MCP tools.
zysilm-ai/ai-video-producer-skill
LLM Agent skill to create videos via local diffusion models
ristponex/free-image-and-video-generation-skill
Free AI image & video processing toolkit — 7 local AI tools + 300+ cloud models via Atlas Cloud...