aman179102/podvoice
Local-first CLI that turns Markdown scripts into multi-speaker podcast-style audio using Coqui XTTS v2.
Supports both a CLI and web-based Studio GUI for interactive voice generation, with built-in multi-speaker VCTK models alongside XTTS v2. Uses deterministic speaker-to-voice hashing for consistent character voices across runs, and stores speaker profiles as YAML with optional reference audio for voice cloning. Leverages FastAPI for the Studio interface, CPU-first inference with optional CUDA acceleration, and automatic model caching across all platforms.
Stars
25
Forks
10
Language
Python
License
MIT
Category
Last pushed
Feb 22, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/aman179102/podvoice"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
snakers4/silero-models
Silero Models: pre-trained text-to-speech models made embarrassingly simple
snakers4/silero-stress
Silero Stress — pre-trained enterprise-grade automated stress and homograph disambiguation for...
JSchmie/ScrAIbe-WebUI
WebUI for ScAIbe
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot...
isaiahbjork/orpheus-tts-local
Run Orpheus 3B Locally With LM Studio