aman179102/podvoice

Local-first CLI that turns Markdown scripts into multi-speaker podcast-style audio using Coqui XTTS v2.

/ 100

Emerging

Supports both a CLI and web-based Studio GUI for interactive voice generation, with built-in multi-speaker VCTK models alongside XTTS v2. Uses deterministic speaker-to-voice hashing for consistent character voices across runs, and stores speaker profiles as YAML with optional reference audio for voice cloning. Leverages FastAPI for the Studio interface, CPU-first inference with optional CUDA acceleration, and automatic model caching across all platforms.

No Package No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 9 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

snakers4/silero-models

Silero Models: pre-trained text-to-speech models made embarrassingly simple

snakers4/silero-stress

Silero Stress — pre-trained enterprise-grade automated stress and homograph disambiguation for...

JSchmie/ScrAIbe-WebUI

WebUI for ScAIbe

abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot...

isaiahbjork/orpheus-tts-local

Run Orpheus 3B Locally With LM Studio

Explore Voice AI Tools

All categories Trending Voice AI directory Insights