puntorigen/podcast_tts
A class for generating realistic audio (TTS) for podcasts and dialogues.
Supports multi-speaker dialogues with dynamic voice profile generation and automatic caching, leveraging ChatTTS for synthesis. Includes spatial audio mixing with configurable channel placement (left/right/both), background music integration with fade controls, and text normalization—all exposed via an async Python API that outputs MP3 or WAV files.
No commits in the last 6 months. Available on PyPI.
Stars
65
Forks
6
Language
Python
License
MIT
Category
Last pushed
Dec 08, 2024
Monthly downloads
14
Commits (30d)
0
Dependencies
6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/puntorigen/podcast_tts"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
mozilla-ai/document-to-podcast
Blueprint by Mozilla.ai for generating podcasts from documents using local AI
iMicknl/azure-podcast-generator
Generate an engaging podcast based on your document using Azure OpenAI and Azure Speech.
BandarLabs/gitpodcast
Convert any git repository into an engaging podcast
ismailperim/reportcast
Transform reports into podcasts with AI - Nobody reads your reports. But they'll listen.
cxyfer/GeminiASR
A Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.