harmlessman/PAFTS

PAFTS: a library that preprocesses audio for TTS.

Emerging

Integrates UVR for vocal/music separation, pyannote-audio for speaker diarization, and OpenAI's Whisper for speech-to-text transcription to create speaker-isolated, noise-cleaned training datasets. The pipeline automatically organizes output into speaker-labeled directories with corresponding JSON transcriptions, enabling end-to-end conversion of raw multi-speaker audio into structured TTS training data. Requires PyTorch (GPU-accelerated), FFmpeg, and HuggingFace authentication for diarization models.
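The output organization described above can be illustrated with a minimal sketch. It does not use PAFTS itself, and it is not the library's API: the `Segment` class, the `write_transcripts` helper, and the `transcripts.json` file name are all hypothetical, chosen only to show how diarized and transcribed clips might be grouped into speaker-labeled directories, each with a JSON transcript.

```python
import json
import tempfile
from dataclasses import dataclass
from pathlib import Path


@dataclass
class Segment:
    """One diarized, transcribed clip (hypothetical structure, not a PAFTS type)."""
    speaker: str   # diarization label, e.g. "SPEAKER_00"
    filename: str  # name of the audio clip for this segment
    text: str      # speech-to-text transcript of the clip


def write_transcripts(segments, out_dir):
    """Group segments by speaker and write one JSON transcript file per speaker.

    Assumed layout: <out_dir>/<speaker>/transcripts.json mapping clip name to text.
    Returns a dict of speaker label -> path of the JSON file written.
    """
    out_dir = Path(out_dir)

    # Collect {clip filename: transcript} for each speaker label.
    by_speaker = {}
    for seg in segments:
        by_speaker.setdefault(seg.speaker, {})[seg.filename] = seg.text

    written = {}
    for speaker, transcripts in by_speaker.items():
        speaker_dir = out_dir / speaker
        speaker_dir.mkdir(parents=True, exist_ok=True)
        path = speaker_dir / "transcripts.json"
        path.write_text(
            json.dumps(transcripts, ensure_ascii=False, indent=2),
            encoding="utf-8",
        )
        written[speaker] = path
    return written


if __name__ == "__main__":
    # Illustrative data only; real segments would come from diarization and transcription.
    demo = [
        Segment("SPEAKER_00", "clip_000.wav", "Hello there."),
        Segment("SPEAKER_01", "clip_001.wav", "Hi, how are you?"),
        Segment("SPEAKER_00", "clip_002.wav", "Doing well, thanks."),
    ]
    with tempfile.TemporaryDirectory() as tmp:
        for speaker, path in sorted(write_transcripts(demo, tmp).items()):
            print(speaker, path.name)
```

Keeping one transcript file per speaker directory is only one plausible layout; the directory and file names PAFTS actually produces may differ, so check the project's README before relying on any specific structure.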

No commits in the last 6 months. Available on PyPI.

Maintenance 0 / 25

Adoption 11 / 25

Maturity 18 / 25

Community 14 / 25

Language: Python

License: MIT

Featured in

Things AI Won't Tell You About Building a Voice App

Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Higher-rated alternatives

KoljaB/RealtimeTTS

Converts text to speech in realtime

nateshmbhat/pyttsx3

Offline Text To Speech synthesis for python

pndurette/gTTS

Python library and CLI tool to interface with Google Translate's text-to-speech API

n1teshy/yapper-tts

offline text to speech and free SOTA LLM APIs to let your programs speak to you

dputhier/pygtftk

A python package and a set of shell commands to handle GTF files
