rioharper/VocalForge

Your one-stop solution for voice dataset creation

45
/ 100
Emerging

Combines Whisper transcription, PyAnnote speaker diarization, and CTC segmentation to automatically process raw audio into aligned speech datasets with minimal manual curation. The toolkit handles speaker isolation, voice activity detection, noise filtering, and text normalization across multiple audio sources, then exports in LJSpeech format. Includes VCAuditor, a browser-based verification interface for reviewing waveforms, correcting alignments, and filtering low-confidence segments before final dataset export.

130 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

130

Forks

24

Language

Python

License

MIT

Last pushed

Dec 10, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rioharper/VocalForge"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.