hetpandya/youtube_tts_data_generator

A Python library for generating speech datasets from YouTube videos

Established

Automatically downloads YouTube videos with subtitles, extracts the audio, and aligns transcriptions by segmenting the audio according to subtitle timing. Includes built-in preprocessing pipelines for silence trimming, audio concatenation with configurable length limits, and metadata generation in LJ Speech or JSON formats compatible with TTS frameworks. Supports multi-language subtitle extraction and produces a standardized directory structure of paired audio/text files ready for training speech synthesis models.
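To make the segmentation and metadata steps concrete, here is a minimal sketch in plain Python. It does not use the library's own API: the `Cue` class, `merge_cues`, `ljspeech_rows`, the `clip` prefix, and the 10-second limit are all hypothetical names and values chosen for illustration. It assumes subtitle cues are already parsed into start/end times in seconds, greedily joins consecutive cues while the joined span stays under a length limit, and emits pipe-separated rows in the LJ Speech `metadata.csv` layout (`id|transcription|normalized transcription`).

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One subtitle cue (hypothetical container, not a library type)."""
    start: float  # seconds
    end: float    # seconds
    text: str


def merge_cues(cues, max_len=10.0):
    """Greedily join consecutive cues while the joined span is <= max_len seconds."""
    merged = []
    for cue in cues:
        if merged and cue.end - merged[-1].start <= max_len:
            last = merged[-1]
            merged[-1] = Cue(last.start, cue.end, f"{last.text} {cue.text}")
        else:
            # Start a new segment; a single cue longer than max_len is kept as-is.
            merged.append(cue)
    return merged


def ljspeech_rows(segments, prefix="clip"):
    """Build LJ Speech style rows: id|transcription|normalized transcription."""
    # No text normalization is attempted here, so both text columns are identical.
    return [
        f"{prefix}_{i:04d}|{seg.text}|{seg.text}"
        for i, seg in enumerate(segments, start=1)
    ]


cues = [
    Cue(0.0, 2.5, "hello there"),
    Cue(2.5, 6.0, "welcome back"),
    Cue(6.0, 12.0, "today we talk about speech"),
]
segments = merge_cues(cues, max_len=10.0)
rows = ljspeech_rows(segments)
print("\n".join(rows))
```

With these sample cues, the first two are joined into one 6-second segment and the third starts a new segment, because adding it would exceed the 10-second limit. A real pipeline would also cut the audio at each segment's start and end times and trim silence, which this sketch leaves out.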

No commits in the last 6 months. Available on PyPI.

Maintenance 0 / 25

Adoption 11 / 25

Maturity 25 / 25

Community 17 / 25

Language: Python

License: Apache-2.0

Related tools

IS2AI/Kazakh_TTS

An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis...

Hecate2/sukasuka-vocal-dataset-builder

Sukasuka anime vocal dataset. 1st anime vocal dataset. Extract audio (vocal) files from video based on .ass...

youmebangbang/TTS-dataset-tools

Automatically generates TTS dataset using audio and associated text. Make cuts under a custom...

taresh18/TTSizer

🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨

keonlee9420/DailyTalk

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
