PaddlePaddle/Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)

Archived

Emerging

Built on PaddlePaddle's dynamic graph framework, Parakeet implements a modular two-stage TTS pipeline: an acoustic model predicts a mel-spectrogram from text, and a neural vocoder synthesizes the waveform from that spectrogram. It provides standardized data preprocessing and text frontend processing (including rule-based Chinese text analysis), and it supports voice cloning through transfer learning with speaker embeddings. The toolkit targets Chinese and English synthesis on the CSMSC, AISHELL-3, and LJSpeech datasets, with pre-trained checkpoints for production deployment.
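The two-stage structure described above can be sketched in a few lines of plain Python. This is an illustrative toy under stated assumptions, not Parakeet's API: the names `toy_frontend`, `toy_acoustic_model`, `toy_vocoder`, and `TwoStageTTS`, and the values of `N_MELS` and `HOP_LENGTH`, are all hypothetical, and the "models" are placeholders that only reproduce the shapes of the data passed between stages.

```python
from dataclasses import dataclass
from typing import Callable, List

N_MELS = 80        # mel bins per frame (assumed value for illustration)
HOP_LENGTH = 256   # waveform samples generated per mel frame (assumed)

Mel = List[List[float]]   # shape: [n_frames][N_MELS]
Wave = List[float]        # shape: [n_samples]


def toy_frontend(text: str) -> List[int]:
    """Stand-in text frontend: map each non-space character to an integer ID."""
    return [ord(ch) % 100 for ch in text.lower() if not ch.isspace()]


def toy_acoustic_model(token_ids: List[int], frames_per_token: int = 4) -> Mel:
    """Stand-in acoustic model: emit a fixed number of mel frames per token."""
    mel: Mel = []
    for tok in token_ids:
        for _ in range(frames_per_token):
            mel.append([tok / 100.0] * N_MELS)
    return mel


def toy_vocoder(mel: Mel) -> Wave:
    """Stand-in vocoder: upsample every mel frame to HOP_LENGTH samples."""
    wave: Wave = []
    for frame in mel:
        level = sum(frame) / len(frame)
        wave.extend([level] * HOP_LENGTH)
    return wave


@dataclass
class TwoStageTTS:
    """Chains frontend -> acoustic model -> vocoder; each stage is swappable."""
    frontend: Callable[[str], List[int]]
    acoustic_model: Callable[[List[int]], Mel]
    vocoder: Callable[[Mel], Wave]

    def synthesize(self, text: str) -> Wave:
        token_ids = self.frontend(text)        # text -> token IDs
        mel = self.acoustic_model(token_ids)   # token IDs -> mel-spectrogram
        return self.vocoder(mel)               # mel-spectrogram -> waveform


pipeline = TwoStageTTS(toy_frontend, toy_acoustic_model, toy_vocoder)
wave = pipeline.synthesize("hi there")
print(len(wave))  # 7 tokens * 4 frames * 256 samples = 7168
```

Because the stages only share the mel-spectrogram as an interface, any acoustic model can in principle be paired with any vocoder that accepts the same mel configuration, which is the main appeal of the modular design.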

618 stars. No commits in the last 6 months.

Archived, stale (6 months), no package, no dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

Stars 618

Forks

Language Python

License not specified

Higher-rated alternatives

shibing624/parrots

Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine. Chinese and English speech recognition and multi-speaker speech synthesis, with multilingual support and high accuracy

MainRo/deepspeech-server

A testing server for a speech to text service based on coqui.ai

altunenes/parakeet-rs

Very fast speech-to-text, diarization, and streaming (even on CPU) with NVIDIA Parakeet in Rust

thewh1teagle/pyannote-rs

pyannote audio diarization in Rust

daanzu/deepspeech-websocket-server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
