PaddlePaddle/Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
ArchivedBuilt on PaddlePaddle's dynamic graph framework, Parakeet implements a modular two-stage TTS pipeline combining acoustic models (mel-spectrogram prediction) with neural vocoders for waveform synthesis. It provides standardized data preprocessing, text frontend processing (including rule-based Chinese text analysis), and enables voice cloning through speaker embedding transfer learning. The toolkit targets multilingual synthesis across Chinese and English datasets (CSMSC, AISHELL-3, LJSpeech) with pre-trained checkpoints for production deployment.
618 stars. No commits in the last 6 months.
Stars
618
Forks
83
Language
Python
License
—
Category
Last pushed
Nov 19, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/PaddlePaddle/Parakeet"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
shibing624/parrots
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
MainRo/deepspeech-server
A testing server for a speech to text service based on coqui.ai
altunenes/parakeet-rs
very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA Parakeet in Rust
thewh1teagle/pyannote-rs
pyannote audio diarization in rust
daanzu/deepspeech-websocket-server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments