coqui-ai/STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

/ 100

Emerging

Built on TensorFlow, it supports streaming inference for real-time transcription, reports a confidence score for each hypothesis, and supports multi-GPU training. The toolkit provides bindings for multiple programming languages and includes a compact acoustic model designed for low-footprint deployment on edge devices.
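As a rough illustration of the streaming and per-hypothesis confidence features described above, here is a minimal sketch in Python. It assumes a stream object exposing `feedAudioContent` and a metadata object exposing `transcripts`, each with `tokens` and a `confidence`, which is how the Python binding is understood to be shaped; check the project documentation before relying on these names. `FakeStream` and `fake_metadata` are hypothetical stand-ins so the sketch runs without the library installed, and the confidence values are arbitrary.

```python
from types import SimpleNamespace


def iter_chunks(samples, chunk_size):
    """Yield consecutive slices of at most chunk_size samples."""
    for start in range(0, len(samples), chunk_size):
        yield samples[start:start + chunk_size]


def feed_in_chunks(stream, samples, chunk_size=320):
    """Feed audio to a streaming decoder piece by piece; return the chunk count.

    Assumption: `stream` has a feedAudioContent(chunk) method. The real
    library is expected to want 16-bit PCM samples (e.g. a NumPy int16 array).
    """
    fed = 0
    for chunk in iter_chunks(samples, chunk_size):
        stream.feedAudioContent(chunk)
        fed += 1
    return fed


def best_hypothesis(metadata):
    """Return (text, confidence) for the highest-confidence candidate.

    Assumption: `metadata.transcripts` is a list of candidates, each with a
    numeric `confidence` and a list of `tokens` carrying a `text` field.
    """
    best = max(metadata.transcripts, key=lambda t: t.confidence)
    text = "".join(token.text for token in best.tokens)
    return text, best.confidence


class FakeStream:
    """Hypothetical stand-in that records the chunks it receives."""

    def __init__(self):
        self.chunks = []

    def feedAudioContent(self, chunk):
        self.chunks.append(chunk)


def fake_metadata():
    """Hypothetical metadata with two candidate transcripts."""
    def transcript(text, confidence):
        tokens = [SimpleNamespace(text=ch) for ch in text]
        return SimpleNamespace(tokens=tokens, confidence=confidence)

    return SimpleNamespace(
        transcripts=[transcript("hello", -3.2), transcript("hallo", -7.9)]
    )


if __name__ == "__main__":
    stream = FakeStream()
    print(feed_in_chunks(stream, [0] * 1000, chunk_size=320))
    print(best_hypothesis(fake_metadata()))
```

With a real model, the stream and metadata would come from the library rather than from the stand-ins; the chunking and ranking helpers would stay the same.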

2,572 stars. No commits in the last 6 months.

Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25


Stars: 2,572

Forks: 302

Language: C++

License: MPL-2.0

Higher-rated alternatives

pnnbao97/VieNeu-TTS

Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...

r9y9/nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Softcatala/open-dubbing

Open dubbing is an AI dubbing system which uses machine learning models to automatically...

babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
