agent87/RW-DEEPSPEECH-API

An end-to-end deep speech REST API providing speech-to-text and text-to-speech services for Kinyarwanda.

Emerging

Leverages pre-trained models from NVIDIA (a Conformer-CTC model for STT, trained on 2,000 hours of Kinyarwanda speech) and Digital Umuganda (YourTTS for TTS, with zero-shot voice adaptation), and exposes both through a FastAPI application served by Uvicorn, with WebSocket support. Integrates MongoDB for inference logging, includes Docker containerization for deployment, and uses the Transformers and NeMo libraries for model inference, with support for domain-specific fine-tuning.
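As a rough illustration of the inference-logging idea mentioned above, the following is a minimal sketch using only the Python standard library. It is not taken from the repository: the names `with_inference_log`, `fake_transcribe`, and the fields of the log document are all hypothetical, and an in-memory list stands in for the MongoDB collection.

```python
import time
from datetime import datetime, timezone
from typing import Any, Callable, Dict, List

# Hypothetical names throughout: none of these come from the repository.
LogSink = Callable[[Dict[str, Any]], Any]


def with_inference_log(
    service: str, infer: Callable[[Any], Any], sink: LogSink
) -> Callable[[Any], Any]:
    """Wrap an inference callable so that every call emits one log document."""

    def wrapped(payload: Any) -> Any:
        started = time.perf_counter()
        result = infer(payload)
        sink(
            {
                "service": service,  # e.g. "stt" or "tts"
                "timestamp": datetime.now(timezone.utc).isoformat(),
                "latency_ms": (time.perf_counter() - started) * 1000.0,
                "input_size": len(payload),  # bytes of audio or characters of text
            }
        )
        return result

    return wrapped


# Stand-in for a real model call; assumed for illustration, not the project's code.
def fake_transcribe(audio: bytes) -> str:
    return f"<transcript of {len(audio)} bytes>"


# In-memory sink; a MongoDB collection's insert_one method could be passed instead.
logs: List[Dict[str, Any]] = []
transcribe = with_inference_log("stt", fake_transcribe, logs.append)
text = transcribe(b"\x00" * 320)
```

Passing the sink as a plain callable keeps the wrapper independent of the storage backend, so the same function could wrap both the STT and TTS handlers regardless of where the records end up.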

No Package

No Dependents

Maintenance 10 / 25

Adoption 5 / 25

Maturity 9 / 25

Community 16 / 25

Language: Jupyter Notebook

License: GPL-3.0

Higher-rated alternatives

herimor/voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control

EveryVoiceTTS/EveryVoice

The EveryVoice TTS Toolkit - Text To Speech for your language

kadirnar/VoiceHub

VoiceHub: A Unified Inference Interface for TTS Models

NeonGeckoCom/neon-tts-plugin-coqui

Coqui AI TTS plugin

Atm4x/tts-with-rvc

TTS with RVC-module to generate .wav audios
