kristofferv98/VoiceProcessingToolkit

The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications

/ 100

Emerging

Built on Picovoice Porcupine for wake word detection, OpenAI's Whisper for speech-to-text, and ElevenLabs API for synthesis, the toolkit provides a unified `VoiceProcessingManager` class that orchestrates the full pipeline with configurable audio parameters (sample rate, channels, buffer size) and voice activity detection thresholds. Includes decorator-based action registration for wake word triggers and integrates with multi-agent frameworks like AutoGen for building voice-activated assistants.

No commits in the last 6 months. Available on PyPI.

Stale 6m

Maintenance 2 / 25

Adoption 8 / 25

Maturity 18 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

herimor/voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control

EveryVoiceTTS/EveryVoice

The EveryVoice TTS Toolkit - Text To Speech for your language

kadirnar/VoiceHub

VoiceHub: A Unified Inference Interface for TTS Models

NeonGeckoCom/neon-tts-plugin-coqui

Coqui AI TTS plugin

Atm4x/tts-with-rvc

TTS with RVC-module to generate .wav audios

Explore Voice AI Tools

All categories Trending Voice AI directory Insights