kristofferv98/VoiceProcessingToolkit
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications
Built on Picovoice Porcupine for wake word detection, OpenAI's Whisper for speech-to-text, and ElevenLabs API for synthesis, the toolkit provides a unified `VoiceProcessingManager` class that orchestrates the full pipeline with configurable audio parameters (sample rate, channels, buffer size) and voice activity detection thresholds. Includes decorator-based action registration for wake word triggers and integrates with multi-agent frameworks like AutoGen for building voice-activated assistants.
No commits in the last 6 months. Available on PyPI.
Stars
4
Forks
1
Language
Python
License
—
Category
Last pushed
Jun 05, 2025
Monthly downloads
159
Commits (30d)
0
Dependencies
10
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kristofferv98/VoiceProcessingToolkit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
herimor/voxtream
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
EveryVoiceTTS/EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language
kadirnar/VoiceHub
VoiceHub: A Unified Inference Interface for TTS Models
NeonGeckoCom/neon-tts-plugin-coqui
Coqui AI TTS plugin
Atm4x/tts-with-rvc
TTS with RVC-module to generate .wav audios