kristofferv98/VoiceProcessingToolkit

The VoiceProcessingToolkit is a suite for voice detection, wake word recognition, text-to-speech synthesis, and audio processing. It offers interfaces that streamline the integration of voice processing capabilities into your applications.

Score: 40 / 100 (Emerging)

Built on Picovoice Porcupine for wake word detection, OpenAI's Whisper for speech-to-text, and the ElevenLabs API for synthesis, the toolkit provides a unified `VoiceProcessingManager` class that orchestrates the full pipeline with configurable audio parameters (sample rate, channels, buffer size) and voice activity detection thresholds. It also includes decorator-based action registration for wake word triggers and integrates with multi-agent frameworks such as AutoGen for building voice-activated assistants.

No commits in the last 6 months. Available on PyPI.

Maintenance 2 / 25
Adoption 8 / 25
Maturity 18 / 25
Community 12 / 25
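The four subscores listed here add up to the overall score of 40 out of 100. The short check below shows the arithmetic; it assumes the overall score is a plain sum of the four categories, which matches the numbers on this page but is not stated explicitly.

```python
# Subscores as listed on the page, each out of 25.
subscores = {"Maintenance": 2, "Adoption": 8, "Maturity": 18, "Community": 12}

# Assumption: the overall score is the simple sum of the four categories.
total = sum(subscores.values())
maximum = 25 * len(subscores)

print(f"{total} / {maximum}")  # prints "40 / 100"
```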


Stars: 4
Forks: 1
Language: Python
License: (none listed)
Last pushed: Jun 05, 2025
Monthly downloads: 159
Commits (30d): 0
Dependencies: 10

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kristofferv98/VoiceProcessingToolkit"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
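The same request can be made from Python using only the standard library. This is a minimal sketch: the helper names `quality_url` and `fetch_quality` are made up for this example, the URL layout is inferred from the single curl command above, and the code assumes the endpoint returns a JSON body, which the page does not confirm.

```python
import json
import urllib.request

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (layout inferred from the curl example)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data for a repository.

    Assumption: the response body is JSON. Adjust the parsing if it is not.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("voice-ai", "kristofferv98", "VoiceProcessingToolkit")
```

With the keyless limit of 100 requests/day, it is worth caching the result locally rather than fetching it on every run.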