mrtozner/vox

Local voice AI framework for Rust. Whisper + LLM + TTS with no cloud dependencies.

/ 100

Emerging

Pluggable VAD, STT, and TTS backends via trait-based architecture let you swap Whisper/Sherpa/streaming engines and TTS providers (Kokoro, Piper, Pocket, Chatterbox). Exposes both CLI, Rust/Python libraries, and HTTP/WebSocket APIs for real-time streaming transcription and synthesis. Auto-downloads models on first run with configurable backends—Silero VAD feeds audio to chosen STT, results flow through user callbacks, optionally triggering TTS playback.

No Package No Dependents

Maintenance 10 / 25

Adoption 6 / 25

Maturity 9 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Rust

License

Apache-2.0

Higher-rated alternatives

TrevorS/voxtral-mini-realtime-rs

Streaming speech recognition running natively and in the browser. A pure Rust implementation of...

izwi-ai/izwi

On-device AI engine for transcription, TTS, and voice workflows.

darkautism/sensevoice-rs

A Rust-based, SenseVoiceSmall

thewh1teagle/vad-rs

Speech detection using silero vad in Rust

0xPD33/sonori

Sonori is a fully local STT app for Linux (Wayland).

Explore Voice AI Tools

All categories Trending Voice AI directory Insights