izwi-ai/izwi

On-device AI engine for transcription, TTS, and voice workflows.

/ 100

Emerging

Runs entirely on-device with no cloud dependencies, exposing OpenAI-compatible API endpoints for seamless integration. Supports advanced audio workflows including speaker diarization, voice cloning, voice design from text descriptions, and forced word-level alignment—all powered by a unified model ecosystem (Qwen3, Parakeet, Whisper, Gemma). Long-form ASR automatically chunks and stitches overlapping transcripts with configurable parameters, eliminating model window limitations.

181 stars.

No License No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 3 / 25

Community 14 / 25

How are scores calculated?

Stars

181

Forks

Language

Rust

License

—

Higher-rated alternatives

TrevorS/voxtral-mini-realtime-rs

Streaming speech recognition running natively and in the browser. A pure Rust implementation of...

mrtozner/vox

Local voice AI framework for Rust. Whisper + LLM + TTS with no cloud dependencies.

darkautism/sensevoice-rs

A Rust-based, SenseVoiceSmall

thewh1teagle/vad-rs

Speech detection using silero vad in Rust

0xPD33/sonori

Sonori is a fully local STT app for Linux (Wayland).

Explore Voice AI Tools

All categories Trending Voice AI directory Insights