alumae/kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.

/ 100

Established

Supports WebSocket-based streaming with partial hypothesis updates, automatic audio segmentation on silence, and lattice rescoring with large language models. The architecture uses a master-worker pattern enabling horizontal scaling across machines, with workers handling individual recognition sessions independently using either GMM-HMM or DNN-based acoustic models. Includes built-in post-processing hooks for result transformation and multiple client SDKs (Python, Java, JavaScript, Haskell).

1,092 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25

How are scores calculated?

Stars

1,092

Forks

339

Language

Python

License

BSD-2-Clause

Related tools

daanzu/kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

nttcslab-sp/kaldiio

A pure python module for reading and writing kaldi ark files

gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as...

pykaldi/pykaldi

A Python wrapper for Kaldi

scarletcho/KoLM

Korean text normalization and language preparation package for LM in Kaldi-based ASR system

Explore Voice AI Tools

All categories Trending Voice AI directory Insights