Kaldi ASR Ecosystem Voice AI Tools
Tools, recipes, models, and utilities built on or for the Kaldi ASR framework, including language-specific implementations, format converters, and training pipelines. Does NOT include non-Kaldi ASR systems, general speech recognition APIs, or TTS tools.
There are 66 kaldi asr ecosystem tools tracked. 5 score above 50 (established tier). The highest-rated is daanzu/kaldi-active-grammar at 61/100 with 347 stars and 749 monthly downloads.
Get all 66 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=kaldi-asr-ecosystem&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set... |
|
Established |
| 2 |
nttcslab-sp/kaldiio
A pure python module for reading and writing kaldi ark files |
|
Established |
| 3 |
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project. |
|
Established |
| 4 |
gooofy/py-kaldi-asr
Some simple wrappers around kaldi-asr intended to make using kaldi's... |
|
Established |
| 5 |
pykaldi/pykaldi
A Python wrapper for Kaldi |
|
Established |
| 6 |
scarletcho/KoLM
Korean text normalization and language preparation package for LM in... |
|
Emerging |
| 7 |
alumae/kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit... |
|
Emerging |
| 8 |
jcsilva/docker-kaldi-gstreamer-server
Dockerfile for kaldi-gstreamer-server. |
|
Emerging |
| 9 |
alumae/gst-kaldi-nnet2-online
GStreamer plugin around Kaldi's online neural network decoder |
|
Emerging |
| 10 |
goodatlas/zeroth
Kaldi-based Korean ASR (한국어 음성인식) open-source project |
|
Emerging |
| 11 |
revdotcom/fstalign
An efficient OpenFST-based tool for calculating WER and aligning two... |
|
Emerging |
| 12 |
ARBML/klaam
Arabic speech recognition, classification and text-to-speech. |
|
Emerging |
| 13 |
XiaoMi/kaldi-onnx
Kaldi model converter to ONNX |
|
Emerging |
| 14 |
alumae/kaldi-offline-transcriber
Offline transcription system for Estonian using Kaldi |
|
Emerging |
| 15 |
YoavRamon/awesome-kaldi
This is a list of features, scripts, blogs and resources for better using... |
|
Emerging |
| 16 |
jimbozhang/kaldi-gop
Kaldi-based goodness of pronunciation (GOP) |
|
Emerging |
| 17 |
dspavankumar/keras-kaldi
Keras Interface for Kaldi ASR |
|
Emerging |
| 18 |
skit-ai/kaldi-serve
Server framework for Kaldi ASR Toolkit |
|
Emerging |
| 19 |
opensource-spraakherkenning-nl/Kaldi_NL
Code related to the Dutch instance and user groups of the KALDI speech... |
|
Emerging |
| 20 |
m1el/nemotron-asr.cpp
Nemotron ASR rewrite to GGML |
|
Emerging |
| 21 |
srinivr/kaldi-long-audio-alignment
Long audio alignment using Kaldi |
|
Emerging |
| 22 |
uiuc-sst/asr24
24-hour Automatic Speech Recognition |
|
Emerging |
| 23 |
loretoparisi/htk
HTK Toolkit with Linux 64 bit and Docker support |
|
Emerging |
| 24 |
collectivat/cmusphinx-models
Acoustic and language models for minorised languages. |
|
Emerging |
| 25 |
scarletcho/prep4kaldi
Data preparation code for building Kaldi ASR system |
|
Emerging |
| 26 |
falabrasil/kaldi-br
☕🇧🇷 Scripts para o Kaldi em Português Brasileiro |
|
Emerging |
| 27 |
daanzu/kaldi_ag_training
Docker image and scripts for training finetuned or completely personal Kaldi... |
|
Emerging |
| 28 |
lars76/forced-alignment-chinese
Mandarin Chinese audio datasets aligned with Montreal Forced Aligner |
|
Emerging |
| 29 |
Ma-Dan/asr-decode
从Kaldi中裁剪的轻量级语音识别解码推理框架,目前实现了MFCC+GMM+Viterbi,不依赖OpenFST、OpenBLAS等库 |
|
Experimental |
| 30 |
hmeutzner/kaldi-avsr
Kaldi-based audio-visual speech recognition |
|
Experimental |
| 31 |
jcsilva/docker-kaldi-android
Dockerfile for compiling Kaldi for Android. |
|
Experimental |
| 32 |
Hamahmi/kaldi-tut
This is a Kaldi tutorial for beginners |
|
Experimental |
| 33 |
srvk/srvk-eesen-offline-transcriber
Top level code to transcribe English audio/video files into text/subtitles |
|
Experimental |
| 34 |
Anwarvic/Arabic-Speech-Recognition
This repository contains my attempt to use two famous speech recognition... |
|
Experimental |
| 35 |
ZoraizQ/urdu-speech-recognition
Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs... |
|
Experimental |
| 36 |
falabrasil/cmusphinx-br
Scripts e recursos para ASR em Português Brasileiro |
|
Experimental |
| 37 |
tsengia/SphinxTrainHelper
A Bash script designed to make training sphinx4 and pocketsphinx acoustic... |
|
Experimental |
| 38 |
synesthesiam/pt-synesthesiam
CMU Sphinx acoustic model for Portugese (pt-br) |
|
Experimental |
| 39 |
mcw519/Brownie
Post processing for speech recognition |
|
Experimental |
| 40 |
amirharati/kaldi-alligner
scripts to align a given wave to its transcription using trained models by Kaldi |
|
Experimental |
| 41 |
pigzach/MagicSpeechASR
magicspeech competition recipe |
|
Experimental |
| 42 |
SethiPawandeep/kaldi-for-dummies
This is the repository for my version of Kaldi for Dummies example. |
|
Experimental |
| 43 |
german-asr/kaldi-german
Scripts for training Kaldi for German speech recognition (ASR). |
|
Experimental |
| 44 |
t13m/kaldi-readers-for-tensorflow
readers that enable reading kaldi ark in tensorflow |
|
Experimental |
| 45 |
jailuthra/asr
Kaldi ASR wrapper scripts |
|
Experimental |
| 46 |
aalto-speech/finnish-parliament-scripts
Scripts for retrieving and aligning speech and meeting transcripts from the... |
|
Experimental |
| 47 |
tifaniwarnita/indonesian-asr
Automatic speech recognition (ASR) for Indonesian language built by using... |
|
Experimental |
| 48 |
lyncisdev/voco
Create a speech recognition system for programming by voice using Kaldi |
|
Experimental |
| 49 |
FarawaySail/Kaldi_thchs30
媒体与认知语音识别大作业 |
|
Experimental |
| 50 |
mvshyvk/KaldiService
Service for easy access to speech recognition capabilities of Kaldi using... |
|
Experimental |
| 51 |
JarbasAl/pocketsphinx-models-mirror
pocketsphinx models for languages originating from the iberian peninsula |
|
Experimental |
| 52 |
bagustris/id
Iban-based Kaldi recipe for Indonesian speech Corpus, presented at ASJ Spring 2019. |
|
Experimental |
| 53 |
Agrover112/Goodness-of-Pronunciation-Pipelines-for-OOV-Problem
Goodness of Pronunciation Pipelines for OOV Removal |
|
Experimental |
| 54 |
mathquis/node-kaldi-online-nnet3-decoder
ASR online decoding using Kaldi NNet3 GrammarFST |
|
Experimental |
| 55 |
conbitin/htk3.5-install
Installation steps of HTK 3.5 under Ubuntu |
|
Experimental |
| 56 |
keymastervn/htksupport
Minimal HTK for supporting HTK in Vietnamese. |
|
Experimental |
| 57 |
alx741/kaldi_spanish_dimex100
Kaldi ASR Spanish example using the DIMEx100 corpus |
|
Experimental |
| 58 |
sidgupta234/Indian_English_ASR
An Indian English ASR system based on Hidden Markov Models (HMM) has been... |
|
Experimental |
| 59 |
burrmill/burrmill
BurrMill core |
|
Experimental |
| 60 |
jerrykuo7727/ASR-common-voice-zh-tw
HMM-based ASR systems trained on CommonVoice(zh-TW) using Kaldi. |
|
Experimental |
| 61 |
asrajeh/kaldi-arabic
HHM-based Arabic ASR using Kaldi engine |
|
Experimental |
| 62 |
sasivatsal7122/Ckrett-package-pypi
a very basic ciphering/deciphering tool |
|
Experimental |
| 63 |
lormaechea/kaldi-grammar-compiler
A minimal tool that helps transforming fixed grammars into compiled Finite... |
|
Experimental |
| 64 |
cassiotbatista/asr-remote
TV Remote Control via Offline Speech Recognition |
|
Experimental |
| 65 |
falabrasil/espnet-br
📍🇧🇷 Scripts para o ESPnet em Português Brasileiro |
|
Experimental |
| 66 |
falabrasil/htk-br
Scripts para treino de modelos acústicos |
|
Experimental |