Kaldi ASR Ecosystem Voice AI Tools

Tools, recipes, models, and utilities built on or for the Kaldi ASR framework, including language-specific implementations, format converters, and training pipelines. Does NOT include non-Kaldi ASR systems, general speech recognition APIs, or TTS tools.

There are 66 kaldi asr ecosystem tools tracked. 5 score above 50 (established tier). The highest-rated is daanzu/kaldi-active-grammar at 61/100 with 347 stars and 749 monthly downloads.

Get all 66 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=kaldi-asr-ecosystem&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 daanzu/kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set...

61
Established
2 nttcslab-sp/kaldiio

A pure python module for reading and writing kaldi ark files

60
Established
3 kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

53
Established
4 gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's...

51
Established
5 pykaldi/pykaldi

A Python wrapper for Kaldi

50
Established
6 scarletcho/KoLM

Korean text normalization and language preparation package for LM in...

48
Emerging
7 alumae/kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit...

44
Emerging
8 jcsilva/docker-kaldi-gstreamer-server

Dockerfile for kaldi-gstreamer-server.

44
Emerging
9 alumae/gst-kaldi-nnet2-online

GStreamer plugin around Kaldi's online neural network decoder

43
Emerging
10 goodatlas/zeroth

Kaldi-based Korean ASR (한국어 음성인식) open-source project

43
Emerging
11 revdotcom/fstalign

An efficient OpenFST-based tool for calculating WER and aligning two...

42
Emerging
12 ARBML/klaam

Arabic speech recognition, classification and text-to-speech.

42
Emerging
13 XiaoMi/kaldi-onnx

Kaldi model converter to ONNX

41
Emerging
14 alumae/kaldi-offline-transcriber

Offline transcription system for Estonian using Kaldi

41
Emerging
15 YoavRamon/awesome-kaldi

This is a list of features, scripts, blogs and resources for better using...

40
Emerging
16 jimbozhang/kaldi-gop

Kaldi-based goodness of pronunciation (GOP)

40
Emerging
17 dspavankumar/keras-kaldi

Keras Interface for Kaldi ASR

40
Emerging
18 skit-ai/kaldi-serve

Server framework for Kaldi ASR Toolkit

38
Emerging
19 opensource-spraakherkenning-nl/Kaldi_NL

Code related to the Dutch instance and user groups of the KALDI speech...

36
Emerging
20 m1el/nemotron-asr.cpp

Nemotron ASR rewrite to GGML

34
Emerging
21 srinivr/kaldi-long-audio-alignment

Long audio alignment using Kaldi

32
Emerging
22 uiuc-sst/asr24

24-hour Automatic Speech Recognition

32
Emerging
23 loretoparisi/htk

HTK Toolkit with Linux 64 bit and Docker support

32
Emerging
24 collectivat/cmusphinx-models

Acoustic and language models for minorised languages.

32
Emerging
25 scarletcho/prep4kaldi

Data preparation code for building Kaldi ASR system

31
Emerging
26 falabrasil/kaldi-br

☕🇧🇷 Scripts para o Kaldi em Português Brasileiro

31
Emerging
27 daanzu/kaldi_ag_training

Docker image and scripts for training finetuned or completely personal Kaldi...

30
Emerging
28 lars76/forced-alignment-chinese

Mandarin Chinese audio datasets aligned with Montreal Forced Aligner

30
Emerging
29 Ma-Dan/asr-decode

从Kaldi中裁剪的轻量级语音识别解码推理框架,目前实现了MFCC+GMM+Viterbi,不依赖OpenFST、OpenBLAS等库

29
Experimental
30 hmeutzner/kaldi-avsr

Kaldi-based audio-visual speech recognition

29
Experimental
31 jcsilva/docker-kaldi-android

Dockerfile for compiling Kaldi for Android.

29
Experimental
32 Hamahmi/kaldi-tut

This is a Kaldi tutorial for beginners

29
Experimental
33 srvk/srvk-eesen-offline-transcriber

Top level code to transcribe English audio/video files into text/subtitles

25
Experimental
34 Anwarvic/Arabic-Speech-Recognition

This repository contains my attempt to use two famous speech recognition...

25
Experimental
35 ZoraizQ/urdu-speech-recognition

Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs...

25
Experimental
36 falabrasil/cmusphinx-br

Scripts e recursos para ASR em Português Brasileiro

24
Experimental
37 tsengia/SphinxTrainHelper

A Bash script designed to make training sphinx4 and pocketsphinx acoustic...

24
Experimental
38 synesthesiam/pt-synesthesiam

CMU Sphinx acoustic model for Portugese (pt-br)

24
Experimental
39 mcw519/Brownie

Post processing for speech recognition

24
Experimental
40 amirharati/kaldi-alligner

scripts to align a given wave to its transcription using trained models by Kaldi

24
Experimental
41 pigzach/MagicSpeechASR

magicspeech competition recipe

24
Experimental
42 SethiPawandeep/kaldi-for-dummies

This is the repository for my version of Kaldi for Dummies example.

24
Experimental
43 german-asr/kaldi-german

Scripts for training Kaldi for German speech recognition (ASR).

24
Experimental
44 t13m/kaldi-readers-for-tensorflow

readers that enable reading kaldi ark in tensorflow

23
Experimental
45 jailuthra/asr

Kaldi ASR wrapper scripts

23
Experimental
46 aalto-speech/finnish-parliament-scripts

Scripts for retrieving and aligning speech and meeting transcripts from the...

22
Experimental
47 tifaniwarnita/indonesian-asr

Automatic speech recognition (ASR) for Indonesian language built by using...

22
Experimental
48 lyncisdev/voco

Create a speech recognition system for programming by voice using Kaldi

22
Experimental
49 FarawaySail/Kaldi_thchs30

媒体与认知语音识别大作业

21
Experimental
50 mvshyvk/KaldiService

Service for easy access to speech recognition capabilities of Kaldi using...

21
Experimental
51 JarbasAl/pocketsphinx-models-mirror

pocketsphinx models for languages originating from the iberian peninsula

20
Experimental
52 bagustris/id

Iban-based Kaldi recipe for Indonesian speech Corpus, presented at ASJ Spring 2019.

20
Experimental
53 Agrover112/Goodness-of-Pronunciation-Pipelines-for-OOV-Problem

Goodness of Pronunciation Pipelines for OOV Removal

20
Experimental
54 mathquis/node-kaldi-online-nnet3-decoder

ASR online decoding using Kaldi NNet3 GrammarFST

20
Experimental
55 conbitin/htk3.5-install

Installation steps of HTK 3.5 under Ubuntu

18
Experimental
56 keymastervn/htksupport

Minimal HTK for supporting HTK in Vietnamese.

16
Experimental
57 alx741/kaldi_spanish_dimex100

Kaldi ASR Spanish example using the DIMEx100 corpus

15
Experimental
58 sidgupta234/Indian_English_ASR

An Indian English ASR system based on Hidden Markov Models (HMM) has been...

15
Experimental
59 burrmill/burrmill

BurrMill core

15
Experimental
60 jerrykuo7727/ASR-common-voice-zh-tw

HMM-based ASR systems trained on CommonVoice(zh-TW) using Kaldi.

15
Experimental
61 asrajeh/kaldi-arabic

HHM-based Arabic ASR using Kaldi engine

14
Experimental
62 sasivatsal7122/Ckrett-package-pypi

a very basic ciphering/deciphering tool

13
Experimental
63 lormaechea/kaldi-grammar-compiler

A minimal tool that helps transforming fixed grammars into compiled Finite...

12
Experimental
64 cassiotbatista/asr-remote

TV Remote Control via Offline Speech Recognition

11
Experimental
65 falabrasil/espnet-br

📍🇧🇷 Scripts para o ESPnet em Português Brasileiro

11
Experimental
66 falabrasil/htk-br

Scripts para treino de modelos acústicos

10
Experimental

Comparisons in this category