Automatic Speech Recognition Voice AI Tools
Libraries, frameworks, and tools for building, training, and evaluating automatic speech recognition (ASR) systems. Does NOT include pre-built transcription APIs, TTS systems, or ASR applications like transcription apps or meeting summarizers.
There are 161 automatic speech recognition tools tracked. 3 score above 70 (verified tier). The highest-rated is Uberi/speech_recognition at 90/100 with 8,959 stars. 2 of the top 10 are actively maintained.
Get all 161 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=automatic-speech-recognition&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs,... |
|
Verified |
| 2 |
cmusphinx/pocketsphinx
A small speech recognizer |
|
Verified |
| 3 |
istupakov/onnx-asr
A lightweight Python package for Automatic Speech Recognition using ONNX models |
|
Verified |
| 4 |
PyThaiNLP/pythaiasr
Python Thai Automatic Speech Recognition |
|
Established |
| 5 |
haoheliu/voicefixer
General Speech Restoration |
|
Established |
| 6 |
astorfi/speechpy
:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition:... |
|
Established |
| 7 |
tensorflow/lingvo
Lingvo |
|
Established |
| 8 |
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA... |
|
Established |
| 9 |
dictation-toolbox/dragonfly
Speech recognition framework allowing powerful Python-based scripting and... |
|
Established |
| 10 |
rwth-i6/rasr
The RWTH ASR Toolkit. |
|
Established |
| 11 |
sccn/eegprep
EEGPrep is an automated preprocessing tool for human EEG data built on a... |
|
Established |
| 12 |
at16k/at16k
Trained models for automatic speech recognition (ASR). A library to quickly... |
|
Established |
| 13 |
pierreaubert/spinorama
A library to display and compare spinorama (speakers measurements) graphs. |
|
Established |
| 14 |
zw76859420/ASR_Theory
语音识别理论、论文和PPT |
|
Established |
| 15 |
bambocher/pocketsphinx-python
Python interface to CMU Sphinxbase and Pocketsphinx libraries |
|
Established |
| 16 |
FunAudioLLM/Fun-ASR
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab. |
|
Established |
| 17 |
jonatasgrosman/asrecognition
ASRecognition: just an easy-to-use library for Automatic Speech Recognition. |
|
Emerging |
| 18 |
gooofy/zamia-speech
Open tools and data for cloudless automatic speech recognition |
|
Emerging |
| 19 |
funcwj/aps
A personal toolkit for single/multi-channel speech recognition & enhancement... |
|
Emerging |
| 20 |
drmfinlay/pyjsgf
JSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python. |
|
Emerging |
| 21 |
sovaai/sova-asr
SOVA ASR (Automatic Speech Recognition) |
|
Emerging |
| 22 |
Pankaj-Baranwal/pocketsphinx
Updated ROS bindings to pocketsphinx |
|
Emerging |
| 23 |
jianchang512/zh_recogn
将音频或视频中的中文语音识别并导出为srt字幕,基于魔塔社区Paraformer模型 |
|
Emerging |
| 24 |
AASHISHAG/deepspeech-german
Automatic Speech Recognition (ASR) - German |
|
Emerging |
| 25 |
huawei-noah/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei... |
|
Emerging |
| 26 |
revdotcom/reverb
Open source inference code for Rev's model |
|
Emerging |
| 27 |
MHaggis/ASRGEN
ASR Configurator, Essentials and Atomic Testing |
|
Emerging |
| 28 |
finos/greenkey-asrtoolkit
A collection of useful tools for handling speech recognition data |
|
Emerging |
| 29 |
petewarden/spchcat
Speech recognition tool to convert audio to text transcripts, for Linux and... |
|
Emerging |
| 30 |
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and... |
|
Emerging |
| 31 |
tarun7r/SpeechAlgo
A Comprehensive Speech Processing Algorithms Library for research and production use |
|
Emerging |
| 32 |
belambert/asr-tools
Libraries and scripts for manipulating and handling ASR output/n-bests/etc. |
|
Emerging |
| 33 |
racai-ai/RobinASR
Romanian Automatic Speech Recognition from the ROBIN project |
|
Emerging |
| 34 |
AudioLLMs/AudioBench
AudioBench: A Universal Benchmark for Audio Large Language Models |
|
Emerging |
| 35 |
zakuro-ai/asr
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in... |
|
Emerging |
| 36 |
zthxxx/python-Speech_Recognition
A simple example for use speech recognition baidu api with python. |
|
Emerging |
| 37 |
SergeyShk/Speech-to-Text-Russian
Проект для распознавания речи на русском языке на основе pykaldi. |
|
Emerging |
| 38 |
SpeechColab/Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform... |
|
Emerging |
| 39 |
linagora-labs/ssak
SSAK contains helpers and tools to process data and train/infer ASR models. |
|
Emerging |
| 40 |
mravanelli/pySpeechRev
This python code performs an efficient speech reverberation starting from a... |
|
Emerging |
| 41 |
drivendataorg/childrens-speech-recognition-benchmark-pub
Tutorial code for the On Top of Pasketti: Children’s Speech Recognition Challenge |
|
Emerging |
| 42 |
Voice-Privacy-Challenge/Voice-Privacy-Challenge-2020
Baseline Recipe for VoicePrivacy Challenge 2020:... |
|
Emerging |
| 43 |
Franck-Dernoncourt/ASR_benchmark
Program to benchmark various speech recognition APIs |
|
Emerging |
| 44 |
goodmike31/pl-asr-bigos-tools
Extendable toolkit for comprehensive evaluation of ASR systems. Currently... |
|
Emerging |
| 45 |
HaoQChen/iflytek_awaken_asr
use iflytek's technology to realize awaken and order recognition |
|
Emerging |
| 46 |
efeslab/LiteASR
[EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with... |
|
Emerging |
| 47 |
Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and... |
|
Emerging |
| 48 |
sp-squared/Turkic-Languages-Audio-to-Text-Transcription
Open-source Automatic Speech Recognition (ASR) pipeline for Bashkir... |
|
Emerging |
| 49 |
dmatekenya/Chichewa-Speech2Text
Automated Speech Recognition for Chichewa. |
|
Emerging |
| 50 |
atharva-again/indic-asr-onnx
Helper package for using quantized versions of the Indic ASR Model by AI4Bharat. |
|
Emerging |
| 51 |
fquirin/speech-recognition-experiments
Experiments to test different speech recognition systems for SEPIA Framework |
|
Emerging |
| 52 |
mramshaw/Speech-Recognition
Speech recognition with Python |
|
Emerging |
| 53 |
jopedroliveira/speech_recog_uc
Speech processing ROS-package. Performs speech recognition and estimates the... |
|
Emerging |
| 54 |
jfainberg/lattice_combination
Lattice combination algorithm to combine inaccurate transcripts with... |
|
Emerging |
| 55 |
Abhishek-op/SR
💡Kivy-android speech recognition |
|
Emerging |
| 56 |
robmsmt/SpeechLoop
Many ASRs under one roof. With Benchmarking... answering the question. What... |
|
Emerging |
| 57 |
Audio-WestlakeU/UMA-ASR
This repository is the official implementation of unimodal aggregation (UMA)... |
|
Emerging |
| 58 |
kurianbenoy/malayalam_asr_benchmarking
A study to benchmark whisper based ASRs in Malayalam |
|
Emerging |
| 59 |
sooftware/jasper
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural... |
|
Experimental |
| 60 |
shervinemami/practice_speechrec_mappings
A game to help design a better character mapping and to learn the mapping... |
|
Experimental |
| 61 |
Forced-Alignment-and-Vowel-Extraction/fave-asr
Interface for automated transcription and time alignment of conversational... |
|
Experimental |
| 62 |
opensource-spraakherkenning-nl/asr_nl
Dutch Speech Recognition webservice |
|
Experimental |
| 63 |
chimechallenge/chime-utils
Scripts for data generation, scoring and data manifest preparation for... |
|
Experimental |
| 64 |
robotology/natural-speech
This repository contains a codebase to build automatic speech recognition... |
|
Experimental |
| 65 |
dialpad/mucs_2021_dialpad
Dialpad team's submission to the MUCS 2021 workshop |
|
Experimental |
| 66 |
dcavar/ELAN2split
Split ELAN Annotation Files and corresponding speech files into a corpus... |
|
Experimental |
| 67 |
MichaelGrafnetter/defender-asr-admx
Administrative Template (ADMX) for Microsoft Defender Attack Surface Reduction (ASR) |
|
Experimental |
| 68 |
robmsmt/CommonCorrections
Easily fix common corrections in speech! |
|
Experimental |
| 69 |
crazymidnight/speech-recognition
[WIP] Speech recognition microservice |
|
Experimental |
| 70 |
SABER-labs/SABER
Semi-Supervised Audio Baseline for Easy Reproduction |
|
Experimental |
| 71 |
Animator617/jasper
Jasper is a AI asistence programm based on deeplearning |
|
Experimental |
| 72 |
Llamacha/asr-htk-quechua
ASR for quechua language is an open source which can run in real time using... |
|
Experimental |
| 73 |
SALT-Research/SHALLOW
SHALLOW, the first hallucination benchmark for ASR models |
|
Experimental |
| 74 |
csikasote/bembaspeech-exps
Bemba ASR model obtained by fine-tuning a well performing DeepSpeech English... |
|
Experimental |
| 75 |
Cosmos-Break/asr
沪语(上海话)ASR(语音识别)模型 |
|
Experimental |
| 76 |
idiap/FiniteStateTransducers.jl
Play with Weighted Finite State Transducers (WFST) in the Julia language. |
|
Experimental |
| 77 |
ynop/spych
Scripts/Tools used for working with automatic speech recognition. |
|
Experimental |
| 78 |
linagora-labs/asr_benchmark
Toolkit to benchmark various speech recognition APIs (NeMo, Whisper...) and... |
|
Experimental |
| 79 |
belambert/asr-scripts
Lots of miscellaneous scripts to work with Sphinx ASR files and other... |
|
Experimental |
| 80 |
idiap/TIDIGITSRecipe.jl
A Julia recipe for training an ASR system using the TIDIGITS database |
|
Experimental |
| 81 |
SoCXin/ASR1606
L4 R2: ASR 624MHz Cortex-R5 Cat.1 SoC (ASR1606/ASR1602) |
|
Experimental |
| 82 |
SoCXin/ASR1601
L4 R3: ASR Cortex-R5 LTE Cat.1 SoC (ASR1601/ASR1603/ASR3601) |
|
Experimental |
| 83 |
burrmill/sph2pipe
sph2pipe v2.5. We do not maintain this, and/or accept pull requests; just... |
|
Experimental |
| 84 |
nikhilkumarsingh/Wit-Speech-API-Wrapper
A python client for interacting with Wit Speech Recognition API |
|
Experimental |
| 85 |
anshulgupta0803/ASSR
ASSR: Automatic Stuttered Speech Recognition |
|
Experimental |
| 86 |
AASHISHAG/asr-german
Automatic Speech Recognition (ASR) - German |
|
Experimental |
| 87 |
chizkidd/igbo-asr-tonal-evaluation
Systematic evaluation of tonal fidelity in facebook/omniASR-CTC-1B when... |
|
Experimental |
| 88 |
andreiliphd/speech-recognition-one-two-three-deep-learning
Speech recognition project of one, two and three said in a microphone. |
|
Experimental |
| 89 |
abdnh/anki-asr
Anki add-on for speech recognition |
|
Experimental |
| 90 |
berangerthomas/ASR.lab
Benchmarking platform for automatic speech recognition models |
|
Experimental |
| 91 |
frankcholula/sapr
Speech & Audio Processing & Recognition 🗣️ |
|
Experimental |
| 92 |
dobby-seo/korean-speech-recognition-quartznet
Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식 |
|
Experimental |
| 93 |
nmstoker/SimpleSpeechLoop
A very basic demonstration connecting speech recognition and text-to-speech |
|
Experimental |
| 94 |
SpringerNLP/Chapter8
Chapter 8: Automatic Speech Recognition |
|
Experimental |
| 95 |
raj-sutariya/gujarati_speech_recognition
Offline speech recognition for Gujarati Language. |
|
Experimental |
| 96 |
danvers/medienpaed-asr
Understanding ASR |
|
Experimental |
| 97 |
meichthys/sword_drill
Displays Bible verses from parsed microphone input. |
|
Experimental |
| 98 |
JunhoKim94/ASR_project
This repository created for the NHN ASR hackathon competition. |
|
Experimental |
| 99 |
german-asr/nvidia-jasper-german
Scripts for training NVIDIA Jasper for German Speech Recognition (ASR). |
|
Experimental |
| 100 |
asafu-art/deepspeech-kabyle
Automatic Speech Recognition (ASR) - Kabyle |
|
Experimental |
| 101 |
TicooLiu/HowTo-ASR
开源语音识别自定义数据模型训练指南 |
|
Experimental |
| 102 |
stefanpantic/asr
Automatic speech recognition using neural networks |
|
Experimental |
| 103 |
SakshiRathi77/hindiSpeechPro-Automatic-Speech-Recognization
The project,being part of Kagglex BIPOC Mentorship Program final project,... |
|
Experimental |
| 104 |
llxlr/Speech-Recognition-With-Python
Speech Recognition With Python | python语音识别 |
|
Experimental |
| 105 |
JaesungHuh/look-listen-recognise
Dataset page for Look, Listen and Recognise : character-aware audio-visual... |
|
Experimental |
| 106 |
EN10/SimpleSpeech
Simple Audio Recognition |
|
Experimental |
| 107 |
sknadig/ASR_2018_T01
Example repository for 2018 DS/NC 821 / Automatic Speech Recognition projects |
|
Experimental |
| 108 |
parvatijay2901/Hindi-ASR-and-TTS
EC499: Major Project |
|
Experimental |
| 109 |
jcsilva/asr-benchmark
Benchmark of industrial Speech Recognition systems for Brazilian Portuguese |
|
Experimental |
| 110 |
R1ckShi/SeACo-Paraformer
[ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer. |
|
Experimental |
| 111 |
ysdede/asrtk
An open-source Python toolkit designed to streamline the development and... |
|
Experimental |
| 112 |
kirshiyin89/py-searchable-audio
Search keywords in audio files with Python |
|
Experimental |
| 113 |
jarvisx17/ASR
ASR (Automatic Speech Recognition) Notebooks |
|
Experimental |
| 114 |
khanld/speechenhancement
Speech Enhancement for ASR usage |
|
Experimental |
| 115 |
OpenLake/Speech-Analyser
An App to help you improve your English fluency 🎤 |
|
Experimental |
| 116 |
AdilShamim8/BUET-CSE-Fest-2026
DL Sprint 4.0 | BUET CSE Fest 2026 — Bengali Long-Form Speech Recognition... |
|
Experimental |
| 117 |
Aprataksh/Python-Files
mic_py : Python 3 code for successful use of microphone on windows.... |
|
Experimental |
| 118 |
dangvansam/nvidia-nemo-jasper-quartznet-asr-vietnamese
Nhận dạng giọng nói Tiếng Việt sử dụng model Quartznet (Nvidia) + flask demo |
|
Experimental |
| 119 |
Aslm-Fawzy/Speech-Recognition-Using-Raspberry-Pi
Simple Speech Recognition Program Run on Raspberry Pi |
|
Experimental |
| 120 |
yulinliu101/ASR_ATC
speech recognition system to transcribe ATC voice data |
|
Experimental |
| 121 |
AmirHoseein99/Persian_ASR
a ASR(automatic speech recognition) model for Persian language based on... |
|
Experimental |
| 122 |
jqi41/Subrank
ICASSP 2020 |
|
Experimental |
| 123 |
zhaoyi2/Classical-Speech-Algorithms
Classical speech recognition and speaker recognition algorithms |
|
Experimental |
| 124 |
Slothologist/AudioSegmenter
Segmentation of audio for a speech pipeline |
|
Experimental |
| 125 |
MML-Group/code4AVE-Speech
Source Code for AVE Speech Dataset |
|
Experimental |
| 126 |
atmehedi/Speech-to-text-in-Assamese
TASK ORIENTED DIALOG SYSTEM IN NATIVE LANGUAGE(ASSAMESE) |
|
Experimental |
| 127 |
opensource-spraakherkenning-nl/ASR_NL_results
Results of Dutch ASR models, collected by the community |
|
Experimental |
| 128 |
srvk/jsalt-2018-grounded-s2s
Grounded Sequence-to-Sequence Transduction Team at JSALT 2018 |
|
Experimental |
| 129 |
orbxball/timit-preprocessor
Extract mfcc vectors and phones from TIMIT dataset |
|
Experimental |
| 130 |
rbrigden/you-only-listen-once
Practical speech-only authentication system |
|
Experimental |
| 131 |
gheyret/UyghurASR
Uyghurche Aptomatik Awaz Tonush(Uyghur ASR) |
|
Experimental |
| 132 |
ccoreilly/deepspeech-catala
Deepspeech ASR Model for the Catalan Language |
|
Experimental |
| 133 |
fafilia/speech-to-text
TThis session is how a speech can be recognized by a computer and how a... |
|
Experimental |
| 134 |
acousticclown/Xenator
A Speech recognition system that runs basic Programming commands and gives output |
|
Experimental |
| 135 |
gheyret/uyghurasr_python
Uyghurche Aptomatik Awaz Tonush(Uyghur Automatic Speech Recognition)(ASR) |
|
Experimental |
| 136 |
itsrohanvj/Name-Recogniser
Activates when ever your name is uttered and sends you a mail. |
|
Experimental |
| 137 |
mrglaster/PySpeechRecognizer
Recognizes speech from .wav file |
|
Experimental |
| 138 |
Kabir5296/Kakatua-ASR
Official Training Module for IUT National ICT Fest 2024 Datathon:... |
|
Experimental |
| 139 |
asrajeh/deepspeech-arabic
End-to-End Arabic ASR using DeepSpeech engine |
|
Experimental |
| 140 |
LeoVarnet/fastACI
fastACI toolbox: the MATLAB toolbox for investigating auditory perception... |
|
Experimental |
| 141 |
pika-online/Foreign_Pronunciation_Generator_for_Code-Switch_ASR
a socket script to obtain chinese phones-sequence for any english word |
|
Experimental |
| 142 |
belambert/cl-asr
A (not entirely working) stand-alone speech recognizer written in Common Lisp |
|
Experimental |
| 143 |
AASHISHAG/deepspeech-swiss-german
Automatic Speech Recognition (ASR) - Swiss-German |
|
Experimental |
| 144 |
mariateleki/Comparing-ASR-Systems
Code for our INTERSPEECH 2024 paper: Comparing ASR Systems in the Context of... |
|
Experimental |
| 145 |
BenyaminZojaji/speech_recognition
Speech Recognition Assignments. |
|
Experimental |
| 146 |
yehuohan/ln-asr
Automatic Speech Recognition |
|
Experimental |
| 147 |
Omitg24/IIS-ASR
Repositorio para Administración de Sistemas y Redes (ASR), asignatura del... |
|
Experimental |
| 148 |
msalhab96/AraSpot
The official implementation of the AraSpot research paper |
|
Experimental |
| 149 |
yjg30737/pyqt_speech_recognition
PyQt speech recognition demonstrating example (using pydub and... |
|
Experimental |
| 150 |
Sheldon1999/speech-recognition
a device-state based speech recognition script |
|
Experimental |
| 151 |
pprattis/automatic-speech-recognision-system-ASR
A python script that implements an automatic speech recognision system. |
|
Experimental |
| 152 |
Aashish1106/Speech_Recognition
Real-Time Speech Recognition |
|
Experimental |
| 153 |
marlenezw/speech-to-text
Turn any video or audio recording into a written transcript using python |
|
Experimental |
| 154 |
csalt-research/OpenASR-py
Minimal toolkit for end-to-end automatic speech recognition and related... |
|
Experimental |
| 155 |
tafaust/pyCrow
Python 3.6 Speech Recognition Framework |
|
Experimental |
| 156 |
AlexDolch/Project-8-Real_Time_Translation
Data Science Bootcamp @WBS Coding School |
|
Experimental |
| 157 |
ys2843/speech-recognition-camera-app
Desktop camera app based on CMU Sphinx |
|
Experimental |
| 158 |
thekripaverse/Speech-Recognition-System-using-Python
A Python-based speech recognition system that converts spoken audio into... |
|
Experimental |
| 159 |
vyahello/speech-recogniser
🤖 Speech recognition program (python) |
|
Experimental |
| 160 |
gchrupala/analyzing-analytical-methods
Code for the paper "Analyzing analytical methods"... |
|
Experimental |
| 161 |
remarkablemark/speech-recognition-demo
Speech recognition demo. |
|
Experimental |