Keyword Speech Recognition Voice AI Tools
Machine learning models for recognizing isolated spoken words/commands from audio using CNNs, RNNs, and neural networks. Does NOT include continuous speech-to-text ASR, end-to-end speech recognition pipelines, or general audio classification beyond single-word detection.
There are 112 keyword speech recognition tools tracked. The highest-rated is awsaf49/audio_classification_models at 44/100 with 13 stars and 332 monthly downloads.
Get all 112 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=keyword-speech-recognition&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
awsaf49/audio_classification_models
Tensorflow Audio Classification Models |
|
Emerging |
| 2 |
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine |
|
Emerging |
| 3 |
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files |
|
Emerging |
| 4 |
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow) |
|
Emerging |
| 5 |
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux. |
|
Emerging |
| 6 |
shenasa-ai/speech2text
A Deep-Learning-Based Persian Speech Recognition System |
|
Emerging |
| 7 |
subho406/TF-Speech-Recognition-Challenge-Solution
Source code of the model used in Tensorflow Speech Recognition Challenge... |
|
Emerging |
| 8 |
xxbb1234021/speech_recognition
中文语音识别 |
|
Emerging |
| 9 |
MohammedRashad/FPGA-Speech-Recognition
Expiremental Speech Recognition System using VHDL & MATLAB. |
|
Emerging |
| 10 |
stefantaubert/mel-cepstral-distance
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral... |
|
Emerging |
| 11 |
felixchenfy/Speech-Commands-Classification-by-LSTM-PyTorch
Classification of 11 types of audio clips using MFCCs features and LSTM.... |
|
Emerging |
| 12 |
kamilc/speech-recognition
Companion repository for the blog article:... |
|
Emerging |
| 13 |
AkojimaSLP/Beamforming-for-speech-enhancement
simple delaysum, MVDR and CGMM-MVDR |
|
Emerging |
| 14 |
tugstugi/pytorch-speech-commands
Speech commands recognition with PyTorch | Kaggle 10th place solution in... |
|
Emerging |
| 15 |
supikiti/PNCC
A implementation of Power Normalized Cepstral Coefficients: PNCC |
|
Emerging |
| 16 |
Sciss/SpeechRecognitionHMM
Exported from... |
|
Emerging |
| 17 |
zhihanyang2022/gender-audio-classification
A speaker gender classifier. MFC feature engineering and a pre-trained... |
|
Emerging |
| 18 |
SkyDocs/speaker-identification
Speaker Identification using Neural Net. |
|
Emerging |
| 19 |
hamzaehsan97/Speech_Recognition_CNN
CNN (Convolutional Neural Networks) Speech Recognition |
|
Emerging |
| 20 |
wblgers/hmm_speech_recognition_demo
A demo for simple isolated Chinese speech word recognition using GMMHMM in Python |
|
Experimental |
| 21 |
lucko515/Speech-commands-recognition
Recognizing common speech commands using Keras and Tensorflow. |
|
Experimental |
| 22 |
yh1008/speech-to-text
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras |
|
Experimental |
| 23 |
cosmoquester/speech-recognition
Develop speech recognition models with Tensorflow 2 |
|
Experimental |
| 24 |
AkojimaSLP/Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming
speech-enhacement |
|
Experimental |
| 25 |
Ralireza/spoken-digit-recognition
Classifying English spoken digit by Hidden Markov Model |
|
Experimental |
| 26 |
placebokkk/e6870
assignments for e6870 ASR class |
|
Experimental |
| 27 |
msalhab96/SpeeQ
A framework for automatic speech recognition |
|
Experimental |
| 28 |
creafz/kaggle-speech-recognition
Solution for TensorFlow Speech Recognition Challenge on Kaggle (125th place, top 10%) |
|
Experimental |
| 29 |
gogyzzz/beamformit_matlab
A MATLAB implementation of CHiME4 baseline Beamformit |
|
Experimental |
| 30 |
HristovB/Speech_Recognition_Macedonian
Speech recognition model for recognising Macedonian spoken language. |
|
Experimental |
| 31 |
Pooventhiran/VSR
Speaker-Independent Speech Recognition using Visual Features |
|
Experimental |
| 32 |
arthurfortes/speech2text_keras
This repository reports how to build a speech to text model to recognize... |
|
Experimental |
| 33 |
ace19-dev/tensorflow-speech-recognition-challenge
Kaggle Competitions: TensorFlow Speech Recognition Challenge |
|
Experimental |
| 34 |
ShihabYasin/Isolated-Bengali-Word-and-Speaker-Recognition.
Isolated Bengali word and speaker recognition. |
|
Experimental |
| 35 |
ShoYamanishi/AndroidMFCC
26-Point MFCC & 512-Point FFT Generator & Visualizer in Java, C++, and NEON... |
|
Experimental |
| 36 |
zhongyuchen/speech-classification
CNN and VGG speech classification with interactive website for testing |
|
Experimental |
| 37 |
super13/tensorflow-speech-recognition-pai
Speech recognition using tensorflow in aliyun pai. |
|
Experimental |
| 38 |
vinbhaskara/Digit-Speech-Recognition
Using MFCC features on Speech Signals to classify Digits after matching... |
|
Experimental |
| 39 |
TCL606/Speech-Number-Recognition
基于数字信号处理的语音数字识别器 |
|
Experimental |
| 40 |
backpropper/DNN-Activation-Brain
Code repository for Dissecting the DNN Brain for a Better Insight (ICASSP 2016) |
|
Experimental |
| 41 |
PiasRoY/Bangla-Spoken-Number-Recognition
recognizing spoken Bangla numbers using MFCCs and CNN. |
|
Experimental |
| 42 |
guglielmocamporese/learning_invariances_in_speech_recognition
In this work I investigate the speech command task developing and analyzing... |
|
Experimental |
| 43 |
JaesungBae/Speech-Command-Recognition-with-Capsule-Network
Speech command recognition with capsule network & various NNs / KWS on... |
|
Experimental |
| 44 |
theawless/sr-lib
Automatic Speech Recognition library for my BTech Project. |
|
Experimental |
| 45 |
gtiwari333/speech-recognition-java-hidden-markov-model-vq-mfcc
Automatically exported from... |
|
Experimental |
| 46 |
timkrebs/VoiceDetection
Speech Recognition implementation with MFCC and HMM |
|
Experimental |
| 47 |
aishoot/DTWSpeech
A simple application of DTW Algorithm in isolate word speech recognition. |
|
Experimental |
| 48 |
common-voice/our-voices-model-competition
Our Voices Competition |
|
Experimental |
| 49 |
saztorralba/CNNWordReco
Code and scripts for training and testing isolated spoken word recognition... |
|
Experimental |
| 50 |
seyedsaleh/persian-speech-recognition
Simple word recognition using CNN on Raspberry Pi board 🗣 |
|
Experimental |
| 51 |
trungd/speech-recognition
experimental speech recognition library in tensorflow |
|
Experimental |
| 52 |
orbxball/DSP
2016 Autumn (105-1) -- Fundamentals of Digital Speech Signal Processing |
|
Experimental |
| 53 |
shitian-ni/speech-recognition-transfer-learning
Speech command recognition DenseNet transfer learning from UrbanSound8k in... |
|
Experimental |
| 54 |
aleksandarbos/Sound-Recognition-Convo2D-Neural-Network
Tools: Python (OpenCV 3.0 + Keras lib-Convolution 2D Neural Network). Desc:... |
|
Experimental |
| 55 |
rwightman/pytorch-commands
Some PyTorch code for the Kaggle Speech Recognition Challenge |
|
Experimental |
| 56 |
verrannt/snn_speechrec
Convolutional Spiking Neural Network to recognize speech utterances using... |
|
Experimental |
| 57 |
mhagglun/Speech-Recognition
Tensorflow implementation for Speech Recognition using Convolutional Neural... |
|
Experimental |
| 58 |
sangramsingnk/Audio-Feature-Extraction
In sound processing, the mel-frequency cepstrum (MFC) is a representation of... |
|
Experimental |
| 59 |
ameroyer/SIC
(SIC) Similarity by Iterative Classifications, ICASSP 2016 |
|
Experimental |
| 60 |
aminul-huq/Speech-Command-Classification
Speech command classification on Speech-Command v0.02 dataset using PyTorch... |
|
Experimental |
| 61 |
Lhx94As/Awesome-Spoken-Language-Identification
An awesome spoken LID repository. (Working in progress |
|
Experimental |
| 62 |
anicolson/matlab_feat
Functions for creating speech features in MATLAB. |
|
Experimental |
| 63 |
popcornell/MicRank
MicRank is a Learning to Rank neural channel selection framework where a DNN... |
|
Experimental |
| 64 |
zssloth/TF-Speech-Recognition
Speech Recognition Using Tensorflow |
|
Experimental |
| 65 |
codersinthestorm/RecurrentNN_SpeechRecognition
A model based in Tensorflow to recognize words from the 30 word Speech... |
|
Experimental |
| 66 |
rwightman/tensorflow-speech_commands
Speech commands training/models from TF repo adapted for speech commands Kaggle |
|
Experimental |
| 67 |
ivallesp/Xception1d
Xception1d implementation for audio categorization |
|
Experimental |
| 68 |
miguelangelnieto/DNN-Speech-Recognizer
Built a deep neural network that functions as part of an end-to-end... |
|
Experimental |
| 69 |
AmourWaltz/BayesLMs
Project of IEEE/ACM TASLP “Bayesian Neural Network Language Modeling for... |
|
Experimental |
| 70 |
cmaroti/speech_recognition
Convolutional Neural Network for Speech Recognition, implemented in Ms. Pacman game |
|
Experimental |
| 71 |
Pchambet/tp-hmm-markov
Markov Chains and Hidden Markov Models: weather modeling with discrete... |
|
Experimental |
| 72 |
techbd123/SpeechRecognition
Bengali Speech Recognition |
|
Experimental |
| 73 |
wvangansbeke/Audio-Speech
Build a cross-talk canceler and a speech recognizer |
|
Experimental |
| 74 |
YoungloLee/tf2-speech-recognition-las
Tensorflow 2 Speech Recognition Code (LAS) |
|
Experimental |
| 75 |
Erfanafshar/speech-gender-detection
An audio signal processing project that detects speaker gender from recorded... |
|
Experimental |
| 76 |
raminnakhli/HMM-DNN-Speech-Recognition
This repository is a Python implementation of HMM-DNN model. |
|
Experimental |
| 77 |
salehsargolzaee/Audio-Signal-Processing-and-Feature-Extraction
Feature extraction from audio signal (explained in Persian) |
|
Experimental |
| 78 |
sindhura-pv/lip-reading
In this project, visual speech recognition has been attempted using 2 major... |
|
Experimental |
| 79 |
vault-42/AIND_DNN_Speech_Recognizer
End-to-end speech to text recognition |
|
Experimental |
| 80 |
Amiannn/Simple-HmmGmm
Simple HMM implementation |
|
Experimental |
| 81 |
OldBonhart/TensorFlow_Speech_Recognition_Challenge
TensorFlow Speech Recognition Challenge -... |
|
Experimental |
| 82 |
FarzadForuozanfar/Speech-Recognition
I recorded 10 voices with the same words from myself and compared them with... |
|
Experimental |
| 83 |
type-a/speechnet
Automatic Speech Recognition |
|
Experimental |
| 84 |
YuriyGuts/gdg-speech-classifier
A machine learning system that recognizes the word 'Google' in human speech... |
|
Experimental |
| 85 |
gathrean/Nebula
Neural Network in Python trained for multi-musical instruments recognition. |
|
Experimental |
| 86 |
inspektral/audioMNIST-classifier
simple CNN on MFCC for Audio MNIST classification |
|
Experimental |
| 87 |
alainnguema/SpeechLangID-GMM
Ce projet implémente un système de détection de langue capable d'identifier... |
|
Experimental |
| 88 |
nilkanthshirodkar/Speech-Recognition-Using-HMM
Automatic Speech Recognition (ASR) system was implemented using the HMM... |
|
Experimental |
| 89 |
vinsis/speech-commands-recognition
Single word speech recognition using PyTorch |
|
Experimental |
| 90 |
IvanEvan/chinese-digital-speech-recognition
中文数字语音识别:识别类语音验证码的8位数字语音 |
|
Experimental |
| 91 |
uigiporc/icon-sr
Progetto di Ingegneria della conoscenza, autori: Porcelli Luigi, Nicolo Cucinotta. |
|
Experimental |
| 92 |
samuelebh/CNN-Spoken-Digit-Classifier
Repository containing Python code of a classifier that recognizes spoken... |
|
Experimental |
| 93 |
FandosA/Speech_Recognition_Keras_TF
Project I carried out during my Machine Learning course in the Master. |
|
Experimental |
| 94 |
SvenWientjes/SpeechRecognition
Classifying sound signals as Links, Midden or Rechts using features computed... |
|
Experimental |
| 95 |
kevobt/speech-to-text
Speech recognition framework using keras |
|
Experimental |
| 96 |
ragibson/MFCC-speech-recognition
Real-time speech recognition via "Mel-Frequency Cepstral Coefficients"... |
|
Experimental |
| 97 |
dannis999/trained_SpeechRecognition
此项目用于备份一个完整的中文语音识别环境,包括环境配置和预训练模型,以方便直接使用 |
|
Experimental |
| 98 |
YoungloLee/tf2-speech-recognition-transformer
Tensorflow 2 Speech Recognition Code (Transformer) |
|
Experimental |
| 99 |
samimoftheworld/Voice-Activity-Detection-FInal-Project-work
this repository concedes my project work done in my bachelors |
|
Experimental |
| 100 |
khaykingleb/research-playground
Efficient ML/DL implementations across multiple domains with K3s multi-node... |
|
Experimental |
| 101 |
mradovic38/dtw-speech-recognition
Speech recognition system that uses feature extraction and dynamic time... |
|
Experimental |
| 102 |
shun60s/Wave-DNN-likelihood
音声認識エンジンJuliusのディクテーションキットに含まれるDNN-HMMモデルを利用して対数尤度を計算するpython |
|
Experimental |
| 103 |
belambert/cl-mfcc
MFCC feature computation |
|
Experimental |
| 104 |
briansm-github/shipping_recognition
Training/test data and code fror speech recognition experiments using UK... |
|
Experimental |
| 105 |
mohammadnabia/Speech-recognition-HMM
This project focuses on building a speech recognition system for the Farsi... |
|
Experimental |
| 106 |
yihong1120/Speech-Commands-Classification-LSTM
A TensorFlow project for classifying speech commands using LSTM neural... |
|
Experimental |
| 107 |
showman-sharma/speech_writing-recognition
We are given 2 different problems to solve. 1. Isolated spoken digit... |
|
Experimental |
| 108 |
hakula139/naive-speech-recognizer
A naive speech recognizer from scratch, written in Python 3 |
|
Experimental |
| 109 |
skyradez/Speech-Recognition-using-Convolutional-Neural-Network
Tutorial on Speech Recognition using Convolutional Neural Network |
|
Experimental |
| 110 |
VictorAtPL/Speech_Commands_Recognition_Bi_LSTM_with_Tensorflow_2
Neural Network with Bidirectional Long Short-Term Memory block for... |
|
Experimental |
| 111 |
trungrockyngo/GMM-speech-recognizer
Final project for CSCI 201 - Machine Learning |
|
Experimental |
| 112 |
g1y5x3/Speech_Phone_Detection
Recognize base phones (/a/, /u/, /i/) from a given speech and indicate the... |
|
Experimental |