Uberi/speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

/ 100

Verified

Provides a unified interface abstraction over 12+ speech recognition backends (Google, Azure, IBM, OpenAI Whisper, Vosk, CMU Sphinx, etc.), enabling developers to swap engines without rewriting code. Handles audio acquisition from microphone or file input, applies preprocessing like noise calibration and energy thresholding, and supports both cloud APIs and local offline models. Includes hotword detection (Snowboy) and optional language pack customization for multilingual support across backends.

8,959 stars. Used by 18 other packages. Actively maintained with 62 commits in the last 30 days. Available on PyPI.

Maintenance 25 / 25

Adoption 15 / 25

Maturity 25 / 25

Community 25 / 25

How are scores calculated?

Stars

8,959

Forks

2,434

Language

Python

License

BSD-3-Clause

Featured in

Things AI Won't Tell You About Building a Voice App Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Related tools

cmusphinx/pocketsphinx

A small speech recognizer

istupakov/onnx-asr

A lightweight Python package for Automatic Speech Recognition using ONNX models

tensorflow/lingvo

Lingvo

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models,...

PyThaiNLP/pythaiasr

Python Thai Automatic Speech Recognition

Explore Voice AI Tools

All categories Trending Voice AI directory Insights