istupakov/onnx-asr

A lightweight Python package for Automatic Speech Recognition using ONNX models


Established

Supports multiple modern ASR architectures (NeMo Conformer/Parakeet/Canary, GigaAM, Kaldi Icefall Zipformer, Whisper) with built-in preprocessing and greedy decoding, eliminating the need for external dependencies like PyTorch or FFmpeg. Runs on heterogeneous hardware via ONNX Runtime backends (CUDA, TensorRT, CoreML, DirectML, ROCm), from edge devices to GPUs, with VAD-based long-form recognition and token-level timestamps. Integrates with Hugging Face Hub for model distribution and accepts WAV files or NumPy arrays with automatic resampling.
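To illustrate what "greedy decoding" means for a CTC-style model, here is a minimal pure-Python sketch. It is not taken from onnx-asr: the function name `greedy_ctc_decode`, the toy vocabulary, and the frame scores are all hypothetical, and the package's own decoders (including those for transducer-style models) may work differently. The idea is to pick the highest-scoring token per frame, collapse consecutive repeats, and drop the blank token.

```python
from typing import List, Sequence


def greedy_ctc_decode(
    frame_scores: Sequence[Sequence[float]],
    vocab: Sequence[str],
    blank_id: int = 0,
) -> str:
    """Greedy CTC decoding over per-frame token scores (illustrative only)."""
    # Step 1: take the best-scoring token index for every frame.
    best_ids: List[int] = [
        max(range(len(frame)), key=frame.__getitem__) for frame in frame_scores
    ]

    # Step 2: collapse consecutive repeats, then drop blanks.
    tokens: List[str] = []
    previous = None
    for token_id in best_ids:
        if token_id != previous and token_id != blank_id:
            tokens.append(vocab[token_id])
        previous = token_id
    return "".join(tokens)


# Hypothetical toy data: index 0 is the CTC blank token.
VOCAB = ["<blank>", "h", "i"]
FRAMES = [
    [0.10, 0.80, 0.10],  # "h"
    [0.20, 0.70, 0.10],  # "h" again (repeat, collapsed)
    [0.90, 0.05, 0.05],  # blank
    [0.10, 0.10, 0.80],  # "i"
    [0.10, 0.20, 0.70],  # "i" again (repeat, collapsed)
]

if __name__ == "__main__":
    print(greedy_ctc_decode(FRAMES, VOCAB))
```

Because a blank frame resets the repeat check, the same token emitted on either side of a blank is kept twice, which is how CTC represents doubled letters.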

281 stars and 20,143 monthly downloads. Available on PyPI.

Maintenance 13 / 25

Adoption 20 / 25

Maturity 18 / 25

Community 15 / 25


Stars

281

Forks

Language

Python

License

MIT

Featured in

Things AI Won't Tell You About Building a Voice App

Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Related tools

Uberi/speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

cmusphinx/pocketsphinx

A small speech recognizer

tensorflow/lingvo

Lingvo

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models,...

PyThaiNLP/pythaiasr

Python Thai Automatic Speech Recognition
