istupakov/onnx-asr
A lightweight Python package for Automatic Speech Recognition using ONNX models
Supports multiple modern ASR architectures (NeMo Conformer/Parakeet/Canary, GigaAM, Kaldi Icefall Zipformer, Whisper) with built-in preprocessing and greedy decoding, eliminating the need for external dependencies like PyTorch or FFmpeg. Runs on heterogeneous hardware via ONNX Runtime backends (CUDA, TensorRT, CoreML, DirectML, ROCm), from edge devices to GPUs, with VAD-based long-form recognition and token-level timestamps. Integrates with Hugging Face Hub for model distribution and accepts WAV files or NumPy arrays with automatic resampling.
281 stars and 20,143 monthly downloads. Available on PyPI.
Stars
281
Forks
27
Language
Python
License
MIT
Category
Last pushed
Mar 12, 2026
Monthly downloads
20,143
Commits (30d)
0
Dependencies
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/istupakov/onnx-asr"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Related tools
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
cmusphinx/pocketsphinx
A small speech recognizer
tensorflow/lingvo
Lingvo
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models,...
PyThaiNLP/pythaiasr
Python Thai Automatic Speech Recognition