istupakov/onnx-asr

A lightweight Python package for Automatic Speech Recognition using ONNX models

66
/ 100
Established

Supports multiple modern ASR architectures (NeMo Conformer/Parakeet/Canary, GigaAM, Kaldi Icefall Zipformer, Whisper) with built-in preprocessing and greedy decoding, eliminating the need for external dependencies like PyTorch or FFmpeg. Runs on heterogeneous hardware via ONNX Runtime backends (CUDA, TensorRT, CoreML, DirectML, ROCm), from edge devices to GPUs, with VAD-based long-form recognition and token-level timestamps. Integrates with Hugging Face Hub for model distribution and accepts WAV files or NumPy arrays with automatic resampling.

281 stars and 20,143 monthly downloads. Available on PyPI.

Maintenance 13 / 25
Adoption 20 / 25
Maturity 18 / 25
Community 15 / 25

How are scores calculated?

Stars

281

Forks

27

Language

Python

License

MIT

Last pushed

Mar 12, 2026

Monthly downloads

20,143

Commits (30d)

0

Dependencies

2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/istupakov/onnx-asr"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.