tarun7r/SpeechAlgo
A Comprehensive Speech Processing Algorithms Library for research and production use
Implements 20 foundational algorithms across preprocessing, VAD, pitch detection, enhancement, and feature extraction using NumPy-based, type-annotated code with mathematical documentation. Built on SciPy and SoundFile, it provides both educational reference implementations and production-ready modules like YIN pitch estimation, spectral subtraction, and MFCC extraction with real-time capability for many algorithms.
Available on PyPI.
Stars
15
Forks
2
Language
Python
License
MIT
Category
Last pushed
Oct 25, 2025
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/tarun7r/SpeechAlgo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Higher-rated alternatives
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
cmusphinx/pocketsphinx
A small speech recognizer
istupakov/onnx-asr
A lightweight Python package for Automatic Speech Recognition using ONNX models
PyThaiNLP/pythaiasr
Python Thai Automatic Speech Recognition
haoheliu/voicefixer
General Speech Restoration