IBM/MAX-Speech-to-Text-Converter
Converts spoken words into text form.
Packages Mozilla's DeepSpeech model (trained on Common Voice dataset) as a containerized REST API, handling automatic resampling of non-16kHz audio to match the model's expected input. Deploys via Docker, Kubernetes, or OpenShift with a Swagger interface for easy testing, and pre-built images available on Quay.io.
No commits in the last 6 months.
Stars
76
Forks
32
Language
Python
License
Apache-2.0
Category
Last pushed
Sep 17, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/IBM/MAX-Speech-to-Text-Converter"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
speechmatics/speechmatics-python
Python library and CLI for Speechmatics
gooofy/py-nltools
A collection of basic python modules for spoken natural language processing
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition,...
snakers4/open_stt
Open STT
Kini218/speech-to-text
Speech to text script on python