Renovamen/Speech-and-Text
Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Provides flexible input/output options for speech processing—supporting both real-time microphone capture and file-based audio, with three distinct speech-to-text backends (cloud APIs from Baidu and iFlytek plus offline CMU PocketSphinx). Architecture leverages platform-specific dependencies like PyAudio for microphone access and integrates with SpeechRecognition library, while text-to-speech relies on pyttsx3 with cross-platform TTS engine abstraction.
341 stars. No commits in the last 6 months.
Stars
341
Forks
78
Language
Python
License
—
Category
Last pushed
Jun 03, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Renovamen/Speech-and-Text"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
speechmatics/speechmatics-python
Python library and CLI for Speechmatics
gooofy/py-nltools
A collection of basic python modules for spoken natural language processing
IBM/MAX-Speech-to-Text-Converter
Converts spoken words into text form.
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition,...
snakers4/open_stt
Open STT