wav2vec2-live and wav2vec2-live-japanese-translator
The two tools are complementary: wav2vec2-live provides general-purpose live speech recognition, and the Japanese translator builds on that capability for a specific language and application.
About wav2vec2-live
oliverguhr/wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
Provides real-time streaming speech recognition by continuously processing microphone input through any wav2vec2 model from Hugging Face, with configurable audio devices and per-inference timing metrics. The architecture uses PyAudio for live audio capture and runs inference asynchronously, returning recognized text alongside processing latency and sample duration for performance monitoring.
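The capture-then-infer flow described above can be illustrated with a small producer/consumer sketch. This is not the project's code: it uses only the Python standard library, `fake_transcribe` is a hypothetical stand-in for a wav2vec2 forward pass, and the names `asr_worker` and `run_pipeline` are invented for illustration. In a live setup, the chunks would presumably come from a PyAudio input stream rather than a list.

```python
import queue
import threading
import time

SAMPLE_RATE = 16_000  # assumption: 16 kHz mono audio, typical for wav2vec 2.0 checkpoints


def fake_transcribe(samples):
    """Hypothetical stand-in for a wav2vec2 forward pass plus decoding."""
    return f"<{len(samples)} samples>"


def asr_worker(audio_q, result_q, transcribe):
    """Consume audio chunks and emit (text, sample_seconds, inference_seconds)."""
    while True:
        samples = audio_q.get()
        if samples is None:  # sentinel: capture has stopped
            result_q.put(None)
            break
        started = time.perf_counter()
        text = transcribe(samples)
        inference_seconds = time.perf_counter() - started
        sample_seconds = len(samples) / SAMPLE_RATE
        result_q.put((text, sample_seconds, inference_seconds))


def run_pipeline(chunks, transcribe=fake_transcribe):
    """Feed chunks to a background worker and collect its timed results."""
    audio_q = queue.Queue()
    result_q = queue.Queue()
    worker = threading.Thread(
        target=asr_worker, args=(audio_q, result_q, transcribe), daemon=True
    )
    worker.start()
    for chunk in chunks:  # a live version would read these from the microphone
        audio_q.put(chunk)
    audio_q.put(None)
    results = []
    while (item := result_q.get()) is not None:
        results.append(item)
    worker.join()
    return results


if __name__ == "__main__":
    one_second = [0.0] * SAMPLE_RATE
    for text, sample_s, infer_s in run_pipeline([one_second, one_second[:8000]]):
        print(f"{sample_s:.2f} s audio, {infer_s * 1000:.2f} ms inference: {text}")
```

Keeping inference on its own thread means audio capture is never blocked by a slow model call, and comparing the two timings shows whether recognition keeps up with real time (inference time should stay below sample duration).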
About wav2vec2-live-japanese-translator
ttop32/wav2vec2-live-japanese-translator
A real-time Japanese speech recognition translator using wav2vec2.