wav2vec2-live and wav2vec2-live-japanese-translator

The two tools are complements, with the Japanese translator leveraging the underlying live speech recognition capability to extend its functionality to a specific language and application.

Maintenance 0/25
Adoption 10/25
Maturity 16/25
Community 20/25
Maintenance 0/25
Adoption 7/25
Maturity 16/25
Community 8/25
Stars: 378
Forks: 58
Downloads:
Commits (30d): 0
Language: Python
License: MIT
Stars: 39
Forks: 3
Downloads:
Commits (30d): 0
Language: Jupyter Notebook
License: GPL-3.0
Stale 6m No Package No Dependents
Stale 6m No Package No Dependents

About wav2vec2-live

oliverguhr/wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Provides real-time streaming speech recognition by continuously processing microphone input through any wav2vec2 model from Hugging Face, with configurable audio devices and per-inference timing metrics. The architecture uses PyAudio for live audio capture and runs inference asynchronously, returning recognized text alongside processing latency and sample duration for performance monitoring.

About wav2vec2-live-japanese-translator

ttop32/wav2vec2-live-japanese-translator

real time japanese speech recognition translator using wav2vec2

Related comparisons

Scores updated daily from GitHub, PyPI, and npm data. How scores work