oliverguhr/wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Emerging

Provides real-time streaming speech recognition by continuously processing microphone input through any wav2vec2 model from Hugging Face, with configurable audio devices and per-inference timing metrics. The architecture uses PyAudio for live audio capture and runs inference asynchronously, returning recognized text alongside processing latency and sample duration for performance monitoring.
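The capture-then-infer loop described above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the repository's actual code: `fake_microphone`, `fake_recognize`, `asr_worker`, and `run_demo` are hypothetical names, the microphone is simulated with silent byte buffers in place of PyAudio, the recognizer is a stub standing in for a wav2vec2 model, and the 16 kHz, 16-bit mono format is an assumption.

```python
import queue
import threading
import time

SAMPLE_RATE = 16_000   # assumed sample rate in Hz
BYTES_PER_SAMPLE = 2   # assumed 16-bit mono PCM


def fake_microphone(audio_queue, n_chunks, chunk_ms=500):
    """Hypothetical stand-in for a PyAudio capture loop: pushes silent PCM chunks."""
    n_bytes = SAMPLE_RATE * chunk_ms // 1000 * BYTES_PER_SAMPLE
    for _ in range(n_chunks):
        audio_queue.put(bytes(n_bytes))
    audio_queue.put(None)  # sentinel: no more audio


def fake_recognize(chunk):
    """Hypothetical stub standing in for wav2vec2 inference."""
    return f"<{len(chunk)} bytes of audio>"


def asr_worker(audio_queue, result_queue):
    """Consume chunks and emit (text, sample_length_s, inference_time_s)."""
    while True:
        chunk = audio_queue.get()
        if chunk is None:
            result_queue.put(None)  # forward the sentinel to the consumer
            break
        sample_length = len(chunk) / (SAMPLE_RATE * BYTES_PER_SAMPLE)
        start = time.perf_counter()
        text = fake_recognize(chunk)
        inference_time = time.perf_counter() - start
        result_queue.put((text, sample_length, inference_time))


def run_demo(n_chunks=3):
    """Run capture and recognition on separate threads and collect the results."""
    audio_queue, result_queue = queue.Queue(), queue.Queue()
    threads = [
        threading.Thread(target=fake_microphone, args=(audio_queue, n_chunks)),
        threading.Thread(target=asr_worker, args=(audio_queue, result_queue)),
    ]
    for t in threads:
        t.start()
    results = []
    while (item := result_queue.get()) is not None:
        results.append(item)
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    for text, sample_length, inference_time in run_demo():
        print(f"{sample_length:.2f}s audio, {inference_time:.6f}s inference: {text}")
```

In a real setup, the capture thread would presumably read from the selected input device and the stub would be replaced by a call into a wav2vec2 model; the queue between them is what keeps recording from blocking while inference runs.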

378 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

Stars

378

Forks

Language

Python

License

MIT

Compare

wav2vec2-live and wav2asr
wav2vec2-live and wav2vec2-live-japanese-translator

Higher-rated alternatives

liangstein/Chinese-speech-to-text

Chinese speech-to-text using WaveNet

louiskirsch/speechT

Open-source speech-to-text software written in TensorFlow

Open-Speech-EkStep/vakyansh-models

Open source speech to text models for Indic Languages

Open-Speech-EkStep/vakyansh-wav2vec2-experimentation

Repository containing an experimentation platform for training and running inference on wav2vec2 models.

silversparro/wav2letter.pytorch

A fully convolutional network for speech-to-text, built on PyTorch.
