oliverguhr/wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Score: 46 / 100 (Emerging)

Provides real-time streaming speech recognition by continuously processing microphone input through any wav2vec2 model from Hugging Face, with configurable audio devices and per-inference timing metrics. The architecture uses PyAudio for live audio capture and runs inference asynchronously, returning recognized text alongside processing latency and sample duration for performance monitoring.
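The capture-then-infer flow described above can be sketched with the standard library alone. This is a minimal illustration, not the project's actual code: `recognize_stream`, `capture`, `infer`, and the `transcribe` callable are hypothetical names, and the audio source is any iterable of raw byte chunks standing in for PyAudio input. The 16 kHz, 16-bit mono format is also an assumption used only to compute sample duration.

```python
import queue
import threading
import time

SAMPLE_RATE = 16_000      # assumption: 16 kHz mono input
BYTES_PER_SAMPLE = 2      # assumption: 16-bit PCM
_DONE = object()          # sentinel marking the end of the stream


def recognize_stream(chunks, transcribe):
    """Yield (text, sample_seconds, inference_seconds) for each audio chunk.

    `chunks` is any iterable of raw audio bytes (a stand-in for a microphone).
    `transcribe` is a hypothetical callable that maps bytes to text
    (a stand-in for a wav2vec2 model).
    """
    audio_q = queue.Queue()
    result_q = queue.Queue()

    def capture():
        # Producer: push audio chunks as they arrive.
        for chunk in chunks:
            audio_q.put(chunk)
        audio_q.put(_DONE)

    def infer():
        # Consumer: run the model off the caller's thread and time each call.
        try:
            while True:
                chunk = audio_q.get()
                if chunk is _DONE:
                    return
                start = time.perf_counter()
                text = transcribe(chunk)
                inference_seconds = time.perf_counter() - start
                sample_seconds = len(chunk) / (SAMPLE_RATE * BYTES_PER_SAMPLE)
                result_q.put((text, sample_seconds, inference_seconds))
        finally:
            # Always signal completion so the caller never blocks forever.
            result_q.put(_DONE)

    threading.Thread(target=capture, daemon=True).start()
    threading.Thread(target=infer, daemon=True).start()

    while True:
        item = result_q.get()
        if item is _DONE:
            return
        yield item


if __name__ == "__main__":
    def fake_transcribe(chunk):
        # Placeholder for real model inference.
        return f"<{len(chunk)} bytes of audio>"

    fake_chunks = [bytes(32_000), bytes(16_000)]
    for text, sample_s, infer_s in recognize_stream(fake_chunks, fake_transcribe):
        print(f"{sample_s:.2f}s audio, {infer_s * 1000:.3f}ms inference: {text}")
```

Comparing inference time against sample duration per chunk shows whether recognition keeps up with real time: if inference regularly takes longer than the audio it covers, the audio queue will grow.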

378 stars. No commits in the last 6 months.

Stale (6m), No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25

Stars: 378
Forks: 58
Language: Python
License: MIT
Last pushed: Feb 04, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/oliverguhr/wav2vec2-live"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
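The same request can be made from Python with the standard library. This is a hedged sketch: `build_quality_url` and `fetch_quality` are hypothetical helper names, the category/owner/repo path layout is inferred only from the single URL shown above, and the response format is not documented here, so the body is returned as raw text rather than parsed.

```python
import urllib.parse
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL; the path layout is an assumption from one example."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category, owner, repo, timeout=10.0):
    """Return the raw response body as text (response schema is not assumed)."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")


if __name__ == "__main__":
    print(fetch_quality("voice-ai", "oliverguhr", "wav2vec2-live"))
```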