nitaiaharoni1/whisper-speech-to-text
Whisper Speech-to-Text is a JavaScript library for recording and transcribing user audio into text via OpenAI's Whisper, intended for web applications.
Provides real-time transcription with pause/resume control during recording sessions. Uses the Web Audio API to capture microphone input and streams audio to OpenAI's Whisper endpoint via callback-based async methods. Handles microphone permissions automatically, making it suitable for embedding directly into React, Vue, or vanilla JavaScript web applications without additional permission management.
No commits in the last 6 months. Available on npm.
Stars
33
Forks
8
Language
TypeScript
License
MIT
Category
Last pushed
May 06, 2023
Commits (30d)
0
Dependencies
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/nitaiaharoni1/whisper-speech-to-text"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
kurianbenoy/whisper_normalizer
A python package for whisper normalizer
Kieirra/murmure
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
pavelzbornik/whisperX-FastAPI
FastAPI service on top of WhisperX