MuGuiLin/VoiceDictation
迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。
Implements streaming speech-to-text via WebSocket API with real-time result delivery, supporting multiple languages and dialects (Mandarin, Cantonese, Sichuan) with configurable auto-close timeouts. Built as an npm package for browser environments, it leverages Web Audio API for microphone access and provides lifecycle callbacks (onWillStatusChange, onTextChange, onError) to enable interactive UI feedback during recognition.
137 stars. No commits in the last 6 months.
Stars
137
Forks
35
Language
JavaScript
License
—
Category
Last pushed
Sep 09, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/MuGuiLin/VoiceDictation"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TalAter/annyang
💬 Speech recognition for your site
Picovoice/web-voice-processor
A library for real-time voice processing in web browsers
EddyVerbruggen/nativescript-speech-recognition
:speech_balloon: Speech to text, using the awesome engines readily available on the device.
sdkcarlos/artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library....
evancohen/sonus
:speech_balloon: /so.nus/ STT (speech to text) for Node with offline hotword detection