gillesdemey/google-speech-v2

:speech_balloon: Reverse Engineering Google's Speech To Text API (v2)

40
/ 100
Emerging

Supports both FLAC (44.1kHz, 32-bit) and 16-bit PCM audio formats via direct HTTP POST requests to Google's endpoint, requiring proper Content-Type headers matching the audio encoding and sample rate. Returns JSON responses with transcription alternatives and optional confidence scores when Google's recognition certainty is below 100%. Note: Google has since released an official Cloud Speech API, making this reverse-engineered approach primarily useful for historical reference or environments where the official API isn't accessible.

470 stars. No commits in the last 6 months.

No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 22 / 25

How are scores calculated?

Stars

470

Forks

82

Language

License

Last pushed

Apr 18, 2017

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/gillesdemey/google-speech-v2"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.