jim-schwoebel/voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Covers the full voice ML pipeline from audio collection and feature extraction through modeling, generation, and visualization, with dedicated chapters on server architecture design and ethical considerations. Dependencies like FFmpeg and SoX are pre-configured via setup.py, supporting cross-platform audio processing. Complements the broader NeuroLex ecosystem, particularly the Allie framework for building production voice ML models.
388 stars. No commits in the last 6 months.
Stars
388
Forks
86
Language
Python
License
Apache-2.0
Category
Last pushed
Dec 08, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/jim-schwoebel/voicebook"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
voicegain/platform
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
davidamacey/OpenTranscribe
Self-hosted AI-powered transcription platform with speaker diarization, search, and...
aws-samples/amazon-transcribe-live-call-analytics
Amazon Transcribe Live Call Analytics (LCA) Sample Solution
SamirPaulb/real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while...
bensonruan/Chrome-Web-Speech-API
Chrome Web Speech API