yandexdataschool/speech_course
YSDA course in Speech Processing.
Covers practical speech processing tasks spanning DSP fundamentals, discriminative models (VAD, SED, keyword spotting, speaker verification), automatic speech recognition with CTC/RNN-T decoding and streaming inference, text-to-speech synthesis with acoustic modeling and neural vocoders, and signal enhancement (noise reduction, beamforming). Emphasizes hands-on implementation through weekly seminars and graded assignments, with code examples for mel-spectrogram extraction, contrastive loss training (ECAPA-TDNN), and real-time ASR pipelines.
319 stars.
Stars
319
Forks
101
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/yandexdataschool/speech_course"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
Picovoice/rhino
On-device Speech-to-Intent engine powered by deep learning
MycroftAI/adapt
Adapt Intent Parser
IBM/BigLittleNet
Official repository for Big-Little Net
espnet/interspeech2019-tutorial
INTERSPEECH 2019 Tutorial Materials
Picovoice/speech-to-intent-benchmark
benchmark for Speech-to-Intent engines