yandexdataschool/speech_course

YSDA course in Speech Processing.

/ 100

Established

Covers practical speech processing tasks spanning DSP fundamentals, discriminative models (VAD, SED, keyword spotting, speaker verification), automatic speech recognition with CTC/RNN-T decoding and streaming inference, text-to-speech synthesis with acoustic modeling and neural vocoders, and signal enhancement (noise reduction, beamforming). Emphasizes hands-on implementation through weekly seminars and graded assignments, with code examples for mel-spectrogram extraction, contrastive loss training (ECAPA-TDNN), and real-time ASR pipelines.

319 stars.

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 24 / 25

How are scores calculated?

Stars

319

Forks

101

Language

Jupyter Notebook

License

MIT

Related tools

Picovoice/rhino

On-device Speech-to-Intent engine powered by deep learning

MycroftAI/adapt

Adapt Intent Parser

IBM/BigLittleNet

Official repository for Big-Little Net

espnet/interspeech2019-tutorial

INTERSPEECH 2019 Tutorial Materials

Picovoice/speech-to-intent-benchmark

benchmark for Speech-to-Intent engines

Explore Voice AI Tools

All categories Trending Voice AI directory Insights