microsoft/UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Score: 47 / 100 (Emerging)

Unifies self-supervised and supervised learning across multiple speech tasks (ASR, speaker recognition, speech enhancement) through variants such as WavLM and UniSpeech-SAT, which incorporate speaker-aware pre-training and intermediate layer supervision. Pre-training data scales from 960 hours (LibriSpeech) to 94k hours drawn from Libri-Light, GigaSpeech, and VoxPopuli, with multilingual support for English, French, Spanish, and Italian. Pre-trained models can be loaded through Hugging Face and fine-tuned on downstream tasks.
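As an illustration of the Hugging Face loading path mentioned above, here is a minimal sketch that extracts frame-level features with a WavLM checkpoint. It assumes the `transformers` and `torch` packages are installed and that the `microsoft/wavlm-base-plus` checkpoint is the one you want; the helper names `make_silence` and `extract_features` are hypothetical and not part of the repository.

```python
# Sketch only: the checkpoint name and helper functions are assumptions for
# illustration, not something this page or the repository prescribes.
CHECKPOINT = "microsoft/wavlm-base-plus"
SAMPLE_RATE = 16_000  # WavLM checkpoints expect 16 kHz mono audio


def make_silence(seconds: float, sample_rate: int = SAMPLE_RATE) -> list:
    """Return a silent waveform as a placeholder for real audio samples."""
    return [0.0] * int(seconds * sample_rate)


def extract_features(waveform, checkpoint: str = CHECKPOINT):
    """Run a waveform through WavLM and return the last hidden states.

    Heavy dependencies are imported lazily so that importing this module
    does not require them. The first call downloads the checkpoint.
    """
    import torch
    from transformers import AutoFeatureExtractor, WavLMModel

    extractor = AutoFeatureExtractor.from_pretrained(checkpoint)
    model = WavLMModel.from_pretrained(checkpoint)
    model.eval()

    # Convert raw samples into the tensor format the model expects.
    inputs = extractor(waveform, sampling_rate=SAMPLE_RATE, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    # Tensor of shape (batch, frames, hidden_size).
    return outputs.last_hidden_state
```

Calling `extract_features(make_silence(1.0))` returns one feature vector per audio frame. For fine-tuning, the same checkpoint can be loaded with a task-specific head instead of the bare encoder.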

479 stars. No commits in the last 6 months.

Stale (6m), No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 21 / 25


Stars: 479
Forks: 76
Language: Python
License: (not listed)
Last pushed: Apr 05, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/microsoft/UniSpeech"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
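The same request can be made from Python using only the standard library. This is a sketch under two assumptions not confirmed by this page: that the URL path segments are category, owner, and repository (inferred from the curl example), and that the endpoint returns JSON. The function names are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the segment meanings are inferred, not documented."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data for one repository.

    Assumes the response body is JSON. Raises urllib.error.HTTPError on a
    non-2xx status, for example when the daily request limit is exceeded.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

For example, `fetch_quality("voice-ai", "microsoft", "UniSpeech")` requests the same URL as the curl command. Keyless access is limited to 100 requests/day, so cache results if you poll regularly.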