yufan-aslp/AliMeeting

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.

34
/ 100
Emerging

Provides modular baseline recipes for both ASR and speaker diarization tracks, with integrated voice activity detection (VAD) pipelines that generate RTTM outputs for diarization error rate (DER) evaluation. Supports training of both single-speaker and multi-speaker ASR models on multi-channel meeting audio, with character error rate (CER) as the evaluation metric. Built around the AliMeeting dataset and designed for reproducibility on the CodaLab evaluation platform.

135 stars. No commits in the last 6 months.

No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 16 / 25

How are scores calculated?

Stars

135

Forks

18

Language

Python

License

Last pushed

Jun 10, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/yufan-aslp/AliMeeting"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.