yufan-aslp/AliMeeting

The project is associated with the recently launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) and provides participants with baseline systems for speech recognition and speaker diarization in conference scenarios.

/ 100

Emerging

Provides modular baseline recipes for both ASR and speaker diarization tracks, with integrated voice activity detection (VAD) pipelines that generate RTTM outputs for diarization error rate (DER) evaluation. Supports training of both single-speaker and multi-speaker ASR models on multi-channel meeting audio, with character error rate (CER) as the evaluation metric. Built around the AliMeeting dataset and designed for reproducibility on the CodaLab evaluation platform.
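As a rough illustration of the CER metric mentioned above, the sketch below computes character error rate as the Levenshtein distance between reference and hypothesis characters, divided by the reference length. It is not taken from the repository's scoring scripts: the function names `edit_distance` and `cer` are hypothetical, and stripping spaces before scoring is an assumption made for this example.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitution, insertion, deletion cost 1)."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference items to match an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character error rate: edit distance divided by the reference character count.

    Assumption for this sketch: spaces are removed before scoring.
    """
    ref_chars = list(reference.replace(" ", ""))
    hyp_chars = list(hypothesis.replace(" ", ""))
    if not ref_chars:
        raise ValueError("reference must contain at least one character")
    return edit_distance(ref_chars, hyp_chars) / len(ref_chars)


if __name__ == "__main__":
    # One substituted character out of eight reference characters.
    print(cer("今天开会讨论项目", "今天开会讨论项木"))
```

An actual evaluation may normalize text differently (punctuation, casing, overlapping speech), so scores from this sketch are not directly comparable to official challenge results.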

135 stars. No commits in the last 6 months.

No License | Stale 6m | No Package | No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 16 / 25

Stars

135

Forks

Language

Python

License

—

Higher-rated alternatives

byjlw/video-analyzer

Analyze videos using LLMs, Computer Vision and Automatic Speech Recognition

XnneHangLab/XnneHangLab

A subtitle extractor that can't chat is not a good Bilibili downloader~

harry0703/AudioNotes

Quickly extract audio and video content and organize it into structured Markdown notes

bakaburg1/minutemaker

Generate meeting minutes starting from an audio recording or a transcript using speech-to-text and LLMs.

kromme/Teams-Notetaker

Let AI create the notes of your Teams Meeting
