OpenBMB/UltraEval-Audio

Your faithful, impartial partner for audio evaluation: authentic evaluation that lets you know yourself and know your rivals.

/ 100

Emerging

Provides unified evaluation across 34 authoritative benchmarks spanning speech understanding and generation tasks (ASR, TTS, AST, audio codecs) in 10 languages and 12 task categories. Features isolated inference via IPC to prevent dependency conflicts, automated dataset acquisition and metric binding, and GPU-accelerated parallel evaluation with resume-from-checkpoint capabilities. Integrates with popular audio foundation models (Qwen3-Omni, GLM-4-Voice, VoxCPM, CosyVoice) through standardized replication documentation and one-click evaluation commands.
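The resume-from-checkpoint behavior mentioned above can be illustrated with a minimal sketch. This is not UltraEval-Audio's actual implementation; the function name `evaluate_with_resume`, the JSONL checkpoint layout, and the `id`/`prediction` fields are all assumptions chosen for illustration. The idea is to append each finished result to a file and skip any sample already recorded there when the run restarts.

```python
import json
import os


def evaluate_with_resume(samples, infer, checkpoint_path):
    """Run `infer` on each sample, skipping ids already stored in the checkpoint.

    Hypothetical helper: `samples` is a list of dicts with an "id" key,
    `infer` is any callable that maps a sample to a prediction.
    """
    done = {}

    # Load results saved by an earlier (possibly interrupted) run.
    if os.path.exists(checkpoint_path):
        with open(checkpoint_path, encoding="utf-8") as f:
            for line in f:
                line = line.strip()
                if line:
                    record = json.loads(line)
                    done[record["id"]] = record["prediction"]

    # Append new results one line at a time.
    with open(checkpoint_path, "a", encoding="utf-8") as f:
        for sample in samples:
            if sample["id"] in done:
                continue  # already evaluated in a previous run
            prediction = infer(sample)
            f.write(json.dumps({"id": sample["id"], "prediction": prediction}) + "\n")
            f.flush()  # hand each result to the OS before moving on
            done[sample["id"]] = prediction

    return done
```

Calling the function a second time with the same checkpoint path only runs inference on samples that were not finished before, which is the property that makes long benchmark runs restartable.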

281 stars.

No Package

No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 13 / 25

Stars

281

Forks

Language

Python

License

Apache-2.0

Featured in

Things AI Won't Tell You About Building a Voice App

Higher-rated alternatives

fgnt/meeteval

MeetEval - A meeting transcription evaluation toolkit

kahne/fastwer

A PyPI package for fast word/character error rate (WER/CER) calculation

tabahi/bournemouth-forced-aligner

Extract phoneme-level timestamps from speech audio.

readbeyond/aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka...

wq2012/SimpleDER

A lightweight library to compute Diarization Error Rate (DER).
