OpenBMB/UltraEval-Audio
Your faithful, impartial partner for audio evaluation: truthful benchmarking, so you know yourself and know your rivals.
Provides unified evaluation across 34 authoritative benchmarks spanning speech understanding and generation tasks (ASR, TTS, AST, audio codecs) in 10 languages and 12 task categories. Features isolated inference via IPC to prevent dependency conflicts, automated dataset acquisition and metric binding, and GPU-accelerated parallel evaluation with resume-from-checkpoint capabilities. Integrates with popular audio foundation models (Qwen3-Omni, GLM-4-Voice, VoxCPM, CosyVoice) through standardized replication documentation and one-click evaluation commands.
Stars: 281
Forks: 21
Language: Python
License: Apache-2.0
Category:
Last pushed: Feb 03, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/OpenBMB/UltraEval-Audio"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
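The same request can be made from Python. The following is a minimal sketch using only the standard library, not an official client: the helper names `build_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path layout is inferred from the single curl example above, and the assumption that the endpoint returns a JSON body is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is
# assumed to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. Its schema is not described
    on this page, so the parsed object is returned unmodified.
    """
    url = build_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to hit the network.
    print(build_url("voice-ai", "OpenBMB", "UltraEval-Audio"))
```

Since keyless access is limited to 100 requests/day, it is worth caching results locally rather than calling `fetch_quality` in a loop.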
Higher-rated alternatives
fgnt/meeteval
MeetEval - A meeting transcription evaluation toolkit
kahne/fastwer
A PyPI package for fast word/character error rate (WER/CER) calculation
tabahi/bournemouth-forced-aligner
Extract phoneme-level timestamps from speech audio.
readbeyond/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka...
wq2012/SimpleDER
A lightweight library to compute Diarization Error Rate (DER).