metame-ai/awesome-audio-plaza
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
Curated collection aggregating papers, projects, datasets, and toolkits across nine audio domains—including audio encoding, speech translation, emotion recognition, audio separation, and voice omni-models. Content is automatically sourced from arXiv, Hugging Face Papers, GitHub trending, Papers with Code, and social media, then organized into category-specific markdown docs with surveys, implementations, and evaluation resources. Each domain includes dedicated sections for research papers, open-source projects, commercial products, and benchmark datasets to support both academic research and production deployment.
411 stars.
Stars
411
Forks
19
Language
—
License
MIT
Category
Last pushed
Nov 02, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/metame-ai/awesome-audio-plaza"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Spr-Aachen/Easy-Voice-Toolkit
A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.
ftyers/commonvoice-utils
Linguistic processing for Common Voice
alphacep/awesome-russian-speech
Russian speech technology links
microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
microsoft/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing