metame-ai/awesome-audio-plaza

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

/ 100

Emerging

Curated collection aggregating papers, projects, datasets, and toolkits across nine audio domains—including audio encoding, speech translation, emotion recognition, audio separation, and voice omni-models. Content is automatically sourced from arXiv, Hugging Face Papers, GitHub trending, Papers with Code, and social media, then organized into category-specific markdown docs with surveys, implementations, and evaluation resources. Each domain includes dedicated sections for research papers, open-source projects, commercial products, and benchmark datasets to support both academic research and production deployment.

411 stars.

No Package No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 11 / 25

How are scores calculated?

Stars

411

Forks

Language

—

License

MIT

Higher-rated alternatives

Spr-Aachen/Easy-Voice-Toolkit

A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.

ftyers/commonvoice-utils

Linguistic processing for Common Voice

alphacep/awesome-russian-speech

Russian speech technology links

microsoft/UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

microsoft/SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Explore Voice AI Tools

All categories Trending Voice AI directory Insights