open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

47
/ 100
Emerging

Amphion implements task-specific generation pipelines (TTS, SVC, voice conversion, text-to-audio) with unified architectural components including neural vocoders and standardized evaluation metrics for reproducibility. Built on PyTorch, it provides modular design supporting both classical and foundation models—such as Vevo (zero-shot voice imitation with prosody control) and MaskGCT (non-autoregressive TTS)—alongside large-scale datasets like Emilia (200k+ hours) for training. The toolkit integrates with Hugging Face and ModelScope, enabling seamless model sharing and deployment across speech generation tasks.

9,712 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

9,712

Forks

796

Language

Python

License

MIT

Last pushed

May 27, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/open-mmlab/Amphion"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.