AIGC-Audio/AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

45
/ 100
Emerging

Integrates multiple specialized foundation models (Whisper, VITS, DiffSinger, Make-An-Audio, GeneFace) through a unified LLM interface, enabling chained audio tasks like speech recognition→style transfer→synthesis. Leverages LangChain and Hugging Face infrastructure to compose diverse audio/speech/singing/video generation pipelines from natural language prompts, with support for cross-modal tasks including image-to-audio and audio inpainting.

10,210 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

10,210

Forks

865

Language

Python

License

Last pushed

Jul 06, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/AIGC-Audio/AudioGPT"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.