eigenpunk/ComfyUI-audio

some generative audio tools for ComfyUI

46
/ 100
Emerging

Provides multiple specialized generative audio models (Tacotron2, VALL-E X, Tortoise, MusicGen, AudioGen) as ComfyUI nodes, enabling text-to-speech, text-to-music, and audio continuation workflows. Wraps established research implementations (NVIDIA's Tacotron2, Meta's AudioCraft, community forks) with audio utility nodes for conversion and export. Targets GPU-accelerated inference on CUDA 12.1/11.8, though primarily tested on Linux.

101 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

101

Forks

20

Language

Python

License

GPL-3.0

Last pushed

Aug 10, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/eigenpunk/ComfyUI-audio"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.