ComfyUI-VibeVoice and ComfyUI-KugelAudio

These are competitors—both provide TTS capabilities to ComfyUI, with VibeVoice optimized for expressive conversational audio and KugelAudio focused on multilingual voice cloning, requiring users to choose one based on their specific language and expressiveness needs.

ComfyUI-VibeVoice

Established

ComfyUI-KugelAudio

Emerging

Maintenance 2/25

Adoption 10/25

Maturity 15/25

Community 23/25

Maintenance 10/25

Adoption 7/25

Maturity 1/25

Community 16/25

Stars: 563

Forks: 105

Downloads: —

Commits (30d): 0

Language: Python

License: MIT

Stars: 29

Forks: 7

Downloads: —

Commits (30d): 0

Language: Python

License: —

Stale 6m No Package No Dependents

No License No Package No Dependents

About ComfyUI-VibeVoice

wildminder/ComfyUI-VibeVoice

ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio

Integrates Microsoft's VibeVoice model directly into ComfyUI workflows for multi-speaker dialogue generation, supporting voice cloning via reference audio and hybrid zero-shot voice generation. Features 4-bit LLM quantization, multiple attention backends (eager/SDPA/Flash Attention/SageAttention), and automatic model management with configurable diffusion parameters for fine-grained control over speech synthesis.

About ComfyUI-KugelAudio

Saganaki22/ComfyUI-KugelAudio

🗣️ ComfyUI nodes for KugelAudi- Open-source text-to-speech with voice cloning for 24 European languages

Related comparisons

ComfyUI-VibeVoice and TTS-Audio-Suite ComfyUI-VibeVoice and VibeVoice-ComfyUI ComfyUI-VibeVoice and ComfyUI-VoxCPM ComfyUI-VibeVoice and ComfyUI-SparkTTS ComfyUI-VibeVoice and ComfyUI-MegaTTS ComfyUI-VibeVoice and ComfyUI-VoxCPMTTS

Scores updated daily from GitHub, PyPI, and npm data. How scores work