ComfyUI-VoxCPM and ComfyUI-KugelAudio

These are competitors—both provide text-to-speech with voice cloning capabilities for ComfyUI, but VoxCPM targets expressiveness and zero-shot cloning while KugelAudio emphasizes multilingual support across European languages.

ComfyUI-VoxCPM
47
Emerging
ComfyUI-KugelAudio
34
Emerging
Maintenance 6/25
Adoption 10/25
Maturity 15/25
Community 16/25
Maintenance 10/25
Adoption 7/25
Maturity 1/25
Community 16/25
Stars: 390
Forks: 42
Downloads:
Commits (30d): 0
Language: Python
License: Apache-2.0
Stars: 29
Forks: 7
Downloads:
Commits (30d): 0
Language: Python
License:
No Package No Dependents
No License No Package No Dependents

About ComfyUI-VoxCPM

wildminder/ComfyUI-VoxCPM

ComfyUI node for highly expressive speech and realistic zero-shot voice cloning

Implements a tokenizer-free diffusion-based TTS architecture built on MiniCPM-4 that models speech in continuous space rather than discrete tokens, enabling context-aware prosody generation. Includes native LoRA fine-tuning support within ComfyUI for custom voice style training, automatic model management with efficient VRAM offloading, and operates at 6.25Hz token rate for faster synthesis on consumer hardware. Integrates seamlessly with ComfyUI's node workflow system, supporting optional reference audio for voice cloning and compatible with multiple inference backends (CUDA, CPU, MPS, DirectML).

About ComfyUI-KugelAudio

Saganaki22/ComfyUI-KugelAudio

🗣️ ComfyUI nodes for KugelAudi- Open-source text-to-speech with voice cloning for 24 European languages

Scores updated daily from GitHub, PyPI, and npm data. How scores work