ComfyUI-VoxCPM and ComfyUI-ChatterboxTTS

These tools are competitors, as both provide ComfyUI nodes for text-to-speech functionality, with VoxCPM focusing on expressive speech and zero-shot voice cloning, while ChatterboxTTS emphasizes production-grade open-source TTS.

ComfyUI-VoxCPM

Emerging

ComfyUI-ChatterboxTTS

Emerging

Maintenance 6/25

Adoption 10/25

Maturity 15/25

Community 16/25

Maintenance 2/25

Adoption 5/25

Maturity 9/25

Community 14/25

Stars: 390

Forks: 42

Downloads: —

Commits (30d): 0

Language: Python

License: Apache-2.0

Stars: 13

Forks: 3

Downloads: —

Commits (30d): 0

Language: Python

License: MIT

No Package No Dependents

Stale 6m No Package No Dependents

About ComfyUI-VoxCPM

wildminder/ComfyUI-VoxCPM

ComfyUI node for highly expressive speech and realistic zero-shot voice cloning

Implements a tokenizer-free diffusion-based TTS architecture built on MiniCPM-4 that models speech in continuous space rather than discrete tokens, enabling context-aware prosody generation. Includes native LoRA fine-tuning support within ComfyUI for custom voice style training, automatic model management with efficient VRAM offloading, and operates at 6.25Hz token rate for faster synthesis on consumer hardware. Integrates seamlessly with ComfyUI's node workflow system, supporting optional reference audio for voice cloning and compatible with multiple inference backends (CUDA, CPU, MPS, DirectML).

About ComfyUI-ChatterboxTTS

Yuan-ManX/ComfyUI-ChatterboxTTS

ComfyUI-ChatterboxTTS is now available in ComfyUI, Chatterbox is the first production-grade open-source TTS model.

Related comparisons

ComfyUI-VoxCPM and TTS-Audio-Suite ComfyUI-VoxCPM and VibeVoice-ComfyUI ComfyUI-VoxCPM and ComfyUI-VibeVoice ComfyUI-VoxCPM and ComfyUI-Maya1_TTS ComfyUI-VoxCPM and ComfyUI-XTTS ComfyUI-VoxCPM and ComfyUI-SparkTTS

Scores updated daily from GitHub, PyPI, and npm data. How scores work