ComfyUI-EdgeTTS and ComfyUI-SparkTTS

Both are custom ComfyUI nodes for text-to-speech, acting as competitors by offering different TTS backends (Microsoft Edge TTS versus a SparkTTS LLM-based system) within the same workflow environment.

ComfyUI-EdgeTTS
47
Emerging
ComfyUI-SparkTTS
41
Emerging
Maintenance 10/25
Adoption 8/25
Maturity 16/25
Community 13/25
Maintenance 2/25
Adoption 10/25
Maturity 16/25
Community 13/25
Stars: 66
Forks: 8
Downloads:
Commits (30d): 0
Language: Python
License: GPL-3.0
Stars: 124
Forks: 13
Downloads:
Commits (30d): 0
Language: Python
License: GPL-3.0
No Package No Dependents
Stale 6m No Package No Dependents

About ComfyUI-EdgeTTS

1038lab/ComfyUI-EdgeTTS

ComfyUI-EdgeTTS is a powerful text-to-speech node for ComfyUI, leveraging Microsoft's Edge TTS capabilities. It enables seamless conversion of text into natural-sounding speech, supporting multiple languages and voices. Ideal for enhancing user interactions, this node is easy to integrate and customize, making it perfect for various applications.

Provides complementary speech-to-text capabilities via OpenAI's Whisper with multiple model sizes and automatic language detection, alongside audio export nodes supporting WAV/MP3/FLAC formats with quality presets. The implementation uses lazy loading and caching to optimize performance and memory usage within ComfyUI's node-based workflow system. Integrates FFmpeg for audio codec handling and supports GPU acceleration via CUDA for faster Whisper inference.

About ComfyUI-SparkTTS

1038lab/ComfyUI-SparkTTS

ComfyUI-SparkTTS is a custom ComfyUI node implementation of SparkTTS, an advanced text-to-speech system that harnesses the power of large language models (LLMs) to generate highly accurate and natural-sounding speech.

Scores updated daily from GitHub, PyPI, and npm data. How scores work