ComfyUI-EdgeTTS and ComfyUI-MegaTTS

These are competitors, offering different implementations of text-to-speech synthesis within ComfyUI, with ComfyUI-EdgeTTS leveraging Microsoft's Edge TTS and ComfyUI-MegaTTS utilizing ByteDance MegaTTS3.

ComfyUI-EdgeTTS
47
Emerging
ComfyUI-MegaTTS
38
Emerging
Maintenance 10/25
Adoption 8/25
Maturity 16/25
Community 13/25
Maintenance 2/25
Adoption 8/25
Maturity 16/25
Community 12/25
Stars: 66
Forks: 8
Downloads:
Commits (30d): 0
Language: Python
License: GPL-3.0
Stars: 49
Forks: 6
Downloads:
Commits (30d): 0
Language: Python
License: GPL-3.0
No Package No Dependents
Stale 6m No Package No Dependents

About ComfyUI-EdgeTTS

1038lab/ComfyUI-EdgeTTS

ComfyUI-EdgeTTS is a powerful text-to-speech node for ComfyUI, leveraging Microsoft's Edge TTS capabilities. It enables seamless conversion of text into natural-sounding speech, supporting multiple languages and voices. Ideal for enhancing user interactions, this node is easy to integrate and customize, making it perfect for various applications.

Provides complementary speech-to-text capabilities via OpenAI's Whisper with multiple model sizes and automatic language detection, alongside audio export nodes supporting WAV/MP3/FLAC formats with quality presets. The implementation uses lazy loading and caching to optimize performance and memory usage within ComfyUI's node-based workflow system. Integrates FFmpeg for audio codec handling and supports GPU acceleration via CUDA for faster Whisper inference.

About ComfyUI-MegaTTS

1038lab/ComfyUI-MegaTTS

A ComfyUI custom node based on ByteDance MegaTTS3, enabling high-quality text-to-speech synthesis with voice cloning capabilities for both Chinese and English.

Scores updated daily from GitHub, PyPI, and npm data. How scores work