ChatTTS and voice-chatgpt-python
About ChatTTS
2noise/ChatTTS
A generative speech model for daily dialogue.
Based on the README, here's the technical summary: Built on a transformer architecture trained on 100,000+ hours of multilingual audio, ChatTTS enables fine-grained prosodic control through special tokens for laughter, pauses, and interjections while supporting multiple speakers via speaker embeddings. The model includes a discrete VAE encoder for zero-shot speaker inference and streaming audio generation capabilities, supporting English and Chinese with plans for additional languages.
About voice-chatgpt-python
enoobis/voice-chatgpt-python
This project is a conversational AI program that uses speech recognition, text-to-speech, and the OpenAI API to generate responses to user prompts, allowing for a natural conversation flow.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work