dia and Dia-TTS-Server

The second tool, Gmzxdotzz/Dia-TTS-Server, is a self-hosting server that implements and exposes the functionality of the first tool, nari-labs/dia, making them ecosystem siblings where one provides the core model and the other provides a server wrapper with a UI and API for deployment.

dia
50
Established
Dia-TTS-Server
37
Emerging
Maintenance 6/25
Adoption 10/25
Maturity 15/25
Community 19/25
Maintenance 13/25
Adoption 3/25
Maturity 9/25
Community 12/25
Stars: 19,202
Forks: 1,683
Downloads:
Commits (30d): 0
Language: Python
License: Apache-2.0
Stars: 4
Forks: 1
Downloads:
Commits (30d): 0
Language: Python
License: MIT
No Package No Dependents
No Package No Dependents

About dia

nari-labs/dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Built on a 1.6B parameter architecture, Dia directly synthesizes multi-speaker dialogue from transcripts with audio conditioning for voice cloning and emotion control, supporting nonverbal tags like laughter and coughing. Integrates with Hugging Face Transformers and provides inference through Python APIs, CLI, and Gradio UI, with realtime factor performance ranging from 0.9x–2.2x on RTX 4090 depending on precision. Uses the Descript Audio Codec for audio generation and supports speaker consistency via seed fixing or audio prompts.

About Dia-TTS-Server

Gmzxdotzz/Dia-TTS-Server

Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, and GPU/CPU execution.

Scores updated daily from GitHub, PyPI, and npm data. How scores work