daswer123/xtts-api-server

A simple FastAPI Server to run XTTSv2

/ 100

Established

Exposes XTTSv2's multilingual text-to-speech capabilities through REST endpoints with support for voice cloning via speaker samples, configurable model versions, and optional GPU acceleration via DeepSpeed. Includes streaming mode for low-latency audio generation, result caching, and low-VRAM operation modes, with Docker deployment and integration hooks for applications like SillyTavern. Supports both official and custom fine-tuned models loaded from HuggingFace or local directories.

577 stars and 17,930 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m

Maintenance 0 / 25

Adoption 20 / 25

Maturity 18 / 25

Community 25 / 25

How are scores calculated?

Stars

577

Forks

156

Language

Python

License

MIT

Related tools

jamiepine/voicebox

The open-source voice synthesis studio

devnen/Chatterbox-TTS-Server

Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible...

jianchang512/ChatTTS-ui

一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to...

Aivis-Project/AivisSpeech-Engine

AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine

nari-labs/dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights