daswer123/xtts-api-server

A simple FastAPI Server to run XTTSv2

63
/ 100
Established

Exposes XTTSv2's multilingual text-to-speech capabilities through REST endpoints with support for voice cloning via speaker samples, configurable model versions, and optional GPU acceleration via DeepSpeed. Includes streaming mode for low-latency audio generation, result caching, and low-VRAM operation modes, with Docker deployment and integration hooks for applications like SillyTavern. Supports both official and custom fine-tuned models loaded from HuggingFace or local directories.

577 stars and 17,930 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m
Maintenance 0 / 25
Adoption 20 / 25
Maturity 18 / 25
Community 25 / 25

How are scores calculated?

Stars

577

Forks

156

Language

Python

License

MIT

Last pushed

Jul 21, 2024

Monthly downloads

17,930

Commits (30d)

0

Dependencies

18

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/daswer123/xtts-api-server"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.