Aivis-Project/AivisSpeech-Engine
AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine
# Technical Summary Built on VOICEVOX ENGINE's architecture, this engine uses ONNX Runtime for efficient CPU-based inference while supporting the AIVMX model format (.aivmx)—a specialized container combining learned models, hyperparameters, style vectors, and speaker metadata. The REST API maintains VOICEVOX compatibility but extends functionality with emotion-rich voice generation and style control through embedded style IDs. Deployable across Windows, macOS, and Linux with optional NVIDIA GPU acceleration via CUDA or DirectML, it integrates with the AivisSpeech desktop application and supports model discovery via local directories or S3-compatible storage.
150 stars.
Stars
150
Forks
25
Language
Python
License
LGPL-3.0
Category
Last pushed
Mar 08, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Aivis-Project/AivisSpeech-Engine"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
devnen/Chatterbox-TTS-Server
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible...
daswer123/xtts-api-server
A simple FastAPI Server to run XTTSv2
jamiepine/voicebox
The open-source voice synthesis studio
jianchang512/ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to...
gokhaneraslan/chatterbox-finetuning
Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports 23 languages with...