gpustack/vox-box

A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

/ 100

Established

Provides flexible model sourcing through HuggingFace and ModelScope repositories with GPU acceleration via CUDA, enabling deployment across Linux, Windows, and macOS with configurable model sizes from tiny to large variants. Implements a stateless server architecture that auto-downloads and caches models, supporting both streaming (Paraformer-zh-streaming) and batch processing pipelines with CLI configuration for device binding and data directory management.

200 stars.

No Package No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

200

Forks

Language

Python

License

Apache-2.0

Compare

vox-box and voicebox

Related tools

devnen/Chatterbox-TTS-Server

Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible...

daswer123/xtts-api-server

A simple FastAPI Server to run XTTSv2

jamiepine/voicebox

The open-source voice synthesis studio

Aivis-Project/AivisSpeech-Engine

AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine

jianchang512/ChatTTS-ui

一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights