OpenMOSS/MOSS-TTS

MOSS‑TTS Family is an open‑source family of speech and sound generation models from MOSI.AI and the OpenMOSS team. It targets high‑fidelity, highly expressive output in complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.

Established

# Technical Summary

Built on a modular architecture, MOSS-TTS decomposes speech synthesis into five specialized models: a flagship TTS model for zero-shot voice cloning with phoneme-level control, a dialogue model that outperforms closed-source baselines on objective metrics, a prompt-based voice generator that requires no reference audio, a low-latency realtime agent model (180 ms TTFB), and a sound effect generator. The framework supports multiple inference backends, including PyTorch-free deployment via llama.cpp with GGUF quantization and ONNX audio codec decoding, plus SGLang acceleration for 3× faster generation. Models are available on Hugging Face and ModelScope, with fine-tuning tutorials and REST API documentation provided through the MOSI.AI studio platform.
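The 180 ms figure above is a TTFB (time to first byte), which for streaming TTS is the delay between sending a request and receiving the first audio chunk. The following is a minimal sketch of how such a number could be measured against any chunked audio stream. It does not use the MOSS-TTS API: `measure_ttfb` and `fake_stream` are hypothetical names, and `fake_stream` only simulates a streaming response with a fixed startup delay.

```python
import time
from typing import Iterable, Iterator, Tuple


def measure_ttfb(chunks: Iterable[bytes]) -> Tuple[float, bytes]:
    """Return (seconds until the first non-empty chunk, that chunk)."""
    start = time.perf_counter()
    for chunk in chunks:
        if chunk:  # skip empty keep-alive chunks
            return time.perf_counter() - start, chunk
    raise ValueError("stream ended before producing any audio")


def fake_stream(first_delay_s: float, n_chunks: int = 3) -> Iterator[bytes]:
    """Hypothetical stand-in for a streaming TTS response (assumption)."""
    time.sleep(first_delay_s)  # simulated model startup latency
    for _ in range(n_chunks):
        yield b"\x00" * 320  # placeholder bytes standing in for audio


ttfb_s, first = measure_ttfb(fake_stream(0.05))
print(f"TTFB: {ttfb_s * 1000:.0f} ms, first chunk: {len(first)} bytes")
```

In practice the iterable would be the chunk iterator of a real streaming client, and the timer would be started just before the request is sent, so that network and queueing time are included in the measurement.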

922 stars. Actively maintained with 30 commits in the last 30 days.

No Package

No Dependents

Maintenance 23 / 25

Adoption 10 / 25

Maturity 11 / 25

Community 17 / 25

Stars

922

Forks

Language

Python

License

Apache-2.0

Related tools

travisvn/chatterbox-tts-api

Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate...

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment...

fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

sfortis/openai_tts

Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible...

OpenMOSS/MOSS-TTSD

MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis....
