HildaM/LongCat-AudioDiT-Web

LongCat-AudioDiT 网页版本｜ Web UI for LongCat-AudioDiT — SOTA diffusion TTS with zero-shot voice cloning, audio splitting, and Whisper ASR integration. Supports CUDA / MPS / CPU.

/ 100

Experimental

No Package No Dependents

Maintenance 13 / 25

Adoption 4 / 25

Maturity 9 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

MIT

Higher-rated alternatives

Agents365-ai/video-podcast-maker

AI-powered video podcast creation skill for coding agents. Supports Bilibili & YouTube,...

Saganaki22/ComfyUI-OmniVoice-TTS

OmniVoice TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning,...

AlexandreSajus/JARVIS

Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface

fikrikarim/parlor

On-device, real-time multimodal AI. Have natural voice and vision conversations with an AI that...

team-telnyx/ai

Official one-stop shop for AI Agents and developers building with Telnyx.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights