HildaM/LongCat-AudioDiT-Web
LongCat-AudioDiT 网页版本 | Web UI for LongCat-AudioDiT — SOTA diffusion TTS with zero-shot voice cloning, audio splitting, and Whisper ASR integration. Supports CUDA / MPS / CPU.
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/HildaM/LongCat-AudioDiT-Web"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Agents365-ai/video-podcast-maker
AI-powered video podcast creation skill for coding agents. Supports Bilibili & YouTube,...
Saganaki22/ComfyUI-OmniVoice-TTS
OmniVoice TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning,...
AlexandreSajus/JARVIS
Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface
fikrikarim/parlor
On-device, real-time multimodal AI. Have natural voice and vision conversations with an AI that...
team-telnyx/ai
Official one-stop shop for AI Agents and developers building with Telnyx.