met4citizen/HeadTTS
HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for lip-sync. Runs in the browser (WebGPU/WASM) or on a local Node.js WebSocket/REST server (CPU).
HeadTTS uses transformers.js with ONNX Runtime to run the model client-side, with WebGPU acceleration, a WASM fallback, and configurable quantization levels (fp32/fp16/q8/q4). It provides phoneme-level timing data and Oculus-compatible visemes for lip-sync animation, with adjustable timing offsets for integration with 3D avatar frameworks such as TalkingHead. Endpoints are configurable, with automatic fallback between the in-browser and Node.js server backends, so it degrades gracefully across browsers and deployment scenarios.
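The fallback behaviour described above can be pictured with a small sketch. This is not the HeadTTS API: `PREFERENCE`, `pickBackend`, and `detectBrowserBackends` are hypothetical names, and the preference order (WebGPU, then WASM, then a server) is an assumption made for illustration.

```javascript
// Hypothetical sketch, not the HeadTTS API.
// Assumed preference order: fastest option first, most widely available last.
const PREFERENCE = ["webgpu", "wasm", "server"];

// Return the first preferred backend that is flagged as available, or null.
function pickBackend(available, preference = PREFERENCE) {
  for (const name of preference) {
    if (available[name]) return name;
  }
  return null;
}

// Rough capability check using standard globals only.
// `server` stays false here; a real check would probe a configured endpoint.
function detectBrowserBackends(env = globalThis) {
  return {
    webgpu: Boolean(env.navigator && "gpu" in env.navigator),
    wasm: typeof env.WebAssembly === "object",
    server: false,
  };
}

const chosen = pickBackend(detectBrowserBackends());
console.log(`Selected backend: ${chosen}`);
```

Keeping detection separate from selection means the same ordering logic can be reused with a different capability map, for example one filled in after pinging a local server.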
112 stars and 375 monthly downloads. Available on npm.
Stars: 112
Forks: 16
Language: JavaScript
License: MIT
Category:
Last pushed: Dec 08, 2025
Monthly downloads: 375
Commits (30d): 0
Dependencies: 2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/met4citizen/HeadTTS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
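The same request can be made from JavaScript. The sketch below assumes the path follows a category/owner/repo pattern (inferred only from the single URL above) and that the endpoint returns JSON; neither is confirmed here. `qualityUrl` and `fetchQuality` are hypothetical helper names.

```javascript
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality";

// Build the request URL. The category/owner/repo layout is an assumption
// inferred from the curl example, not a documented contract.
function qualityUrl(category, owner, repo) {
  const parts = [category, owner, repo].map(encodeURIComponent);
  return `${API_BASE}/${parts.join("/")}`;
}

// Fetch the data with the standard Fetch API (browsers and Node.js 18+).
// Parsing the body as JSON is an assumption about the response format.
async function fetchQuality(category, owner, repo) {
  const response = await fetch(qualityUrl(category, owner, repo));
  if (!response.ok) {
    throw new Error(`Request failed with status ${response.status}`);
  }
  return response.json();
}

console.log(qualityUrl("voice-ai", "met4citizen", "HeadTTS"));
```

Calling `fetchQuality("voice-ai", "met4citizen", "HeadTTS")` returns a promise for the parsed body; with the keyless limit of 100 requests/day, caching the result is worth considering.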
Related tools
thewh1teagle/kokoro-onnx
TTS with kokoro and onnx runtime
nazdridoy/kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with...
Lyrcaxis/KokoroSharp
Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and...
remsky/Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch...
lucasjinreal/Kokoros
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with...