HeadTTS and StreamingKokoroJS
The former is a user-facing application for text-to-speech with advanced features like lip-sync, while the latter appears to be a client library for browser-based text-to-speech, suggesting the latter could be a foundational component or a simpler alternative to integrate Kokoro-JS functionality.
About HeadTTS
met4citizen/HeadTTS
HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for lip-sync. Runs in-browser (WebGPU/WASM) or on local Node.js WebSocket/REST server (CPU).
Leverages transformers.js with ONNX Runtime for client-side model execution, supporting both WebGPU acceleration and WASM fallback with configurable quantization levels (fp32/fp16/q8/q4). Provides phoneme-level timing data and Oculus-compatible visemes for precise lip-sync animation, with adjustable timing offsets for integration with 3D avatar frameworks like TalkingHead. Supports flexible endpoint configuration with automatic fallback between in-browser and Node.js server backends, enabling graceful degradation across browsers and deployment scenarios.
About StreamingKokoroJS
sai9640nayak/StreamingKokoroJS
Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100% open source
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work