kokoro-tts and kokoro-tts-addon
These are complements: the CLI tool provides a backend text-to-speech engine while the browser addon consumes TTS functionality as a frontend client, enabling local neural synthesis directly in the browser.
About kokoro-tts
nazdridoy/kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
Builds on ONNX Runtime for efficient inference and leverages chunk-based processing with configurable speech speed and format output (WAV/MP3). The tool integrates stdin piping for workflow composition and intelligently extracts chapter structure from EPUB/PDF files, enabling organized batch processing of long-form content with optional per-chapter audio splitting.
About kokoro-tts-addon
pinguy/kokoro-tts-addon
Local neural TTS for Browsers: fast, expressive, and offline—runs on modest hardware.
Implements a Flask-based local server paired with the 82M-parameter Kokoro model, enabling multi-voice synthesis with support for nine languages and accents through a Firefox extension popup. The architecture separates the inference backend from the browser frontend via HTTP, supporting both CPU and GPU acceleration while maintaining real-time performance even on legacy hardware like 2013 Xeons.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work