yapit-tts/yapit

Listen to anything. TTS for documents, papers, and web pages.

/ 100

Emerging

Leverages vision-based document layout detection (DocLayout-YOLO) to extract and semantically parse complex PDFs with math equations, tables, and citations—converting them to spoken descriptions rather than raw text. Supports pluggable TTS backends via OpenAI API compatibility, enabling local synthesis (Kokoro) or premium voices, plus scalable GPU workers via Redis job queue for distributed processing. Also exports clean markdown via URL endpoints (`/md`, `/md-annotated`) and integrates vision models (Gemini, vLLM-compatible APIs) for AI-powered document extraction.

Available on PyPI.

Maintenance 13 / 25

Adoption 3 / 25

Maturity 18 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

AGPL-3.0

Higher-rated alternatives

kxxt/aspeak

A simple text-to-speech client for Azure TTS API.

fishaudio/fish-audio-python

The official Python library for the Fish Audio API.

aahl/zai-tts

🗣️ ZAI/GLM TTS to OpenAI Speech API, 免费的语音合成API，支持克隆音色，基于智谱TTS

Aivis-Project/aivmlib

Aivis Voice Model File (.aivm/.aivmx) Utility Library

simonw/ospeak

CLI tool for running text through OpenAI Text to speech

Explore Voice AI Tools

All categories Trending Voice AI directory Insights