danthelion/doc2audiobook

Convert text documents to high fidelity audio(books).

/ 100

Emerging

Supports 30+ input document formats (PDF, DOCX, EPUB, images with OCR, etc.) via textract, then synthesizes audio using Google Cloud's WaveNet models for natural-sounding speech. Runs containerized with Docker, mapping local input/output directories and requiring GCP authentication via service account credentials. Offers flexible voice selection across multiple languages and speaker profiles through command-line configuration.

204 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

204

Forks

Language

Python

License

MIT

Higher-rated alternatives

NVIDIA-AI-Blueprints/pdf-to-podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

tjunttila/pdf2video

A tool for making videos from PDF presentations.

chaonan99/ppt_presenter

Convert ppt to video with audio track, using text to speech synthesis

eminemahjoub/pdf-voice-reader

"PDF Reader: A Python application for seamless PDF viewing with enhanced text-to-speech capabilities."

hutchresearch/latex2speech

TeX2Speech is an application that turns LaTeX documents into spoken audio.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights