danthelion/doc2audiobook

Convert text documents to high-fidelity audio(books).

Score: 45 / 100 (Emerging)

Supports 30+ input document formats (PDF, DOCX, EPUB, images with OCR, etc.) via textract, then synthesizes audio using Google Cloud's WaveNet models for natural-sounding speech. Runs containerized with Docker, mapping local input/output directories and requiring GCP authentication via service account credentials. Offers flexible voice selection across multiple languages and speaker profiles through command-line configuration.
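One practical detail sits between the two stages described above: extracted text from a whole book is far longer than a speech API usually accepts in one request. The sketch below is not taken from the project; it is a hypothetical helper, `chunk_text`, that splits extracted text at sentence boundaries so each piece stays under a byte budget. The 5000-byte default is an assumption about the Cloud Text-to-Speech per-request limit and should be checked against current quotas. The extraction and synthesis calls themselves are left out.

```python
import re

# Assumed per-request input limit in bytes; verify against current quotas.
MAX_BYTES = 5000


def chunk_text(text, max_bytes=MAX_BYTES):
    """Split text into chunks whose UTF-8 size fits within max_bytes.

    Breaks at sentence boundaries where possible and falls back to word
    boundaries for a sentence that is too long on its own. A single word
    longer than max_bytes is emitted as an oversize chunk.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate.encode("utf-8")) <= max_bytes:
            current = candidate
            continue
        # The sentence does not fit: flush what has been collected so far.
        if current:
            chunks.append(current)
            current = ""
        if len(sentence.encode("utf-8")) <= max_bytes:
            current = sentence
            continue
        # The sentence alone exceeds the budget: split it on whitespace.
        for word in sentence.split():
            candidate = f"{current} {word}" if current else word
            if len(candidate.encode("utf-8")) <= max_bytes:
                current = candidate
            else:
                if current:
                    chunks.append(current)
                current = word
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk would then be sent to the speech service as a separate request and the resulting audio segments concatenated in order.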

204 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

Stars: 204
Forks: 34
Language: Python
License: MIT
Last pushed: Jan 17, 2020
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/danthelion/doc2audiobook"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
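The same request can be made from Python using only the standard library. This is a minimal sketch: the helper names `quality_url` and `fetch_quality` are hypothetical, the category/owner/repo path pattern is inferred from the single URL shown above, and the response is assumed to be JSON, which the page does not confirm.

```python
import json
import urllib.request

# Base path taken from the curl example; the pattern after it is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL for one repository."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the record for one repository and parse it as JSON.

    Raises urllib.error.HTTPError on a non-2xx response, for example
    when the daily request limit has been reached.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request):
# data = fetch_quality("voice-ai", "danthelion", "doc2audiobook")
```

Since the keyless tier is limited to 100 requests/day, caching the parsed result locally is advisable when polling more than a handful of repositories.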