DrewThomasson/doc2interview

This is an interface that will offline convert anything pdf document you give it into an interview between two people discussing it.

12
/ 100
Experimental

Leverages Ollama's `phi3.5` model for local LLM-based dialogue generation and XTTS for text-to-speech synthesis, with optional CUDA acceleration for faster audio generation. Provides a Gradio web interface supporting both PDF uploads and article URLs, outputting chapter-wise audio files and a combined final interview track. Runs entirely offline without external API dependencies, with speaker voice customization through reference audio samples.

No commits in the last 6 months.

No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 1 / 25
Community 5 / 25

How are scores calculated?

Stars

16

Forks

1

Language

Python

License

Last pushed

Dec 08, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/DrewThomasson/doc2interview"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.