johnGettings/LIHQ

Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)

/ 100

Emerging

Combines multiple open-source models (FOMM for head motion, Wav2Lip for lip-sync, GFPGAN for upscaling) in a pipeline that transfers facial expressions from reference video to a static image while syncing generated or uploaded audio. Designed exclusively for Google Colab to leverage free GPU, with optional frame interpolation and background matting for final quality enhancement.

262 stars. No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 19 / 25

How are scores calculated?

Stars

262

Forks

Language

Python

License

—

Higher-rated alternatives

FelippeChemello/podcast-maker

Fully automated video maker using motion graphics and text-to-speech synthesis to turn...

ManimCommunity/manim-voiceover

Manim plugin for all things voiceover

charleprr/redditube

A video generator from Reddit posts and comments

HA6Bots/Automatic-Youtube-Reddit-Text-To-Speech-Video-Generator-and-Uploader

A series of 3 programs that will automatically receive scripts from Reddit, allow the user to...

haolinwang819-boop/ai-video-generation-workflow

AI video generation workflow with script, slides, TTS, subtitles, and FFmpeg rendering.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights