johnGettings/LIHQ
Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)
Combines multiple open-source models (FOMM for head motion, Wav2Lip for lip-sync, GFPGAN for upscaling) in a pipeline that transfers facial expressions from reference video to a static image while syncing generated or uploaded audio. Designed exclusively for Google Colab to leverage free GPU, with optional frame interpolation and background matting for final quality enhancement.
262 stars. No commits in the last 6 months.
Stars
262
Forks
40
Language
Python
License
—
Category
Last pushed
Jul 03, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/johnGettings/LIHQ"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
FelippeChemello/podcast-maker
Fully automated video maker using motion graphics and text-to-speech synthesis to turn...
ManimCommunity/manim-voiceover
Manim plugin for all things voiceover
charleprr/redditube
A video generator from Reddit posts and comments
HA6Bots/Automatic-Youtube-Reddit-Text-To-Speech-Video-Generator-and-Uploader
A series of 3 programs that will automatically receive scripts from Reddit, allow the user to...
haolinwang819-boop/ai-video-generation-workflow
AI video generation workflow with script, slides, TTS, subtitles, and FFmpeg rendering.