Yazdi9/Talking_Face_Avatar
Avatar Generation For Characters and Game Assets Using Deep Fakes
Combines Leonardo.ai image generation and ElevenLabs text-to-speech APIs to produce talking head videos from AI-generated portraits and synthesized audio. Uses SadTalker's audio-driven facial animation pipeline with ExpNet and PoseVAE models for expression and pose prediction, plus GFPGAN for face enhancement and Wav2Lip for lip-sync accuracy. Supports multiple generation modes (still, reference, resize) with configurable preprocessing and can process both static images and video sources via CLI or web UI.
232 stars. No commits in the last 6 months.
Stars
232
Forks
42
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Aug 18, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/Yazdi9/Talking_Face_Avatar"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.