jdh-algo/JoyHallo

JoyHallo: Digital human model for Mandarin

45
/ 100
Emerging

Implements audio-driven video synthesis with a semi-decoupled architecture that decouples lip, expression, and pose features to improve efficiency and cross-lingual capability. Uses Chinese wav2vec2 for Mandarin audio embedding and integrates Stable Diffusion with motion modules for frame generation, achieving 14.3% faster inference than the base Hallo model. Supports both Mandarin and English video generation while maintaining strong cross-language performance on the proprietary 29-hour jdh-Hallo dataset.

522 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 17 / 25

How are scores calculated?

Stars

522

Forks

51

Language

Python

License

MIT

Last pushed

Sep 21, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/jdh-algo/JoyHallo"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.