DragonLiu1995/video-to-audio-through-text

[NeurIPS 2024] Code, Dataset, Samples for the VATT paper “ Tell What You Hear From What You See - Video to Audio Generation Through Text”

/ 100

Emerging

No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 6 / 25

Stars

Forks

Language

Python

License

—

Category

Last pushed

Jul 24, 2025

Commits (30d)

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/DragonLiu1995/video-to-audio-through-text"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

OpenVGLab/OmniLottie

[CVPR 2026🔥] 🧑‍🎨 OmniLottie, an open-sourced multi-modal instructed vector animation generator...

Mrkomiljon/awesome-generative-ai

Multimodal generative AI resources : talking heads, STT, TTS, image & video generation, and more.

NVIDIA/Maya-ACE

Maya-ACE: A Reference Client Implementation for NVIDIA ACE Audio2Face Service

jdh-algo/JoyHallo

JoyHallo: Digital human model for Mandarin

michaelzhang-ai/Speech2Video

ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"