Lip Reading Synthesis ML Frameworks

Tools for reading lip movements from video and generating corresponding speech or text, plus systems for syncing audio with lip movements in video. Does NOT include general speech recognition, text-to-speech without visual input, or facial recognition beyond mouth/lip analysis.

There are 9 lip reading synthesis frameworks tracked. The highest-rated is astorfi/lip-reading-deeplearning at 43/100 with 1,901 stars.

Get all 9 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=lip-reading-synthesis&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 astorfi/lip-reading-deeplearning

:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

43
Emerging
2 deepconvolution/LipNet

Automated Lip reading from real-time videos in tensorflow in python

33
Emerging
3 d-kavinraja/MouthMap

MouthMap is a deep learning-based lip reading system that converts silent...

29
Experimental
4 articulateinstruments/DeepLabCut-for-Speech-Production

Trained deep neural-net models for estimating articulatory keypoints from...

28
Experimental
5 ZakirCodeArchitect/Sonic-Lipsync-AI

A Google Colab-based Gradio app for generating lip-synced videos using the...

21
Experimental
6 Cl0ud-9/Lip-Sync-Video-Generator

An AI-powered pipeline that transforms text into realistic lip-synced...

19
Experimental
7 MrfoxAK/Evaluate-Lip-reading-using-Deep-Learning-Techniques.

This paper explores Silent Sound Technology, focusing on its potential to...

17
Experimental
8 BenedettoSimone/Lipnet-ITA

LipReadingITA: Keras implementation of the method described in the paper...

15
Experimental
9 Viderspace/Look2Listen

End-to-end audio-visual speech enhancement pipeline — from preprocessing to...

14
Experimental