Image Caption Generation ML Frameworks

Applications and models for automatically generating textual descriptions of images using deep learning architectures (CNNs, RNNs, Transformers). Does NOT include sketch segmentation, image-to-audio conversion, or general object detection without caption output.

There are 71 image caption generation frameworks tracked. The highest-rated is tonybeltramelli/pix2code at 47/100 with 12,051 stars.

Get all 71 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=image-caption-generation&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 tonybeltramelli/pix2code

pix2code: Generating Code from a Graphical User Interface Screenshot

47
Emerging
2 emilwallner/Screenshot-to-code

A neural network that transforms a design mock-up into a static website.

46
Emerging
3 ashnkumar/sketch-code

Keras model to generate HTML code from hand-drawn website mockups....

40
Emerging
4 bobbens/sketch_simplification

Models and code related to sketch simplification of rough sketches.

40
Emerging
5 jchenghu/ExpansionNet_v2

Implementation code of the work "Exploiting Multiple Sequence Lengths in...

38
Emerging
6 MiteshPuthran/Image-Caption-Generator

The LSTM model generates captions for the input images after extracting...

38
Emerging
7 shagunsodhani/Image-Caption-Generator

A simple implementation of neural image caption generator

35
Emerging
8 Y-debug-sys/UCL-sketch

[IEEE TKDE] Official Implementation of "Learning-based Sketches for...

35
Emerging
9 val-iisc/sketch-parse

Code, demos and data for SketchParse (a neural network for sketch...

34
Emerging
10 aimagelab/camel

CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022

34
Emerging
11 hlamba28/Automatic-Image-Captioning

Generating Captions for images using Deep Learning

34
Emerging
12 mzbac/sketch2code

a simple model that implemented sketch to code

32
Emerging
13 riad5089/Image_Caption_Generator

This is a Deep Learning model which uses Computer Vision and NLP to generate...

28
Experimental
14 nasib-ullah/video-captioning-models-in-Pytorch

A PyTorch implementation of state of the art video captioning models from...

28
Experimental
15 dhruvik-patel/image-description

This repo represents our machine learning project Image Description which is...

28
Experimental
16 llegomark/openai-gpt4-vision

This repository contains a simple image captioning app that utilizes...

27
Experimental
17 Dantekk/Image-Captioning

Image Captioning using CNN and Transformer.

27
Experimental
18 GvHemanth/Image-to-Speech-Generation_Encoder-Attention-Decoder

This project aims to assist visually impaired individuals by providing a...

25
Experimental
19 ArnabKumarRoy02/Image-Caption-Generator

This project is a part of the semester long research-based Mini Project...

25
Experimental
20 ChaitanyaC22/Udacity-CVND-Project2-Automated-Image-Captioning

This project aims at training a CNN-RNN model to predict captions for a...

24
Experimental
21 iangitonga/capgen

A command-line AI captions generator for audio and videos.

23
Experimental
22 ammarlodhi255/image-captioning-system-to-assist-the-blind

An image captioning system that is able to predict and speak out a caption...

23
Experimental
23 nextml/caption-contest-data

Data from the caption contest.

23
Experimental
24 Aryavir07/Image-Captioning-Using-CNN-and-LSTM

Generating Captions for images using CNN & LSTM on Flickr8K dataset.The...

23
Experimental
25 nirajankarki5/Flickr30k-Image-Caption-Generator-Using-Deep-Learning

A deep learning model that generates descriptions of an image.

23
Experimental
26 ArchAngelAries/TagScribeR

A tool to streamline AI image captioning

23
Experimental
27 qyzdao/Sketch-Based-Deep-Learning

A resource repository for sketch based deep learning papers

23
Experimental
28 snehalathaArakkonam/Img_CapGenerator

Generates captions for images using a CNN encoder and LSTM decoder trained...

22
Experimental
29 Aryan0419/Image-Captioning-CNN-LSTM

🖼️ Generate descriptive captions for images using a CNN-LSTM model,...

22
Experimental
30 SreeDharshan-GJ/Image-Caption-Generator-using-CNN-LSTM

Deep learning project that generates natural-language captions for images...

22
Experimental
31 IEEE-NITK/Image_Captioning

Image Captioning is the process of generating textual description of an...

22
Experimental
32 iamirmasoud/image_captioning

Automatic Image Captioning using PyTorch on MS COCO dataset

22
Experimental
33 prasadgujar/CapSearch

An Image Caption Generation based search

21
Experimental
34 eddisonpham/DynaStride

Dynamic Stride Windowing with MMCoT for Multi-Scene Captioning

21
Experimental
35 arunadurai/Eye-For-Blind

The aim of this project is to summarize the image using deep learning techniques

20
Experimental
36 nico1008/paint2code

Paint2code - a lightweight tool designed to transform your hand-drawn...

20
Experimental
37 apiverve/image-caption-react-tutorial

AI-powered image caption generator built with React

19
Experimental
38 jarora04/Project_GenAI

An On-Device LLM used for captioning and text generation

19
Experimental
39 iFairPlay22/The-Describer

Ecosystème permettant de donner l'accès à la description d'images au...

19
Experimental
40 AmirhosseinHonardoust/Image-Captioning-CNN-LSTM

An end-to-end image captioning project using a CNN encoder (ResNet-50) and...

19
Experimental
41 0000xFFFF/ai-image-desc

describe image in English with AI

18
Experimental
42 Arbazkhan-cs/AI-Powered-Image-Captioning

🖼️ AI-Powered Image Captioning: Seamlessly generate captions for images...

17
Experimental
43 purveshmakode24/captionr

Smart AI bot to generate captions from images.

17
Experimental
44 chandana-galgali/Automated-Caption-Generation-using-Encoder-Decoder-Model

An end-to-end Computer Vision and NLP project capable of classifying jewelry...

17
Experimental
45 Rumeysakeskin/IMECA

Automatic image captioning on Android-based mobile application with CNN and...

16
Experimental
46 ayushman72/ImageCaptioning

An AI model to caption images

16
Experimental
47 itsanthonio/Vision-To-Speech

A vision to speech project

16
Experimental
48 amanptl/quote-it

Quote It! will be a Software-as-a-Service platform that aims to solve the...

15
Experimental
49 aliahmad552/image-caption-generator-using-deeplearning-nlp

This project implements an Image Caption Generator, a deep learning model...

15
Experimental
50 dayyass/image-captioning

My solution to the Image Captioning Final Project of the Coursera...

15
Experimental
51 siddhali24/VISCRIBE-project

Visual Describe - Object Detection and Caption Generation Using YOLO

15
Experimental
52 parask11/image-captioner

Generates suitable captions for the images of people and animals input by the user.

15
Experimental
53 Aditya-ha11/vlm-onnx-comparison

Vision-Language Captioning using PyTorch vs ONNX with performance benchmarking

15
Experimental
54 iVishalr/Scene-Describer

Video Timestamp recommendation using Transfer Learning and NLP

15
Experimental
55 vivek-kumar9/Labelly--Image-Labelling-app-using-CNN-and-LSTMs

Image captioning application using a CNN–LSTM encoder–decoder architecture...

14
Experimental
56 VaiBhaVSinGh91/ImageCaption

This repository contains an implementation of an image captioning model that...

14
Experimental
57 harshwalia36/Audio-Description-of-Image-for-visually-impaired-person

Mini Project for Btech which helps the visually impaired person to get the...

14
Experimental
58 jaychampaneri14/image-captioning

CNN-LSTM image captioning with attention mechanism

14
Experimental
59 paazmaya/sesoko

Prepare and caption images for using them as training data

14
Experimental
60 willyfh/msvd-indonesian

MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian...

13
Experimental
61 kr1shnasomani/CaptionCraft

Image Captioner using DenseNet201 and LSTM

13
Experimental
62 ehsan-torabi/Draw2Matrix

Draw2Matrix — Draw sketches and instantly convert them into exportable...

12
Experimental
63 shantanudwvd/Instagram-Caption-Generator

AI-powered Instagram caption generator using GPT-4 Vision, Spotify...

12
Experimental
64 omkar87796-sudo/VisionBrief-AI-Intelligent-Image-to-Text-Summary-Web-Application

An AI-powered web application that generates intelligent text summaries from...

12
Experimental
65 nicolafan/neural-artwork-caption-generator

Code for the paper "Exploring the Synergy Between Vision-Language...

12
Experimental
66 yashwanthreddytangella-alt/image-captioning-attention

Image captioning (ResNet encoder + attention LSTM) — data prep, training,...

11
Experimental
67 Aniket10singh16/Image2Description

A research prototype for a graph-based image captioning system using object...

11
Experimental
68 jatin-35asd/image-captioning-generator-app

AI-powered Image Caption Generator web application using CNN–LSTM...

11
Experimental
69 theSohamTUmbare/CAPbot

My discord bot that generate the captions for the images

11
Experimental
70 Smile040501/image_captioning

Generates textual description of any given image. Use both Natural Language...

10
Experimental
71 allanninal/image-captioning-app

A full-stack AI-powered image captioning app built with ReactJS (using Vite)...

10
Experimental