Image Caption Generation ML Frameworks
Applications and models for automatically generating textual descriptions of images using deep learning architectures (CNNs, RNNs, Transformers). Does NOT include sketch segmentation, image-to-audio conversion, or general object detection without caption output.
There are 71 image caption generation frameworks tracked. The highest-rated is tonybeltramelli/pix2code at 47/100 with 12,051 stars.
Get all 71 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=image-caption-generation&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
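The same request can be issued from a script. The sketch below is a minimal, hedged example using only the Python standard library: it rebuilds the URL shown in the curl command and decodes the response as JSON. The helper names `build_url` and `fetch_projects` are hypothetical, the response schema is not documented here so the payload is returned as-is, and whether raising `limit` above 20 returns all 71 projects is an assumption that has not been verified.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks",
              subcategory="image-caption-generation",
              limit=20):
    """Build the dataset URL with the same query parameters as the curl call."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch the dataset and return the decoded JSON payload unchanged.

    The structure of the payload is an assumption left to the caller;
    this helper does not inspect it.
    """
    with urlopen(build_url(limit=limit), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one keyless request, which counts against the 100 requests/day allowance mentioned above.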
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 | tonybeltramelli/pix2code: pix2code: Generating Code from a Graphical User Interface Screenshot | | Emerging |
| 2 | emilwallner/Screenshot-to-code: A neural network that transforms a design mock-up into a static website. | | Emerging |
| 3 | ashnkumar/sketch-code: Keras model to generate HTML code from hand-drawn website mockups.... | | Emerging |
| 4 | bobbens/sketch_simplification: Models and code related to sketch simplification of rough sketches. | | Emerging |
| 5 | jchenghu/ExpansionNet_v2: Implementation code of the work "Exploiting Multiple Sequence Lengths in... | | Emerging |
| 6 | MiteshPuthran/Image-Caption-Generator: The LSTM model generates captions for the input images after extracting... | | Emerging |
| 7 | shagunsodhani/Image-Caption-Generator: A simple implementation of neural image caption generator | | Emerging |
| 8 | Y-debug-sys/UCL-sketch: [IEEE TKDE] Official Implementation of "Learning-based Sketches for... | | Emerging |
| 9 | val-iisc/sketch-parse: Code, demos and data for SketchParse (a neural network for sketch... | | Emerging |
| 10 | aimagelab/camel: CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022 | | Emerging |
| 11 | hlamba28/Automatic-Image-Captioning: Generating Captions for images using Deep Learning | | Emerging |
| 12 | mzbac/sketch2code: a simple model that implemented sketch to code | | Emerging |
| 13 | riad5089/Image_Caption_Generator: This is a Deep Learning model which uses Computer Vision and NLP to generate... | | Experimental |
| 14 | nasib-ullah/video-captioning-models-in-Pytorch: A PyTorch implementation of state of the art video captioning models from... | | Experimental |
| 15 | dhruvik-patel/image-description: This repo represents our machine learning project Image Description which is... | | Experimental |
| 16 | llegomark/openai-gpt4-vision: This repository contains a simple image captioning app that utilizes... | | Experimental |
| 17 | Dantekk/Image-Captioning: Image Captioning using CNN and Transformer. | | Experimental |
| 18 | GvHemanth/Image-to-Speech-Generation_Encoder-Attention-Decoder: This project aims to assist visually impaired individuals by providing a... | | Experimental |
| 19 | ArnabKumarRoy02/Image-Caption-Generator: This project is a part of the semester long research-based Mini Project... | | Experimental |
| 20 | ChaitanyaC22/Udacity-CVND-Project2-Automated-Image-Captioning: This project aims at training a CNN-RNN model to predict captions for a... | | Experimental |
| 21 | iangitonga/capgen: A command-line AI captions generator for audio and videos. | | Experimental |
| 22 | ammarlodhi255/image-captioning-system-to-assist-the-blind: An image captioning system that is able to predict and speak out a caption... | | Experimental |
| 23 | nextml/caption-contest-data: Data from the caption contest. | | Experimental |
| 24 | Aryavir07/Image-Captioning-Using-CNN-and-LSTM: Generating Captions for images using CNN & LSTM on Flickr8K dataset. The... | | Experimental |
| 25 | nirajankarki5/Flickr30k-Image-Caption-Generator-Using-Deep-Learning: A deep learning model that generates descriptions of an image. | | Experimental |
| 26 | ArchAngelAries/TagScribeR: A tool to streamline AI image captioning | | Experimental |
| 27 | qyzdao/Sketch-Based-Deep-Learning: A resource repository for sketch based deep learning papers | | Experimental |
| 28 | snehalathaArakkonam/Img_CapGenerator: Generates captions for images using a CNN encoder and LSTM decoder trained... | | Experimental |
| 29 | Aryan0419/Image-Captioning-CNN-LSTM: 🖼️ Generate descriptive captions for images using a CNN-LSTM model,... | | Experimental |
| 30 | SreeDharshan-GJ/Image-Caption-Generator-using-CNN-LSTM: Deep learning project that generates natural-language captions for images... | | Experimental |
| 31 | IEEE-NITK/Image_Captioning: Image Captioning is the process of generating textual description of an... | | Experimental |
| 32 | iamirmasoud/image_captioning: Automatic Image Captioning using PyTorch on MS COCO dataset | | Experimental |
| 33 | prasadgujar/CapSearch: An Image Caption Generation based search | | Experimental |
| 34 | eddisonpham/DynaStride: Dynamic Stride Windowing with MMCoT for Multi-Scene Captioning | | Experimental |
| 35 | arunadurai/Eye-For-Blind: The aim of this project is to summarize the image using deep learning techniques | | Experimental |
| 36 | nico1008/paint2code: Paint2code - a lightweight tool designed to transform your hand-drawn... | | Experimental |
| 37 | apiverve/image-caption-react-tutorial: AI-powered image caption generator built with React | | Experimental |
| 38 | jarora04/Project_GenAI: An On-Device LLM used for captioning and text generation | | Experimental |
| 39 | iFairPlay22/The-Describer: Ecosystem for giving access to image descriptions to... | | Experimental |
| 40 | AmirhosseinHonardoust/Image-Captioning-CNN-LSTM: An end-to-end image captioning project using a CNN encoder (ResNet-50) and... | | Experimental |
| 41 | 0000xFFFF/ai-image-desc: describe image in English with AI | | Experimental |
| 42 | Arbazkhan-cs/AI-Powered-Image-Captioning: 🖼️ AI-Powered Image Captioning: Seamlessly generate captions for images... | | Experimental |
| 43 | purveshmakode24/captionr: Smart AI bot to generate captions from images. | | Experimental |
| 44 | chandana-galgali/Automated-Caption-Generation-using-Encoder-Decoder-Model: An end-to-end Computer Vision and NLP project capable of classifying jewelry... | | Experimental |
| 45 | Rumeysakeskin/IMECA: Automatic image captioning on Android-based mobile application with CNN and... | | Experimental |
| 46 | ayushman72/ImageCaptioning: An AI model to caption images | | Experimental |
| 47 | itsanthonio/Vision-To-Speech: A vision to speech project | | Experimental |
| 48 | amanptl/quote-it: Quote It! will be a Software-as-a-Service platform that aims to solve the... | | Experimental |
| 49 | aliahmad552/image-caption-generator-using-deeplearning-nlp: This project implements an Image Caption Generator, a deep learning model... | | Experimental |
| 50 | dayyass/image-captioning: My solution to the Image Captioning Final Project of the Coursera... | | Experimental |
| 51 | siddhali24/VISCRIBE-project: Visual Describe - Object Detection and Caption Generation Using YOLO | | Experimental |
| 52 | parask11/image-captioner: Generates suitable captions for the images of people and animals input by the user. | | Experimental |
| 53 | Aditya-ha11/vlm-onnx-comparison: Vision-Language Captioning using PyTorch vs ONNX with performance benchmarking | | Experimental |
| 54 | iVishalr/Scene-Describer: Video Timestamp recommendation using Transfer Learning and NLP | | Experimental |
| 55 | vivek-kumar9/Labelly--Image-Labelling-app-using-CNN-and-LSTMs: Image captioning application using a CNN–LSTM encoder–decoder architecture... | | Experimental |
| 56 | VaiBhaVSinGh91/ImageCaption: This repository contains an implementation of an image captioning model that... | | Experimental |
| 57 | harshwalia36/Audio-Description-of-Image-for-visually-impaired-person: Mini Project for Btech which helps the visually impaired person to get the... | | Experimental |
| 58 | jaychampaneri14/image-captioning: CNN-LSTM image captioning with attention mechanism | | Experimental |
| 59 | paazmaya/sesoko: Prepare and caption images for using them as training data | | Experimental |
| 60 | willyfh/msvd-indonesian: MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian... | | Experimental |
| 61 | kr1shnasomani/CaptionCraft: Image Captioner using DenseNet201 and LSTM | | Experimental |
| 62 | ehsan-torabi/Draw2Matrix: Draw2Matrix: Draw sketches and instantly convert them into exportable... | | Experimental |
| 63 | shantanudwvd/Instagram-Caption-Generator: AI-powered Instagram caption generator using GPT-4 Vision, Spotify... | | Experimental |
| 64 | omkar87796-sudo/VisionBrief-AI-Intelligent-Image-to-Text-Summary-Web-Application: An AI-powered web application that generates intelligent text summaries from... | | Experimental |
| 65 | nicolafan/neural-artwork-caption-generator: Code for the paper "Exploring the Synergy Between Vision-Language... | | Experimental |
| 66 | yashwanthreddytangella-alt/image-captioning-attention: Image captioning (ResNet encoder + attention LSTM): data prep, training,... | | Experimental |
| 67 | Aniket10singh16/Image2Description: A research prototype for a graph-based image captioning system using object... | | Experimental |
| 68 | jatin-35asd/image-captioning-generator-app: AI-powered Image Caption Generator web application using CNN–LSTM... | | Experimental |
| 69 | theSohamTUmbare/CAPbot: My Discord bot that generates captions for images | | Experimental |
| 70 | Smile040501/image_captioning: Generates textual description of any given image. Use both Natural Language... | | Experimental |
| 71 | allanninal/image-captioning-app: A full-stack AI-powered image captioning app built with ReactJS (using Vite)... | | Experimental |