Pzhvk/Captioneer

An in-depth comparison of three image captioning models built with PyTorch. This repository contains the full pipeline for preprocessing the Flickr30k dataset, training three distinct architectures (LSTM, LSTM w/ Attention, Transformer w/ DistilBERT), and performing a comprehensive evaluation using BLEU, METEOR, CIDEr, and SPICE.

/ 100

Experimental

No Package No Dependents

Maintenance 6 / 25

Adoption 1 / 25

Maturity 9 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

ntrang086/image_captioning

generate captions for images using a CNN-RNN model that is trained on the Microsoft Common...

fregu856/CS224n_project

Neural Image Captioning in TensorFlow.

vacancy/SceneGraphParser

A python toolkit for parsing captions (in natural language) into scene graphs (as symbolic...

ltguo19/VSUA-Captioning

Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019

Abdelrhman-Yasser/video-content-description

Video content description model for generating descriptions for unconstrained videos

Explore NLP Tools

All categories Trending NLP directory Insights