A-SHOJAEI/preference-guided-image-captioning-alignment

A novel multimodal alignment system that combines image-caption contrastive learning with human preference optimization. By training a vision-language model on Conceptual Captions and then fine-tuning caption generation using UltraFeedback-style preference learning, we create captions that are not just accurate but aligned with human preferences fo

/ 100

Experimental

No Package · No Dependents

Maintenance 10 / 25

Adoption 0 / 25

Maturity 9 / 25

Community 0 / 25

Stars

—

Forks

—

Language

Python

License

MIT

Category

image-captioning

Last pushed

Feb 21, 2026

Commits (30d)

Image Captioning · 29 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/A-SHOJAEI/preference-guided-image-captioning-alignment"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
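
The same request can be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the assumption that the endpoint returns a JSON body is not confirmed by this page; the URL layout is copied from the curl example above.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example; everything after it is
# domain / owner / repository.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(domain: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL in the same shape as the curl example."""
    # Percent-encode each path segment so unusual characters stay in one segment.
    parts = [urllib.parse.quote(p, safe="") for p in (domain, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(domain: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body.

    Assumption: the response is JSON. The page does not document the
    response schema, so the decoded value is returned unmodified.
    """
    url = build_quality_url(domain, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to perform the request.
    print(
        build_quality_url(
            "generative-ai",
            "A-SHOJAEI",
            "preference-guided-image-captioning-alignment",
        )
    )
```

Because the keyless tier is limited to 100 requests/day, it may be worth caching the decoded response locally rather than calling `fetch_quality` repeatedly.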

Higher-rated alternatives

stevan-milovanovic/LiteRT-for-Android

Image Classification, Image Captioning and LLM inference with LiteRT

floydhub/pix2code-template

Build a neural network to code a basic HTML and CSS website based on a picture of a design mockup.

ekkonwork/qwen3-vl-autotagger-cli

Standalone CLI for Qwen3-VL auto-tagging with optional XMP embedding.

ABX9801/Image-Caption-Generator

A web app to generate captions for images. A VGG-16 model is used to encode the images and...

regiellis/ecko-cli

ecko-cli is a simple CLI tool that streamlines the process of processing images in a directory,...
