Multimodal Image Search Vector Databases

Tools for semantic image retrieval using multimodal embeddings (text-to-image, image-to-image, or video search). Includes CLIP-based systems, vision transformers, and cross-modal ranking. Does NOT include general image classification, object detection, or single-modality text/vector search without image integration.

There are 42 multimodal image search tools tracked. The highest-rated is soulteary/simple-image-search-engine at 47/100 with 151 stars.

Get all 42 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=vector-db&subcategory=multimodal-image-search&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Tool	Score	Tier	Stars	Language
1	soulteary/simple-image-search-engine 图片搜索引擎，很简单。三步构建属于你自己的图片搜索引擎，掌握向量数据库和以图搜图、文本搜索图片。	47	Emerging	151	Python
2	shotit/shotit Shotit is a screenshot-to-video search engine tailored for TV & Film,...	40	Emerging	20	—
3	ob-labs/image-search Image search application built with the vector capabilities of OceanBase	40	Emerging	2	Python
4	sourav4243/sift-video Semantic video search system that indexes audio and visual content to enable...	37	Emerging	3	Rust
5	shotit/shotit-api The ultimate brain of Shotit, in charge of task coordination.	36	Emerging	5	JavaScript
6	KarunyaChavan/Semantixel-Semantic_Image_Retrieval Semantic Image Retrieval is a lightweight web-based platform that enables...	36	Emerging	3	Python
7	EricRollei/Semantic-Search A powerful two-stage multimodal retrieval pipeline for ComfyUI, enabling...	33	Emerging	2	Python
8	Aaryan2304/visual-search-engine An AI-powered visual search engine that finds visually similar fashion items...	29	Experimental	2	Python
9	AchrefHemissi/FoundIT-Computer-Vision-Powered-Lost-and-Found-Mobile-Application The LostFound system is designed to facilitate the recovery of lost items...	24	Experimental	1	—
10	akashAD98/Car_ai_multimodal_search A multimodal car search engine powered by LanceDB vector database that...	24	Experimental	1	Python
11	weaviate-tutorials/next-multimodal-search-demo a Weaviate multimodal search demo	23	Experimental	9	TypeScript
12	sachink1729/intelligentgallery Intelligent Image Gallery with Uploads, Deduplication, and Text-Based Search...	23	Experimental	10	Python
13	santi1602/AnyCam2Ros 📷 Transform any camera into ROS2 image topics for seamless integration with...	23	Experimental	1	Python
14	shotit/shotit-media Media broker for serving video preview for shotit	21	Experimental	2	JavaScript
15	JimmyHernandez503/oceano Sistema de reconocimiento facial con InsightFace y Qdrant - 100% confiable	20	Experimental	1	Python
16	jacobmarks/reverse-image-search-plugin Find the images in your dataset most similar to a query image from URL or...	20	Experimental	14	TypeScript
17	Abhics8/Lumina-AI AI-powered visual commerce engine with semantic fashion search using OWLv2,...	19	Experimental	—	TypeScript
18	bauerem/semantic-text2image-search This repo implements a simple terminal-based semantic image search.	17	Experimental	5	Python
19	navneet83/multimodal-mountain-peak-search Identify mountain peaks in your photos using AI—zero-shot retrieval,...	17	Experimental	2	Python
20	shotit/shotit-frontend The frontend of shotit, with full documentation.	17	Experimental	2	TypeScript
21	laxmanclo/pany PostgreSQL-native semantic search engine with multi-modal capabilities. Add...	17	Experimental	19	Python
22	redswimmer/trail-camera-search Multimodal vector search of images and videos taken from trail cameras. ...	16	Experimental	4	Jupyter Notebook
23	dschechter27875/clip_image_text_search Multimodal semantic image search using CLIP embeddings and natural language queries.	15	Experimental	1	Python
24	IlyasFardaouix/VisualIndexer Multimodal visual search engine using CLIP, OCR, and vector similarity retrieval.	15	Experimental	1	Python
25	MustafaAbbasi98/brand-video-logo-detection An application for semi-automated logo detection in brand advertisement...	15	Experimental	—	Python
26	BrandWill-ML-DS-DE/clip-faiss-product-search End-to-end vision–language search system using CLIP + FAISS (HNSW/IVF) for...	14	Experimental	—	Python
27	hareshanmuhan/semantic-search Search 1M+ images/videos with natural language — OpenAI CLIP + FAISS +...	14	Experimental	—	Python
28	ejber-ozkan/local-llm-photo-scanner A privacy-first, self-hosted photo manager powered by local LLMs (Ollama)...	14	Experimental	—	TypeScript
29	suraj95/Whatsapp-Reel-Knowledge-Base A small AI project that extract frames from an Instagram video to generate a...	14	Experimental	—	Python
30	Aniket-16-S/Semantic_Video_Search An AI powered Video Serach Engine with google's SigLIP and FAISS. It allows...	14	Experimental	3	Python
31	EsraaMadi/similarity-search-weaviate Text/Image search for similar products	13	Experimental	11	Python
32	aritro1011/QID (Query Images by Description)- A simple pipeline to convert images to...	12	Experimental	1	Python
33	Sakshi3027/semantic-video-search Production-grade semantic video search engine - search across video content...	12	Experimental	1	Python
34	oguzhantasimaz/image-similarity-search Image Similarity Search with CLIP and Upstash Vector	12	Experimental	3	Python
35	mahadev0811/Text2ImageDescription Text2ImageDescription retrieves relevant images from Pascal VOC 2012 dataset...	12	Experimental	3	Jupyter Notebook
36	GGCIRILLO/IR-Image-Classification-System From CNN Embeddings to Vector Search: A Deep Learning Pipeline for Thermal...	11	Experimental	—	Python
37	tyasemin/Data-Feature-Extraction-and-Retrieval-Pipeline Project DART. Similarity search, SAM, CLIP, and more	11	Experimental	—	Python
38	777reet/PhotoDiaries Modern web photobooth with AI-powered image similarity search. Built with...	11	Experimental	—	CSS
39	vaibhavhonakere/ClipQuest Find exact moments in uploaded videos using natural-language search + timestamps.	11	Experimental	—	JavaScript
40	ecmoce/ask-gallery Ask Gallery — Semantic photo search system powered by VLM, CLIP, and vector search	11	Experimental	—	Python
41	anantha119/Vector-Based-Image-Retrieval-System This project leverages Vision Transformers (ViT) to build a scalable image...	11	Experimental	—	Python
42	sefaburakokcu/semantic-image-search Search for images using text and images using Milvus and OpenAI-Clip.	10	Experimental	1	Python

Comparisons in this category

shotit and shotit-api (40 vs 36) shotit and shotit-media (40 vs 21) shotit-api and shotit-media (36 vs 21)