Multimodal Search Engines ML Frameworks

Tools and applications for searching across image and text modalities using vision-language models like CLIP. Includes text-to-image search, image-to-image search, and video content search. Does NOT include general recommendation systems, dataset creation/filtering tools, or single-modality search applications.

There are 40 multimodal search engines frameworks tracked. 1 score above 70 (verified tier). The highest-rated is rom1504/img2dataset at 71/100 with 4,380 stars and 88,786 monthly downloads.

Get all 40 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=multimodal-search-engines&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 rom1504/img2dataset

Easily turn large sets of image urls to an image dataset. Can download,...

71
Verified
2 devrimcavusoglu/pybboxes

Light weight toolkit for bounding boxes providing conversion between...

52
Established
3 salesforce/LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

46
Emerging
4 PyRetri/PyRetri

Open source deep learning based unsupervised image retrieval toolbox built...

42
Emerging
5 Particle1904/DatasetHelpers

Dataset Helper program to automatically select, re scale and tag Datasets...

40
Emerging
6 haltakov/natural-language-image-search

Search photos on Unsplash using natural language

38
Emerging
7 haltakov/natural-language-youtube-search

Search inside YouTube videos using natural language

36
Emerging
8 jina-ai/example-multimodal-fashion-search

Input text or image, get back matching image fashion results, using Jina,...

35
Emerging
9 RAHUL-KAD/Reverse-Image-Search-Engine

With the help of this repo you can build image search algorithm on your...

32
Emerging
10 TheoCoombes/crawlingathome

A client library for LAION's effort to filter CommonCrawl with CLIP,...

32
Emerging
11 lucko515/image-search-engine

End-to-end image search engine based on the Deep learning techniques.

31
Emerging
12 masesk/process-google-dataset

Process Google Dataset is a tool to download and process images for neural...

31
Emerging
13 bwconrad/video-content-search

Search the content of a video with a text or image query

30
Emerging
14 huggingface/OBELICS

Code used for the creation of OBELICS, an open, massive and curated...

29
Experimental
15 meanderinghuman/OpenLens

Open-source visual search framework inspired by Google Lens — benchmarked...

29
Experimental
16 zabir-nabil/bangla-image-search

A dead-simple image search / retrieval and image-text matching system for...

29
Experimental
17 TAU-VAILab/Vox-E

This repo contains the python code as well as the webpage html files for the...

28
Experimental
18 Zeeshier/VistAI

VistAI is an AI-powered visual search for e-commerce, enabling users to...

26
Experimental
19 sayannath/Identical-Image-Retrieval

Identical-Image-Retrieval using Deep Learning

25
Experimental
20 Sagykri/NOVA

The official repository for NOVA, a deep learning framework designed for...

24
Experimental
21 thatgeeman/pybx

A simple python module to generate anchor (aka default/prior) boxes for...

24
Experimental
22 snehilhbtu/vectalab

📊 Evaluate image quality and performance with Vectalab's vectorization tools...

23
Experimental
23 masa-57/PIC

Hierarchical image clustering API for product catalog images. Two-level...

23
Experimental
24 Subhasri-Babu/AI-Scene-Safety-Analyzer-Project

AI-powered image safety analyzer using BLIP + LLaMA 3.3 via Groq API

22
Experimental
25 woctezuma/steam-image-search

Search for images on Steam using natural language queries.

21
Experimental
26 Rishabh1925/scene-localization-system

Powerful CLIP-based computer vision system for natural language-driven...

20
Experimental
27 Ivan-Zhou/image-search

Simple Image Search powered by Multimodal Foundation Models (OpenAI Clip and...

20
Experimental
28 O-S-O-K/insight_ai_app

Explainable AI image classification with Grad-CAM visualizations, BLIP...

20
Experimental
29 santoshlite/ByteDetective

The easiest way to search for images on your desktop 🔎

17
Experimental
30 ItzCrazyKns/Dataset-Converter

A Python script for converting URL-based datasets into image datasets.

16
Experimental
31 rizkysaputradev/Vision-Fusion-Real-Time

a real-time retrieval multimodial AI based demo that allows a visual input...

16
Experimental
32 CN-Scars/picture_sherlock

A local image search tool based on pre-trained deep learning models

16
Experimental
33 kyegomez/VisionDatasets

Open source scripts to create large scale datasets with rich detail for...

14
Experimental
34 koushikvikram/multimodal-image-retrieval

📝🔍🖼️ A deep learning application for retrieving images by searching with text.

14
Experimental
35 NavdeepSinghNegi999/DeepVisionIntelligence

🧠 DeepVisionIntelligence — An end-to-end multimodal AI system that...

14
Experimental
36 ajaysawandkar05/spare-part-recognition

Spare part recognition system using CLIP + DINOv2 with hybrid re-ranking...

14
Experimental
37 TunggTungg/image_retrieval

An image retrieval system that utilizes deep learning ResNet for feature...

12
Experimental
38 kaeldrin-gh/image-similarity-search

Image similarity search system using deep learning embeddings and FAISS indexing

11
Experimental
39 RishabThapliyal/Video-Scene-Classification-System

AI-powered video analysis tool with natural language search inside video...

11
Experimental
40 heydido/VisualSearchEngine

This is the methodology I worked on while developing Visual Search Engine...

10
Experimental

Comparisons in this category