CLIP Multimodal Search NLP Tools

Tools for searching and retrieving images, videos, or multimodal content using CLIP-based vision-language models and text/image queries. Does NOT include general image captioning, visual question answering without search functionality, or non-CLIP multimodal architectures.

There are 19 clip multimodal search tools tracked. 1 score above 50 (established tier). The highest-rated is ClipsAI/clipsai at 59/100 with 455 stars and 995 monthly downloads.

Get all 19 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=clip-multimodal-search&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 ClipsAI/clipsai

Clips AI is an open-source Python library that automatically converts long...

59
Established
2 ai-forever/ru-clip

CLIP implementation for Russian language

40
Emerging
3 patrickjohncyh/fashion-clip

FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.

36
Emerging
4 Lednik7/CLIP-ONNX

It is a simple library to speed up CLIP inference up to 3x (K80 GPU)

35
Emerging
5 suinleelab/CellCLIP

[NeurIPS 2025] CellCLIP – Learning Perturbation Effects in Cell Painting via...

31
Emerging
6 cene555/ruCLIP-SB

RuCLIP-SB (Russian Contrastive Language–Image Pretraining SWIN-BERT) is a...

28
Experimental
7 GuyARoss/CLIP-video-search

demo natural language video db using CLIP

24
Experimental
8 emerisly/EDIS

Entity-Driven Image Search over Multimodal Web Content (EMNLP 2023)

20
Experimental
9 DeliriumV01D/RuCLIP

Unofficial c++ LibTorch implementation of RuCLIP (Sber AI)

20
Experimental
10 maliha-usui/cross-lingual-clip-memes

Cross-lingual evaluation of CLIP on Japanese vs English memes — revealing...

19
Experimental
11 DevMilk/ImageOps

Reverse Image Based Entity Search Tool

18
Experimental
12 DARK-art108/Image-Search-Using-CLIP-VIT

A powerful image search using CLIP (Contrastive Language-Image Pre-Training)...

18
Experimental
13 sugarandgugu/Text2Image-Retrieval

计算机视觉课程设计-基于Chinese-CLIP的图文检索系统

17
Experimental
14 VietHoang1512/CVPR-track-5

Contrastive learning for natural language-based vehicle retrieval (AICC | CVPR 2021)

14
Experimental
15 krishnaura45/ImageQuant

🤖Combining 🔠NLP and 🧠 Deep Learning for 📦Image-Based Entity Extraction

13
Experimental
16 gangula-karthik/AICU-BIKE-SEARCH

Find Your Stolen Bike Lah! With AICU, We Kena Spot Your Bicycle on Carousell...

12
Experimental
17 saadkh1/clip_dual_encoder

Visual and Vision-Language Representation Pre-Training with Contrastive Learning

11
Experimental
18 KernelA/clip-text-search

Search images by text input with CLIP

10
Experimental
19 pushkarydv/memory-in-images

Search within your images using natural language.

10
Experimental