CLIP Multimodal Search NLP Tools
Tools for searching and retrieving images, videos, or multimodal content using CLIP-based vision-language models and text/image queries. Does NOT include general image captioning, visual question answering without search functionality, or non-CLIP multimodal architectures.
There are 19 clip multimodal search tools tracked. 1 score above 50 (established tier). The highest-rated is ClipsAI/clipsai at 59/100 with 455 stars and 995 monthly downloads.
Get all 19 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=clip-multimodal-search&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
ClipsAI/clipsai
Clips AI is an open-source Python library that automatically converts long... |
|
Established |
| 2 |
ai-forever/ru-clip
CLIP implementation for Russian language |
|
Emerging |
| 3 |
patrickjohncyh/fashion-clip
FashionCLIP is a CLIP-like model fine-tuned for the fashion domain. |
|
Emerging |
| 4 |
Lednik7/CLIP-ONNX
It is a simple library to speed up CLIP inference up to 3x (K80 GPU) |
|
Emerging |
| 5 |
suinleelab/CellCLIP
[NeurIPS 2025] CellCLIP – Learning Perturbation Effects in Cell Painting via... |
|
Emerging |
| 6 |
cene555/ruCLIP-SB
RuCLIP-SB (Russian Contrastive Language–Image Pretraining SWIN-BERT) is a... |
|
Experimental |
| 7 |
GuyARoss/CLIP-video-search
demo natural language video db using CLIP |
|
Experimental |
| 8 |
emerisly/EDIS
Entity-Driven Image Search over Multimodal Web Content (EMNLP 2023) |
|
Experimental |
| 9 |
DeliriumV01D/RuCLIP
Unofficial c++ LibTorch implementation of RuCLIP (Sber AI) |
|
Experimental |
| 10 |
maliha-usui/cross-lingual-clip-memes
Cross-lingual evaluation of CLIP on Japanese vs English memes — revealing... |
|
Experimental |
| 11 |
DevMilk/ImageOps
Reverse Image Based Entity Search Tool |
|
Experimental |
| 12 |
DARK-art108/Image-Search-Using-CLIP-VIT
A powerful image search using CLIP (Contrastive Language-Image Pre-Training)... |
|
Experimental |
| 13 |
sugarandgugu/Text2Image-Retrieval
计算机视觉课程设计-基于Chinese-CLIP的图文检索系统 |
|
Experimental |
| 14 |
VietHoang1512/CVPR-track-5
Contrastive learning for natural language-based vehicle retrieval (AICC | CVPR 2021) |
|
Experimental |
| 15 |
krishnaura45/ImageQuant
🤖Combining 🔠NLP and 🧠 Deep Learning for 📦Image-Based Entity Extraction |
|
Experimental |
| 16 |
gangula-karthik/AICU-BIKE-SEARCH
Find Your Stolen Bike Lah! With AICU, We Kena Spot Your Bicycle on Carousell... |
|
Experimental |
| 17 |
saadkh1/clip_dual_encoder
Visual and Vision-Language Representation Pre-Training with Contrastive Learning |
|
Experimental |
| 18 |
KernelA/clip-text-search
Search images by text input with CLIP |
|
Experimental |
| 19 |
pushkarydv/memory-in-images
Search within your images using natural language. |
|
Experimental |