jina-ai/clip-as-service
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Provides runtime flexibility with TensorRT, ONNX, and PyTorch backends for different performance-latency tradeoffs, supporting up to 800 QPS. Implements duplex streaming with non-blocking I/O and automatic load balancing across multiple CLIP models on a single GPU. Integrates natively with Jina and DocArray ecosystems via gRPC, HTTP, and WebSocket protocols, enabling rapid deployment of cross-modal neural search applications.
12,825 stars. No commits in the last 6 months. Available on PyPI.
Stars
12,825
Forks
2,077
Language
Python
License
—
Category
Last pushed
Jan 23, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/jina-ai/clip-as-service"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.