TheoCoombes/crawlingathome
A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
No commits in the last 6 months.
Stars
32
Forks
7
Language
Python
License
MIT
Category
Last pushed
Mar 21, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/TheoCoombes/crawlingathome"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M...
devrimcavusoglu/pybboxes
Light weight toolkit for bounding boxes providing conversion between bounding box types and...
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
PyRetri/PyRetri
Open source deep learning based unsupervised image retrieval toolbox built on PyTorch🔥
Particle1904/DatasetHelpers
Dataset Helper program to automatically select, re scale and tag Datasets (composed of image and...