yisaienkov/tinysets
The project aims to collect various datasets for tasks such as classification, clustering, object detection... The purpose of this datasets is quick checking models and algorithms performance.
No commits in the last 6 months. Available on PyPI.
Stars
6
Forks
1
Language
Python
License
MIT
Category
Last pushed
Sep 15, 2020
Monthly downloads
18
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yisaienkov/tinysets"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
acl-org/acl-anthology
Data and software for building the ACL Anthology.
anoopkunchukuttan/indic_nlp_library
Resources and tools for Indian language Natural Language Processing
CLUEbenchmark/CLUECorpus2020
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
SudhirGadhvi/open-vernacular-ai-kit
Clean Indian code-mixed text before it reaches your LLM.
Separius/awesome-sentence-embedding
A curated list of pretrained sentence and word embedding models