yisaienkov/tinysets

The project aims to collect various datasets for tasks such as classification, clustering, object detection... The purpose of this datasets is quick checking models and algorithms performance.

/ 100

Emerging

No commits in the last 6 months. Available on PyPI.

Stale 6m No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 18 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Category

nlp-dataset-collections

Last pushed

Sep 15, 2020

Monthly downloads

Commits (30d)

GitHub PyPI

NLP Dataset Collections · 93 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yisaienkov/tinysets"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

acl-org/acl-anthology

Data and software for building the ACL Anthology.

anoopkunchukuttan/indic_nlp_library

Resources and tools for Indian language Natural Language Processing

CLUEbenchmark/CLUECorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

SudhirGadhvi/open-vernacular-ai-kit

Clean Indian code-mixed text before it reaches your LLM.

Separius/awesome-sentence-embedding

A curated list of pretrained sentence and word embedding models

Explore NLP Tools

All categories Trending NLP directory Insights