sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Organizes 40+ NLP tasks across 15+ languages with curated benchmark datasets and leaderboard results, enabling researchers to quickly identify evaluation metrics and state-of-the-art performance baselines. Each task file documents the evaluation setting, annotated examples, and links to official or community implementations, with results ranked by performance score. The repository uses community-driven contribution guidelines prioritizing published results and multi-paper dataset validation, accessible via companion websites nlpprogress.com and nlpsota.com.
22,981 stars. No commits in the last 6 months.
Stars
22,981
Forks
3,612
Language
Python
License
MIT
Category
Last pushed
Jul 28, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/sebastianruder/NLP-progress"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
nltk/nltk
NLTK Source
stanfordnlp/stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many...
flairNLP/flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
aphp/edsnlp
Modular, fast NLP framework, compatible with Pytorch and spaCy, offering tailored support for...