davidsbatista/Annotated-Semantic-Relationships-Datasets
A collections of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
Organizes 15+ datasets across three supervision paradigms: traditional closed-class extraction (7-53 relation types), open information extraction (unbounded relations), and distant supervision approaches. Spans biomedical, encyclopedic, and pharmaceutical domains with Portuguese and English corpora annotated for nominal and entity pair relationships. Provides standardized formats compatible with supervised NLP model training, ranging from 225 Medline abstracts to 10,717+ annotated examples.
704 stars. No commits in the last 6 months.
Stars
704
Forks
132
Language
—
License
—
Category
Last pushed
Jul 07, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/davidsbatista/Annotated-Semantic-Relationships-Datasets"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
MantisAI/nervaluate
Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13
dice-group/gerbil
GERBIL - General Entity annotatoR Benchmark
syuoni/eznlp
Easy Natural Language Processing
OpenJarbas/simple_NER
simple rule based named entity recognition
bltlab/seqscore
SeqScore: Scoring for named entity recognition and other sequence labeling tasks