CogStack/MedCAT
Medical Concept Annotation Tool
ArchivedPerforms named entity recognition and linking on clinical text using spaCy-based NLP pipelines, extracting medical concepts and automatically mapping them to standardized biomedical ontologies (UMLS, SNOMED-CT). Includes pre-trained model packs optimized for different languages and specialties (Dutch, full UMLS with 4M+ concepts), with support for negation detection and metadata classification via MetaCAT. Integrates with EHR systems and knowledge graphs like Neo4j for downstream analysis.
526 stars. No commits in the last 6 months.
Stars
526
Forks
118
Language
Python
License
Apache-2.0
Category
Last pushed
Jul 25, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/CogStack/MedCAT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
Georgetown-IR-Lab/QuickUMLS
System for Medical Concept Extraction and Linking
apache/ctakes
Apache cTAKES is a Natural Language Processing (NLP) platform for clinical text.
OHNLP/MedTagger
MedTagger is a light weight clinical NLP system built upon Apache UIMA.
CogStack/MedCATtrainer
A simple interface to inspect, improve and add concepts to biomedical NER+L -> MedCAT.
CogStack/MedCATtutorials
General tutorials for the setup and use of MedCAT.