mims-harvard/ClinVec
ClinVec: Unified Embeddings of Clinical Codes Enable Knowledge-Grounded AI in Medicine
Generates 153,166 clinical code embeddings by training a heterogeneous graph transformer (HGT) on ClinGraph, a unified knowledge graph integrating 8 EHR vocabularies (ICD-9/10, CPT, RxNorm, LOINC, ATC, PheCode, SNOMED-CT, UMLS). The graph is available in multiple formats (DGL, PyTorch Geometric, NetworkX), and the embeddings are distributed as vocabulary-specific CSV files. They support semantic retrieval and downstream tasks such as risk scoring and medical QA without depending on patient-level data.
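As a rough illustration of the semantic retrieval mentioned above, the sketch below ranks codes by cosine similarity to a query code. It assumes a hypothetical CSV layout (a `code` column followed by numeric dimension columns) and uses toy three-dimensional vectors with placeholder code names invented for this example. The actual ClinVec file names, column layout, and dimensionality may differ, so treat the loader as something to adapt rather than as the repository's own interface.

```python
import csv
import io
import math

# Toy stand-in for a vocabulary-specific embedding CSV.
# ASSUMPTION: the layout (a "code" column followed by numeric columns), the
# placeholder code names, and the vectors are invented for illustration only.
TOY_CSV = """code,d0,d1,d2
CODE_A,1.0,0.0,0.0
CODE_B,0.9,0.1,0.0
CODE_C,0.0,1.0,0.0
CODE_D,0.0,0.0,1.0
"""


def load_embeddings(handle):
    """Read rows of (code, vector) from a CSV file-like object into a dict."""
    embeddings = {}
    for row in csv.DictReader(handle):
        code = row.pop("code")
        # Remaining columns are the vector components, in file order.
        embeddings[code] = [float(value) for value in row.values()]
    return embeddings


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def nearest(embeddings, query_code, k=3):
    """Return the k codes most similar to query_code, best match first."""
    query = embeddings[query_code]
    scored = [
        (code, cosine(query, vector))
        for code, vector in embeddings.items()
        if code != query_code
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


embeddings = load_embeddings(io.StringIO(TOY_CSV))
top = nearest(embeddings, "CODE_A", k=2)
print(top)
```

With real files, `io.StringIO(TOY_CSV)` would be replaced by an opened CSV file, and a vectorized library would likely be a better fit than pure Python at the scale of 153,166 codes.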
Stars: 83
Forks: 12
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Jan 22, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/mims-harvard/ClinVec"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
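The same request can be made from Python. The sketch below is a minimal standard-library version of the curl call above. The helper names `build_url` and `fetch` are hypothetical. Since the response format is not described here, the body is returned as raw text rather than parsed, and because the way to supply an API key is not documented on this page, the sketch covers only the keyless case.

```python
import urllib.request
from urllib.parse import quote

# Endpoint prefix taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/embeddings"


def build_url(owner, repo):
    """Build the endpoint URL for a repository given as owner and name."""
    # Percent-encode each path segment so unusual characters stay safe.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch(owner, repo, timeout=10.0):
    """Fetch the listing data and return the raw response body as text.

    ASSUMPTION: the response schema is not documented here, so no parsing
    is attempted; callers can inspect the text and decode it as needed.
    """
    with urllib.request.urlopen(build_url(owner, repo), timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


# Example call (requires network access and counts against the daily limit):
# print(fetch("mims-harvard", "ClinVec"))
```

Keeping URL construction separate from the network call makes it easy to check the request target without spending any of the daily request quota.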
Related tools
NYUMedML/DeepEHR
Chronic Disease Prediction Using Medical Notes
mims-harvard/SHEPHERD
SHEPHERD: Few-shot learning for phenotype-driven diagnosis of patients with rare genetic diseases
biocentral/biocentral_server
Compute functionality for biocentral.
biocentral/biocentral_api
Programmatic access to the biocentral ecosystem.
nomic-ai/contrastors
Train Models Contrastively in PyTorch