mims-harvard/ClinVec

ClinVec: Unified Embeddings of Clinical Codes Enable Knowledge-Grounded AI in Medicine

43
/ 100
Emerging

Generates 153,166 clinical code embeddings by training a graph transformer (HGT) on ClinGraph, a unified knowledge graph integrating 8 EHR vocabularies (ICD-9/10, CPT, RxNorm, LOINC, ATC, PheCode, SNOMED-CT, UMLS). Embeddings are available in multiple formats (DGL, PyTorch Geometric, NetworkX) across vocabulary-specific CSV files, enabling semantic retrieval and downstream tasks like risk scoring and medical QA without patient-level data dependencies.

No Package No Dependents
Maintenance 10 / 25
Adoption 9 / 25
Maturity 9 / 25
Community 15 / 25

How are scores calculated?

Stars

83

Forks

12

Language

Jupyter Notebook

License

MIT

Last pushed

Jan 22, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/mims-harvard/ClinVec"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.