uhh-lt/sensegram

Making sense embedding out of word embeddings using graph-based word sense induction

40
/ 100
Emerging

Induces polysemous word senses by clustering ego-networks extracted from word embeddings, then generates sense-specific vectors disambiguated across contexts. Works with pretrained embeddings (word2vec format) or raw text corpora via gensim, using FAISS for similarity graphs and Chinese Whispers for clustering. Outputs sense inventories with probability distributions and supports optional hypernymy labeling for proto-conceptualization resources.

213 stars. No commits in the last 6 months.

No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 22 / 25

How are scores calculated?

Stars

213

Forks

52

Language

Python

License

Last pushed

May 17, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/uhh-lt/sensegram"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.