md-experiments/picture_text
Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)
Embeds document corpora using SBERT, applies fastcluster-based HAC with configurable linkage methods, then converts hierarchical dendrograms into layered treemap structures with cluster summaries. Provides pluggable interfaces for custom encoders and summarization strategies, enabling users to swap SBERT or the centroid-based summary method while preserving the visualization pipeline. Renders final interactive treemaps via Plotly with tunable depth and cluster-size constraints.
No commits in the last 6 months. Available on PyPI.
Stars
30
Forks
9
Language
Python
License
Apache-2.0
Category
Last pushed
Dec 31, 2024
Monthly downloads
17
Commits (30d)
0
Dependencies
8
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/md-experiments/picture_text"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
TorchDR/TorchDR
TorchDR - PyTorch Dimensionality Reduction
derrickburns/generalized-kmeans-clustering
Production-ready K-Means clustering for Apache Spark with pluggable Bregman divergences (KL,...
abhilash1910/ClusterTransformer
Topic clustering library built on Transformer embeddings and cosine similarity...
nlpub/watset-java
An implementation of the Watset clustering algorithm in Java.
mainlp/semantic_components
Finding semantic components in your neural representations.