sian0x0/Roud-Song-Clusters
Lyrics clustering
This project helps folk song researchers and archivists organize vast collections of English-language folk song lyrics. You input various song lyrics, and it groups together different versions of the same song based on lyrical similarities. The output is clusters of lyrics, each representing a distinct folk song, identified by a Roud Folk Song Index number.
No commits in the last 6 months.
Use this if you are a folk song archivist or researcher dealing with large, unindexed collections of English-language folk song lyrics and need an automated way to identify and group different versions of the same song.
Not ideal if you are working with non-English folk songs or require extremely precise, human-expert-level judgment for very nuanced lyrical distinctions without any machine assistance.
Stars
4
Forks
—
Language
Jupyter Notebook
License
—
Category
Last pushed
Oct 27, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/sian0x0/Roud-Song-Clusters"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TorchDR/TorchDR
TorchDR - PyTorch Dimensionality Reduction
derrickburns/generalized-kmeans-clustering
Production-ready K-Means clustering for Apache Spark with pluggable Bregman divergences (KL,...
abhilash1910/ClusterTransformer
Topic clustering library built on Transformer embeddings and cosine similarity...
md-experiments/picture_text
Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)
mainlp/semantic_components
Finding semantic components in your neural representations.