kevinschaich/billboard
🎤 Lyrics/associated NLP data for Billboard's Top 100, 1950-2015.
Combines web scraping from Wikia Lyrics, MusicBrainz, and Billboard charts with NLP analysis—using NLTK's VADER for sentiment scoring, textstat for readability metrics (Flesch-Kincaid, Gunning-Fog), and custom repetition analysis. Data is aggregated into a year-indexed JSON structure (~2MB for ~5000 songs) enabling real-time client-side filtering and visualization via underscore.js, with ~80-90% lyric coverage across the dataset.
No commits in the last 6 months.
Stars
91
Forks
50
Language
JavaScript
License
MIT
Category
Last pushed
Apr 26, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/kevinschaich/billboard"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
siddgood/billboard-hit-prediction
:notes: Predicting Billboard's Year-End Hot 100 Songs using audio features from Spotify and...
rileynwong/spotify-analysis
Data analysis on my monthly playlists
mohammadaminabbasi/Darkflow-NLP-Persian-Music-Recommendation-on-Lyrics
unsupervised process of identifying topics in a set of persian songs by LDA method of topic modelling
acheamponge/VERSUZ
A Hiphop v. Literature project to demonstrate using NLP that Hip-Hop is a form of literature and...
tashapiro/predicting-song-music-genre
What part of a song is better at determining it's music genre - the music (audio features) or...