chirindaopensource/measuring_corruption_from_text_data
End-to-end Python implementation of Muço’s (2025) corruption measurement framework. Combines an NLP pipeline (regex extraction, Porter stemming, TF-IDF), PCA-based dimensionality reduction, and fixed-effects OLS to quantify institutional quality from Brazilian audit reports. Includes supervised-learning robustness checks and leave-one-out (LOO) sensitivity analysis.
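To give a rough feel for the text-to-index part of such a pipeline, here is a minimal standard-library sketch of regex tokenization, TF-IDF weighting, and a first-principal-component score per document. It is not the repository's code: the function names (`tokenize`, `tfidf_matrix`, `first_principal_component`) and the four toy documents are invented for illustration, the smoothed IDF formula is one common variant chosen as an assumption, and Porter stemming and the fixed-effects regression are left out.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and pull out alphabetic tokens with a regex (no stemming)."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_matrix(docs):
    """Return (vocab, rows), where rows[i][j] is the TF-IDF weight of vocab[j] in docs[i]."""
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({t for toks in tokenized for t in toks})
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(t for toks in tokenized for t in set(toks))
    rows = []
    for toks in tokenized:
        counts = Counter(toks)
        total = len(toks) or 1
        # Assumed weighting: relative term frequency times smoothed IDF.
        rows.append([
            (counts[t] / total) * (math.log((1 + n_docs) / (1 + df[t])) + 1)
            for t in vocab
        ])
    return vocab, rows


def first_principal_component(rows, iterations=200):
    """Project each row onto the first principal axis, found by power iteration."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    centered = [[r[j] - means[j] for j in range(p)] for r in rows]
    v = [1.0 / math.sqrt(p)] * p  # deterministic starting direction
    for _ in range(iterations):
        proj = [sum(c[j] * v[j] for j in range(p)) for c in centered]  # X v
        w = [sum(centered[i][j] * proj[i] for i in range(n)) for j in range(p)]  # X^T X v
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0:  # all documents identical: no variance to explain
            break
        v = [x / norm for x in w]
    return [sum(c[j] * v[j] for j in range(p)) for c in centered]


# Toy documents, made up for this sketch (not taken from any audit report).
docs = [
    "irregular procurement overpriced contract fraud",
    "fraud procurement diverted funds",
    "school lunch program delivered on schedule",
    "health clinic supplies delivered on schedule",
]
vocab, rows = tfidf_matrix(docs)
scores = first_principal_component(rows)
```

The sign of a principal component is arbitrary, so only the relative positions of the scores carry information; in this toy input the two irregularity-themed documents fall on one side of zero and the two service-delivery documents on the other.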
Stars
1
Forks
—
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Dec 14, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/chirindaopensource/measuring_corruption_from_text_data"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
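The same request can be sketched in Python with only the standard library. The `category/owner/repo` path layout is inferred from the single URL shown above, and a JSON response body is an assumption; `quality_url` and `fetch_quality` are hypothetical helper names, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL; the path layout is inferred from the curl example."""
    return f"{BASE_URL}/{quote(category)}/{quote(owner)}/{quote(repo)}"


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the record and decode it, assuming the service returns JSON."""
    with urllib.request.urlopen(quality_url(category, owner, repo), timeout=timeout) as resp:
        return json.load(resp)


url = quality_url("nlp", "chirindaopensource", "measuring_corruption_from_text_data")
```

Calling `fetch_quality("nlp", "chirindaopensource", "measuring_corruption_from_text_data")` would issue the request; it counts against the daily quota, so caching the result locally is a sensible default.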
Higher-rated alternatives
PhantomInsights/mexican-government-report
Text Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file...
AutoViML/featurewiz_polars
New Polars implementation of the classic featurewiz MRMR algorithm. Created by Ram Seshadri....
gyunggyung/National-Petition
Finding out what the public thinks by analyzing Blue House national petitions 📈🔬
stdlib-js/datasets-sotu
State of the Union addresses by U.S. Presidents.
AndreCNF/polids
Analysis of electoral manifestos, with the results presented through apps.