chirindaopensource/measuring_corruption_from_text_data

End-to-End Python implementation of Muço’s (2025) corruption measurement framework. Combines NLP pipeline (regex extraction, Porter stemming, TF-IDF), PCA-based dimensionality reduction, and fixed-effects OLS to quantify institutional quality from Brazilian audit reports. Includes supervised learning robustness checks and LOO sensitivity analysis.

/ 100

Experimental

No Package No Dependents

Maintenance 6 / 25

Adoption 1 / 25

Maturity 9 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Jupyter Notebook

License

MIT

Category

political-discourse-analysis

Last pushed

Dec 14, 2025

Commits (30d)

GitHub

Political Discourse Analysis · 30 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/chirindaopensource/measuring_corruption_from_text_data"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

PhantomInsights/mexican-government-report

Text Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file...

AutoViML/featurewiz_polars

New Polars implementation of the classic featurewiz MRMR algorithm. Created by Ram Seshadri....

gyunggyung/National-Petition

청와대 국민청원 분석으로 국민의 생각 알아보기 📈🔬

stdlib-js/datasets-sotu

State of the Union addresses by U.S. Presidents.

AndreCNF/polids

Analysis of electoral manifestos and output of it through apps.

Explore NLP Tools

All categories Trending NLP directory Insights