PhantomInsights/mexican-government-report

Text Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file to plotting the results.

/ 100

Emerging

Implements a complete ETL pipeline using **PyPDF2** for PDF text extraction with character encoding correction, **spaCy's Spanish NLP model** for tokenization and named entity recognition, and outputs structured CSV datasets for downstream analysis. Performs sentiment analysis on sentences using Kaggle's Spanish lexicon, then visualizes patterns through **matplotlib/seaborn** plots and geographic distributions via **geopandas**.

476 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25

How are scores calculated?

Stars

476

Forks

Language

Python

License

MIT

Related tools

AutoViML/featurewiz_polars

New Polars implementation of the classic featurewiz MRMR algorithm. Created by Ram Seshadri....

gyunggyung/National-Petition

청와대 국민청원 분석으로 국민의 생각 알아보기 📈🔬

stdlib-js/datasets-sotu

State of the Union addresses by U.S. Presidents.

AndreCNF/polids

Analysis of electoral manifestos and output of it through apps.

NLP-UMUTeam/Spanish-PoliCorpus-2020

This dataset contains the code of the paper entitled Predicting Political Ideology from...

Explore NLP Tools

All categories Trending NLP directory Insights