clips/pattern

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

50
/ 100
Established

Bundles multilingual NLP components (Brill taggers for English, Dutch, German, Spanish, French, Italian) alongside machine learning classifiers (KNN, SVM via LIBSVM/LIBLINEAR) and integrates with public web APIs (Google, Twitter, Wikipedia) for direct data acquisition. Implements a vector-space pipeline that chains HTML parsing, POS tagging, and feature extraction for end-to-end text classification workflows, with graph analysis built on NetworkX for network visualization.

8,856 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 24 / 25

How are scores calculated?

Stars

8,856

Forks

1,570

Language

Python

License

BSD-3-Clause

Last pushed

Jun 10, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/clips/pattern"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.