jonathandunn/text_analytics
Basic text analytics and natural language processing in Python
Provides unified APIs for feature extraction (style n-grams, sentiment, TF-IDF content, syntactic constructions), supervised classification via SVM and MLPs, and unsupervised methods including LDA topic modeling, Word2Vec embeddings, and document clustering. Built as a pedagogical toolkit accompanying free edX courses, it combines traditional linguistic features with neural approaches in a single TextAnalytics class that handles preprocessing, phrase detection via PMI and Delta P association metrics, and corpus comparison operations.
121 stars. No commits in the last 6 months.
Stars
121
Forks
54
Language
Python
License
GPL-3.0
Category
Last pushed
Apr 09, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jonathandunn/text_analytics"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
dipanjanS/text-analytics-with-python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment...
umer7/Applied-Text-Mining-in-Python
Repo for Applied Text Mining in Python (coursera) by University of Michigan
fingeredman/teanaps
자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Clarifai/clarifai-pyspark
Interfaces for Unstructured data and ML pipelines with Databricks and Clarifai
zaratsian/Spark
Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References