VirtualRoyalty/spark-nlp-project
Micro project on big data technologies via spark
This project helps data professionals process and understand large volumes of Russian text. It takes raw Russian text data and can classify it into categories, identify named entities like people or organizations, and link them to known information. This is useful for data scientists, NLP engineers, or researchers working with Russian language data at scale.
No commits in the last 6 months.
Use this if you need to perform advanced natural language processing tasks like text classification or entity recognition on extensive datasets of Russian language text.
Not ideal if your primary need is for languages other than Russian, or if you only have small amounts of text data that don't require big data processing.
Stars
4
Forks
—
Language
Jupyter Notebook
License
—
Category
Last pushed
Dec 16, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/VirtualRoyalty/spark-nlp-project"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
dipanjanS/text-analytics-with-python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment...
jonathandunn/text_analytics
Basic text analytics and natural language processing in Python
IBM/watson-document-co-relation
Correlate text content across documents using Watson NLU, Python NLTK and Watson Studio.
Clarifai/clarifai-pyspark
Interfaces for Unstructured data and ML pipelines with Databricks and Clarifai
umer7/Applied-Text-Mining-in-Python
Repo for Applied Text Mining in Python (coursera) by University of Michigan