murray-z/text_analysis_tools
中文文本分析工具包(包括- 文本分类 - 文本聚类 - 文本相似性 - 关键词抽取 - 关键短语抽取 - 情感分析 - 文本纠错 - 文本摘要 - 主题关键词-同义词、近义词-事件三元组抽取)
Leverages pre-trained word embeddings for synonym and semantic similarity tasks, requiring users to supply their own embeddings. Provides a modular API structure with included test datasets and executable examples for rapid prototyping. Combines classical NLP techniques (keyword extraction, text clustering) with neural approaches for tasks like text correction and event triple extraction from Chinese text.
733 stars. No commits in the last 6 months.
Stars
733
Forks
133
Language
Python
License
Apache-2.0
Category
Last pushed
Oct 03, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/murray-z/text_analysis_tools"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
chatopera/Synonyms
:herb: 中文近义词:聊天机器人,智能问答工具包
isaacus-dev/semchunk
A fast, lightweight and easy-to-use Python library for splitting text into semantically...
goodmami/wn
A modern, interlingual wordnet interface for Python
CUNY-CL/wikipron
Massively multilingual pronunciation mining
UCREL/pymusas
Python Multilingual Ucrel Semantic Analysis System