yongzhuo/nlp_xiaojiang

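Natural language processing (NLP): the Xiaojiang robot (a retrieval-based chit-chat chatbot), BERT sentence vectors and similarity (Sentence Similarity), XLNet sentence vectors and similarity (text xlnet embedding), text classification, entity extraction (NER, BERT+BiLSTM+CRF), data augmentation (text augment, data enhance), synonymous sentence and synonym generation, sentence main-part extraction (mainpart), Chinese short-text similarity, text feature engineering, and model calls through keras-http-service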

/ 100

Established

Implements multiple data augmentation strategies (back-translation, EDA, HMM-Markov, syntax-based generation) and hybrid retrieval chatbots combining fuzzy matching with BERT/XLNet sentence encodings for semantic similarity. Core NLP tasks use Keras-based architectures including BERT+BiLSTM+CRF for NER and multi-headed CNN/RNN variants (TextCNN, R-CNN, AVT-CNN) for text classification, leveraging pre-trained Chinese models from Google and Harbin Institute of Technology.
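
The hybrid retrieval idea described above (a fuzzy string match blended with sentence-vector similarity) can be sketched roughly as follows. This is a minimal illustration, not the repository's code: `char_vector`, `hybrid_score`, `retrieve`, and the weight `alpha` are hypothetical names, and the character-count vector is only a toy stand-in for a real BERT or XLNet sentence encoding.

```python
import math
from collections import Counter
from difflib import SequenceMatcher


def char_vector(text):
    """Toy stand-in for a sentence encoder (assumption): character counts."""
    return Counter(text)


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def hybrid_score(query, candidate, alpha=0.5):
    """Blend a surface-level fuzzy score with a vector similarity score."""
    fuzzy = SequenceMatcher(None, query, candidate).ratio()
    semantic = cosine(char_vector(query), char_vector(candidate))
    return alpha * fuzzy + (1 - alpha) * semantic


def retrieve(query, faq, alpha=0.5):
    """Return the answer whose stored question best matches the query."""
    best_question = max(faq, key=lambda q: hybrid_score(query, q, alpha))
    return faq[best_question]


if __name__ == "__main__":
    faq = {
        "how is the weather today": "It looks sunny.",
        "what is your name": "I am a small retrieval bot.",
    }
    print(retrieve("what's your name", faq))
```

In a real system the toy vector would be replaced by embeddings from a pre-trained model, and `alpha` would be tuned on held-out question pairs.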

1,539 stars and 24 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m

Maintenance 0 / 25

Adoption 13 / 25

Maturity 25 / 25

Community 25 / 25


Stars 1,539

Forks 391

Language Python

License MIT

Related tools

GaoQ1/rasa_nlu_gq

turn natural language into structured data (supports Chinese; N custom models for different scenarios and tasks)

textgain/grasp

Essential NLP & ML, short & fast pure Python code

boat-group/fancy-nlp

NLP for human. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your...

changwookjun/nlp-paper

NLP Paper

NLP-kr/tensorflow-ml-nlp-tf2

Practice materials for "Natural Language Processing Starting with TensorFlow 2 and Machine Learning" (from logistic regression to BERT and GPT3)
