yongzhuo/nlp_xiaojiang

自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子主干提取(mainpart),中文汉语短文本相似度,文本特征工程,keras-http-service调用

63
/ 100
Established

Implements multiple data augmentation strategies (back-translation, EDA, HMM-Markov, syntax-based generation) and hybrid retrieval chatbots combining fuzzy matching with BERT/XLNet sentence encodings for semantic similarity. Core NLP tasks use Keras-based architectures including BERT+BiLSTM+CRF for NER and multi-headed CNN/RNN variants (TextCNN, R-CNN, AVT-CNN) for text classification, leveraging pre-trained Chinese models from Google and Harbin Institute of Technology.

1,539 stars and 24 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m
Maintenance 0 / 25
Adoption 13 / 25
Maturity 25 / 25
Community 25 / 25

How are scores calculated?

Stars

1,539

Forks

391

Language

Python

License

MIT

Last pushed

Sep 23, 2021

Monthly downloads

24

Commits (30d)

0

Dependencies

18

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yongzhuo/nlp_xiaojiang"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.