baidu/Familia

A Toolkit for Industrial Topic Modeling

51
/ 100
Established

This toolkit helps you understand the main subjects and themes within large collections of text, like news articles, novels, or web pages. You provide various types of text data, and it outputs insights into their core topics, how similar different texts are, and key phrases that define a document. This is useful for data analysts, content strategists, and anyone who needs to quickly make sense of vast amounts of written information.

2,645 stars. No commits in the last 6 months.

Use this if you need to automatically categorize documents, group similar texts, extract keywords, or build recommendation systems based on text content.

Not ideal if you require extremely precise linguistic analysis or fine-grained sentiment extraction beyond general topic identification.

text-analysis content-categorization information-retrieval document-similarity knowledge-discovery
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

How are scores calculated?

Stars

2,645

Forks

587

Language

C++

License

BSD-3-Clause

Last pushed

Jul 01, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/baidu/Familia"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.