baidu/Familia
A Toolkit for Industrial Topic Modeling
This toolkit helps you understand the main subjects and themes within large collections of text, like news articles, novels, or web pages. You provide various types of text data, and it outputs insights into their core topics, how similar different texts are, and key phrases that define a document. This is useful for data analysts, content strategists, and anyone who needs to quickly make sense of vast amounts of written information.
2,645 stars. No commits in the last 6 months.
Use this if you need to automatically categorize documents, group similar texts, extract keywords, or build recommendation systems based on text content.
Not ideal if you require extremely precise linguistic analysis or fine-grained sentiment extraction beyond general topic identification.
Stars
2,645
Forks
587
Language
C++
License
BSD-3-Clause
Category
Last pushed
Jul 01, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/baidu/Familia"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
MIND-Lab/OCTIS
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models...
i-dot-ai/themefinder
A topic modelling Python package for analysing one-to-many question-answer data.
bobxwu/TopMost
A Topic Modeling System Toolkit (ACL 2024 Demo)
andifunke/topic-labeling
The project proposes a framework to apply topic models on a text-corpus and eventually topic...
bab2min/tomotopy
Python package of Tomoto, the Topic Modeling Tool