Topic Modeling Frameworks

Tools and platforms for extracting, clustering, and analyzing topics from text documents and data streams using techniques like LDA, TF-IDF, and entity-based clustering. Does NOT include general NLP libraries, text preprocessing utilities, or document normalization without topic extraction focus.

There are 20 topic modeling frameworks tracked. 1 score above 70 (verified tier). The highest-rated is spring-projects/spring-ai at 76/100 with 8,149 stars. 1 of the top 10 are actively maintained.

Get all 20 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=topic-modeling-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 spring-projects/spring-ai

An Application Framework for AI Engineering

76
Verified
2 primaryobjects/lda

LDA topic modeling for node.js

62
Established
3 qminer/qminer

Analytic platform for real-time large-scale streams containing structured...

61
Established
4 lfoppiano/grobid-quantities

GROBID extension for identifying and normalizing physical quantities.

47
Emerging
5 rosette-api/nodejs

Babel Street Analytics Client Library for Node.js

40
Emerging
6 metehan777/entity-topic-cluster

100% client-side entity-based topic clustering tool. ML models run in your...

36
Emerging
7 EhsanMashhadi/MSR2021-ProgramRepair

Code of our paper Applying CodeBERT for Automated Program Repair of Java...

35
Emerging
8 xuludev/System

Internet hot topic detection and tracking system

27
Experimental
9 teavuihuang/edge-ml-glove-nlp

This high-speed, low-footprint NLP (Natural Language Processing) EDGE-ML...

27
Experimental
10 pratikpc/Reactive-Miner

Reactive Miner is a Data Mining PWA tool which uses React

26
Experimental
11 karimould/typescript-text-classification

Text classification using a neuronal network and TypeScript.

26
Experimental
12 ChristianMurphy/gutenberg-book-normalize

Normalize project Gutenberg books to a format easier for statistical models...

24
Experimental
13 numbworks/NW.NGramTextClassification

NW.NGramTextClassification is a library to perform text classification tasks...

22
Experimental
14 divyamohan1993/nlu-bot-trainer

Enterprise-grade NLU bot trainer with 5-classifier stacking ensemble (171K...

22
Experimental
15 oEmanuelFirmino/vector-search-with-cos-similarity

A system that processes document similarity searches using a backend...

21
Experimental
16 npatta01/superuser-topic-modeling

SuperUser forum topic modeling

16
Experimental
17 MoritzGoeckel/TextClassifier

🔎📄 Text classifier based on vocabulary analysis

15
Experimental
18 Infominer-JSI/infominer-ui

The Web UI of the Infominer tool

12
Experimental
19 Infominer-JSI/infominer-js

The main component of the (semi-)automatic data exploration and topic...

11
Experimental
20 FrederickRoman/fasttextAPI

Unofficial minified fastetext API. Use it to run NLP DL models that require...

11
Experimental