TF-IDF Text Analysis NLP Tools

Tools and implementations for TF-IDF vectorization, text classification, and document analysis using term frequency-inverse document frequency methods. Does NOT include other embedding techniques (word2vec, BERT), general machine learning frameworks, or domain-specific applications like sentiment analysis or fake news detection.
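For orientation, the weighting these tools build on can be sketched in a few lines. This is a minimal illustration of one common textbook variant (tf = term count / document length, idf = ln(N / df)), not the method of any specific project listed here. The function name `tf_idf` and the whitespace tokenizer are assumptions made for this example, and real libraries often add smoothing or normalization.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document.

    Assumed variant: tf = count / document length, idf = ln(N / df).
    """
    # Naive tokenizer (an assumption): lowercase and split on whitespace.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))

    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


if __name__ == "__main__":
    for doc_weights in tf_idf(["the cat sat", "the dog barked"]):
        print(doc_weights)
```

Under this variant, a term that appears in every document gets a weight of zero, which is why common words drop out of keyword and tag extraction.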

21 TF-IDF text analysis tools are tracked. One scores above 50 (Established tier). The highest-rated is textvec/textvec at 54/100, with 197 stars and 16 monthly downloads.

Get all 21 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=tfidf-text-analysis&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
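The same request can be made from a script. The sketch below uses only the Python standard library and the endpoint and query parameters shown in the curl command. The helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the JSON response is not documented here, so the code returns the parsed payload as-is rather than assuming any field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="nlp", subcategory="tfidf-text-analysis", limit=20):
    """Build the request URL with the same query parameters as the curl call."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch and decode the JSON payload; its structure is not assumed here."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    payload = fetch_projects(build_url())
    print(json.dumps(payload, indent=2))
```

The default `limit=20` mirrors the curl command. Since 21 projects are tracked, a larger value may be needed to retrieve all of them, assuming the API accepts one.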

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | textvec/textvec | Text vectorization tool to outperform TFIDF for classification tasks | 54 | Established |
| 2 | DigitalPebble/behemoth | Behemoth is an open source platform for large scale document analysis based... | 41 | Emerging |
| 3 | cooperability/BMX-bookmark-extractor | Better brain. Knowledge management tool. Stop saving things you'll never... | 37 | Emerging |
| 4 | nasa-jpl-memex/memex-gate | General Architecture for Text Engineering | 36 | Emerging |
| 5 | NISH1001/tag-generator | A simple tool to generate tags for the given text (document) using TF-IDF. | 35 | Emerging |
| 6 | paradite/tf-idf-keyword | 🔎 Get keywords from a piece of text using tf-idf | 27 | Experimental |
| 7 | go-nlp/tfidf | tfidf provides TF-IDF functionality | 25 | Experimental |
| 8 | GhariebML/NLP_Text_Representation_Techniques | A comprehensive notebook demonstrating various text representation... | 22 | Experimental |
| 9 | pemagrg1/Magic-Of-TFIDF | TFIDF being the most basic and simple topic in NLP, there's alot that can be... | 20 | Experimental |
| 10 | aneessaheba/hadoop-news-analytics | Distributed word frequency analysis on 5,000 HuffPost news headlines using... | 15 | Experimental |
| 11 | AsadiAhmad/TF-IDF-Model | Retrieve Information from Text Documents with TF-IDF model and dimention... | 15 | Experimental |
| 12 | wasiahmad/mining_wikipedia | Extract mentions and category taxonomy from Wikipedia | 13 | Experimental |
| 13 | AneeshBose/Semantic-Query-Search-Using-Co-Occurrence-Clustering-in-Word-Graphs | Scikit-learn implementation of co-occurrence word graph based semantic query... | 11 | Experimental |
| 14 | adityabisht02/Research-Paper-Finder-Based-On-Similarity | A fullstack application which can be used to get the most similar research... | 11 | Experimental |
| 15 | krrish-v/mark_importer | Provide a category for all the imported bookmarks, makes easy to manage by... | 11 | Experimental |
| 16 | meanderinghuman/tfidf-news-classifier | 📰 TF-IDF News Classifier: Zero-training, pure-Python tool that uses TF-IDF +... | 11 | Experimental |
| 17 | craigtrim/tfidf-zones | TF IDF Zones | 11 | Experimental |
| 18 | Hasnat-Aarif-Aslam/NLP-Foundation-Tokens-Ngrams-BoW-TF-IDF-TFIDF | Comprehensive guide to text preprocessing and vectorization techniques for... | 11 | Experimental |
| 19 | jiayao99/tfidf-text-classification | A tutorial on using TF-IDF for text classification | 10 | Experimental |
| 20 | karrarkazuya/KTP-java | A simple yet smart search in texts library. it will give you in percent how... | 10 | Experimental |
| 21 | wangyuhsin/tfidf-text-summarization | This repository contains Python scripts for performing TF-IDF (Term... | 10 | Experimental |