Text Analysis Frameworks NLP Tools

R packages and frameworks for general-purpose textual data analysis, mining, and quantitative text processing. Does NOT include task-specific applications (sentiment analysis, NER, topic modeling), API wrappers, or language-specific toolkits.

There are 80 text analysis frameworks tools tracked. 2 score above 50 (established tier). The highest-rated is quanteda/quanteda at 64/100 with 877 stars. 1 of the top 10 are actively maintained.

Get all 80 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=text-analysis-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 quanteda/quanteda

An R package for the Quantitative Analysis of Textual Data

64
Established
2 juliasilge/tidytext

Text mining using tidy tools :sparkles::page_facing_up::sparkles:

52
Established
3 massimoaria/tall

Text Analysis for aLL

46
Emerging
4 keyATM/keyATM

An R package for Keyword Assisted Topic Models

46
Emerging
5 lll-lll-lll-lll/sent-pattern

sent-pattern package categorizes English sentences into one of five basic...

45
Emerging
6 gagolews/stringi

Fast and Portable Character String Processing in R (with the Unicode ICU)

44
Emerging
7 ropensci/gutenbergr

Search, download, and process public domain texts from Project Gutenberg

43
Emerging
8 bnosac/BTM

Biterm Topic Modelling for Short Text with R

40
Emerging
9 irudnyts/openai

An R package-wrapper around OpenAI API

40
Emerging
10 Reilly-ConceptsCognitionLab/ConversationAlign

An R package for analyzing linguistic alignment between partners in...

38
Emerging
11 WZBSocialScienceCenter/tmtoolkit

Text Mining and Topic Modeling Toolkit for Python with parallel processing power

36
Emerging
12 bnosac/pattern.nlp

R package to perform sentiment analysis and Parts of Speech tagging for...

35
Emerging
13 harmonydata/harmonyapi

This is the source code for the Harmony project REST API

35
Emerging
14 SajjadPourali/Surnames

Surnames dispersion around the world which sorted by population

34
Emerging
15 ropensci-review-tools/pkgmatch

Find R packages matching either descriptions or other R packages

34
Emerging
16 juba/rainette

R implementation of the Reinert text clustering method

33
Emerging
17 cschwem2er/stminsights

A Shiny Application for Inspecting Structural Topic Models

33
Emerging
18 khushmeeet/potter-nlp

Data Analysis of harry potter text

32
Emerging
19 harmonydata/harmony_r

R library for Harmony. R package - open source tool using AI for psychology...

32
Emerging
20 moj-analytical-services/pq-tool

Tool to analyse past parliamentary questions with visualisation in RShiny

31
Emerging
21 storieswithsiva/Text-Mining-in-R

📷 Generates Text Analytics using Bag of Words. Upload your data and it will...

30
Emerging
22 chris31415926535/tardis

tardis: Text Analysis with Rules and Dictionaries for Inferring Sentiment

29
Experimental
23 jaeyk/tidyethnicnews

R package for turning Ethnic NewsWatch search results into tidyverse-ready dataframes

29
Experimental
24 quanteda/spacyr

R wrapper to spaCy NLP

29
Experimental
25 nhsx/stm-survey-text

Structural Topic modelling Code to gain insights from free text responses to...

28
Experimental
26 agusnieto77/TalleR

Taller sobre R para el monitoreo y análisis de la conflictividad social....

28
Experimental
27 news-r/nltk4r

Natural Language Toolkit for R

28
Experimental
28 news-r/gensimr

📝 Topic Modeling for Humans

27
Experimental
29 ropensci-archive/monkeylearn

:no_entry: ARCHIVED :no_entry: Accesses the Monkeylearn API for Text...

27
Experimental
30 Chris1221/goldi

R Package for Gene Ontology Label Discernment and Identification.

26
Experimental
31 ben-aaron188/snlp

2-day course on Statistical Natural Language Processing in R (foundational level)

26
Experimental
32 jaytimm/textpress

A rugged, practical R toolkit for web scraping, stepwise NLP, and...

25
Experimental
33 ropensci-archive/geoparser

:no_entry: ARCHIVED :no_entry:

25
Experimental
34 mbodke/Twitter-Sentiment-Analysis-using-R-shiny

Project based on text mining:

25
Experimental
35 ErikaJacobs/Harry-Potter-Text-Mining

NLP text analysis of the Harry Potter book series

24
Experimental
36 EmilHvitfeldt/R-text-data

List of textual data sources to be used for text mining in R

24
Experimental
37 ratsgo/models

NLP models

24
Experimental
38 trajceskijovan/Structural-Topic-Modeling-in-R

Structural Topic Modeling in R (published two articles on Medium). STM, LDA,...

23
Experimental
39 evanm31/twittR

R Shiny app for tweet analysis

23
Experimental
40 rivaquiroga/analisis-de-textos-r-2023

Materiales del módulo sobre análisis de texto con R del Diplomado en...

23
Experimental
41 abuchmueller/Twitmo

Collect Twitter data and create topic models with R

23
Experimental
42 drdataSpp/nlp_r

This repository contains the NLP and Text Mining R script and the generated...

22
Experimental
43 news-r/textanalysis

Text Analysis in R

22
Experimental
44 BrianWeinstein/googlenlp

An Interface to Google's Cloud Natural Language API

22
Experimental
45 news-r/decipher

📇 Simple NLP in R

22
Experimental
46 brandonleekramer/tidyorgs

A tidy package that detects and standardizes organizations in unstructured text data

22
Experimental
47 Tuanpham1994/Text-visualization-uncovering-structure-and-sentiment-analysis

School assignment for text analytics

22
Experimental
48 gesiscss/ptm

Introduction to Natural Language Processing with a special emphasis on the...

21
Experimental
49 uribo/textlintr

✅ Natural Language Linter Tools for 'R Markdown' and R Code 💯

21
Experimental
50 news-r/textblob

📜 TextBlob for R

21
Experimental
51 CityofEdmonton/text_depot

Search and analyze your text data

21
Experimental
52 cran-task-views/NaturalLanguageProcessing

CRAN Task View: Natural Language Processing

20
Experimental
53 mkearney/googleapis

R client for accessing Google Cloud Natural Language APIs

19
Experimental
54 ropenscilabs/tif

Text Interchange Formats

18
Experimental
55 leedongwei/data-scooping

Flyweight data mining with R

18
Experimental
56 PythonCoderUnicorn/HarryPotterBooks

Harry Potter books for Text Analysis

17
Experimental
57 doug-friedman/topicdoc

Topic-Specific Diagnostics for LDA and CTM Topic Models

16
Experimental
58 profservin/datalab

Serie de ejercicios para familiarizarse con R

16
Experimental
59 gagolews/stringx

Drop-in replacements for base R string functions powered by stringi

16
Experimental
60 gesiscss/polmineR-examples

A binderized environment for polmineR and the Corpus Workbench (CWB) as backend

15
Experimental
61 samuelae/associatoR

an R package to work with association data

15
Experimental
62 kevinhuads/rshiny-datascience-viz-lab

Interactive R Shiny application that illustrates core data science topics ...

15
Experimental
63 rubato103/textmining_KCI_RISS

R을 이용한 한국어 학술 논문 데이터(KCI, RISS) 대상의 형태소 분석, N그램 추출, 토픽 모델링 통합 파이프라인

15
Experimental
64 jengzang/villages-ML

Statistical and NLP analysis of 285K+ natural village toponyms in Guangdong...

14
Experimental
65 mkearney/dict

Word-Based Dictionaries for Natural Language

14
Experimental
66 corydonbaylor/getwiki

getwiki provides an R wrapper for wikipedia's API, allowing users to easily...

13
Experimental
67 stevecondylios/dictionaRy

An R interface to the English Dictionary

13
Experimental
68 news-r/stanfordnlp

🎓Stanford NLP in R

13
Experimental
69 tyiannak/pyScholar

Python Library to Analyse and Visualise Google Scholar Metadata

13
Experimental
70 systats/textlearnR

A simple collection of well working NLP models (Keras, H2O, StarSpace) tuned...

12
Experimental
71 AdrianRasoOnGit/langstats

langstats is an R package designed to help language researchers explore the...

12
Experimental
72 pachadotdev/freedomhouse

The goal of freedom is to ease the usage of the Freedom in the World dataset...

11
Experimental
73 kivanc57/quita_mini

Quita Mini is a text analysis tool designed to calculate various linguistic...

11
Experimental
74 SORADATA/shiny-text-mining-app

Cette application shiny vous permet d'analyser vos données textuelles de...

11
Experimental
75 FRBCesab/nlpdevi

:basket: Toolbox for Natural Language Processing

11
Experimental
76 SUPSI-DEASS-CARES/multilingual_thematic_analysis

Modular R pipeline for multilingual survey analysis — translate, embed,...

11
Experimental
77 peterdalle/svensktext-rpackage

svensktext R package for the svensktext data repository

10
Experimental
78 Diverking/HappyDB

Resources generated for the analysis of the HappyDB corpus

10
Experimental
79 felixhoffmnn/gepart

GePart: German Party Classification Model. Student project during the 5th...

10
Experimental
80 retrofit-lab/ashrae-1836-rp-text-mining

Data and code for the paper "What we talk about when we talk about EEMs:...

10
Experimental