Text Authorship Analysis ML Frameworks
Tools and models for analyzing written text to identify authorship, detect stylistic patterns, model topics, and classify writing characteristics. Includes LDA, topic modeling, stylometric analysis, and authorship attribution. Does NOT include general NLP, text classification for non-authorship tasks, or content moderation.
There are 22 text authorship analysis frameworks tracked. 1 score above 70 (verified tier). The highest-rated is bigartm/bigartm at 74/100 with 672 stars and 445 monthly downloads.
Get all 22 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=text-authorship-analysis&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 |
bigartm/bigartm
Fast topic modeling platform |
|
Verified |
| 2 |
piskvorky/gensim
Topic Modelling for Humans |
|
Established |
| 3 |
vi3k6i5/GuidedLDA
semi supervised guided topic model with custom guidedLDA |
|
Established |
| 4 |
gregversteeg/corex_topic
Hierarchical unsupervised and semi-supervised topic models for sparse count... |
|
Emerging |
| 5 |
microsoft/knowledge-extraction-recipes-forms
Knowledge Extraction For Forms Accelerators & Examples |
|
Emerging |
| 6 |
centre-for-humanities-computing/tweetopic
Blazing fast topic modelling for short texts. |
|
Emerging |
| 7 |
google-marketing-solutions/ml_toast
Cluster multilingual search terms captured from different time windows into... |
|
Emerging |
| 8 |
taciano-perez/story-inspector
Tool for analyzing book structure using NLP techniques. Helps seeing the... |
|
Experimental |
| 9 |
dayyass/latent-semantic-analysis
Pipeline for training LSA models using Scikit-Learn. |
|
Experimental |
| 10 |
A-safarji/NLP-topic-modeling-project
Topic Modeling on subreddit (NLP). In order to work on NLP topic modeling,... |
|
Experimental |
| 11 |
Develop-Packt/Topic-Modeling-and-Theme-Extraction
In this module you will learn how to analyze topic modeling output from... |
|
Experimental |
| 12 |
Su1ph3r/seshat
Stylometric Authorship Attribution & Psychological Profiling Tool |
|
Experimental |
| 13 |
purijs/fasttfidf
High-performance TF-IDF vectorization for large-scale text datasets that... |
|
Experimental |
| 14 |
nigosto/authorship-recognition
Analysis and comparison of different machine learning models for authorship... |
|
Experimental |
| 15 |
arjo129/LangCluster
A visuallization for cognates in various languages and how they spread |
|
Experimental |
| 16 |
degenNovice/corpus-tfidf-analyzer
A Python tool for text analysis using TF-IDF, lemmatization, stopword... |
|
Experimental |
| 17 |
anildervis/codexa-code-authorship
Code authorship attribution |
|
Experimental |
| 18 |
GabrielePisciotta/NLP-Authorship-Verification-Case-Study
Natural Language Processing project covering the task of Authorship Verification |
|
Experimental |
| 19 |
alejandrejames/project-thesis
A topic modelling toolkit that can collect, pre-proccess, generate topic... |
|
Experimental |
| 20 |
Sajjad-Shahali/Text_Authorship_Detection
6-class text authorship detection pipeline for human and LLM-generated text... |
|
Experimental |
| 21 |
camara94/Analyse_semantique_latente
Cet article passe en revue l'analyse sémantique latente (LSA), une théorie... |
|
Experimental |
| 22 |
Ahmadhammam03/topic-modeling-lda-nmf
Comprehensive topic modeling with LDA and NMF algorithms for discovering... |
|
Experimental |