NLP Algorithm Implementations ML Frameworks

Reference implementations and educational projects for core NLP techniques (tokenization, sequence modeling, named entity recognition, relation extraction, coreference resolution). Does NOT include end-to-end applications, LLM frameworks, or general machine learning tools.

There are 31 nlp algorithm implementations frameworks tracked. 3 score above 50 (established tier). The highest-rated is jma127/pyltr at 65/100 with 464 stars.

Get all 31 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=nlp-algorithm-implementations&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 jma127/pyltr

Python learning to rank (LTR) toolkit

65
Established
2 evllabs/JGAAP

The Java Graphical Authorship Attribution Program

51
Established
3 tensorflow/ranking

Learning to Rank in TensorFlow

50
Established
4 Bibliome/alvisnlp

ALvisNLP corpus processing engine

37
Emerging
5 rosette-api/rosette-elasticsearch-plugin

Document Enrichment plugin for Elasticsearch

34
Emerging
6 bst-mug/n2c2

Support code for participation at the 2018 n2c2 Shared-Task Track 1

28
Experimental
7 ethanhezhao/MetaLDA

The code for MetaLDA in ICDM 2017

28
Experimental
8 lejon/PartiallyCollapsedLDA

Implementations of various fast parallelized samplers for LDA, including...

27
Experimental
9 Gu-Youngfeng/EfficiencyMiner

This project explores the efficiency of CraTer in terms of time consumption...

24
Experimental
10 ma2th/ecst

Embedded Classification Software Toolbox

23
Experimental
11 AltA-Advisory/ReutersParser

Java parser for the "Reuters-21578, Distribution 1.0" Text Categorization data set.

23
Experimental
12 digital-thinking/text-classification

Using the smile framework for text classification and additional preprocessing

20
Experimental
13 EhsanMashhadi/ISSRE2023-BugSeverityPrediction

Code of our paper "Method-Level Bug Severity Prediction using Source Code...

19
Experimental
14 tantikristanti/NERD_KID

NERD and wiKIData (NERD KID) is a machine learning application for...

19
Experimental
15 tobiassteidle/NLP-Text-Classification-Webservice

Provide a web service that uses the trained model from the project NLP Text...

18
Experimental
16 huangyueranbbc/Hadoop_MR_TFIDF

TF-IDF Hadoop

17
Experimental
17 vijay033/Deeplearning4j

Word2Vec Porting On Android Using DeepLearning4j ( On Device Machine Learning )

17
Experimental
18 armdev/spring-stanford-corenlp

spring-stanford-corenlp simple app

16
Experimental
19 doried-a-a/TDT-Java

Topic detection and tracking, by Tishreen University team, Doried - Alaa and Reham

16
Experimental
20 CemalUnal/nlp-assignments

All experiments that I implemented while I'm taking the NLP course.

15
Experimental
21 rmbelmont/Interactive-View-Interactive-Ontology-Matching-with-Deep-Learningg.-

Computer program in Java for Matching Cybersecurity Ontologies Using K-Means...

14
Experimental
22 mouryallagadda/DocSimilaritySearch

This is a java application to find the nearest neighboring document using...

14
Experimental
23 guangylegend/MEMM

MEMM for NLP

14
Experimental
24 fermatjen/snappy

NLP and Ngram-based text classification system

13
Experimental
25 dbracewell/gengoai

Mono Repository for GengoAI projects

12
Experimental
26 cjmcgraw/ml-rescore-es-plugin

A plugin for dynamically rescorcing documents in real time, using grpc...

12
Experimental
27 lucabtz/DeepBlue

Java library to interact with Azure REST API

11
Experimental
28 dcavar/J-JSON-NLP

Java JSON-NLP Maven module and validator

11
Experimental
29 hterhors/SemanticMachineReading

Software Tool for Semantic Machine Reading based on Conditional Random Fields

10
Experimental
30 azzuwan/Jasonium

Simple desktop based editor for Rasa NLU / MITIE training model (JSON)

10
Experimental
31 matthewdargan/Semantic-Similarity

A command-line tool that finds the semantic usage of words from text

10
Experimental