NLP Algorithm Implementations ML Frameworks
Reference implementations and educational projects for core NLP techniques (tokenization, sequence modeling, named entity recognition, relation extraction, coreference resolution). Does NOT include end-to-end applications, LLM frameworks, or general machine learning tools.
31 NLP algorithm implementation frameworks are tracked. Three score above 50 (Established tier). The highest-rated is jma127/pyltr at 65/100, with 464 stars.
Get all 31 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=nlp-algorithm-implementations&limit=20"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
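The same request can be made from a script. The sketch below is a minimal, unofficial example using only the Python standard library. It rebuilds the query string from the curl command above and decodes the response as JSON. The helper names `build_url` and `fetch_projects` are hypothetical, the response schema is not documented here so the result is returned as-is, and whether the endpoint accepts a `limit` of 31 (rather than the 20 shown above) is an assumption.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=31, domain="ml-frameworks",
              subcategory="nlp-algorithm-implementations"):
    """Build the request URL (hypothetical helper, not part of any official client)."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=31, timeout=10):
    """Fetch and decode the JSON payload.

    Assumption: the endpoint accepts limit=31; the curl example uses limit=20.
    The payload structure is not specified on this page, so it is returned unchanged.
    """
    with urlopen(build_url(limit), timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Print the URL only, so running this file makes no network request.
    print(build_url(20))
```

Calling `fetch_projects()` without a key counts against the 100 requests/day allowance, so caching the result locally is advisable when experimenting.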
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 | jma127/pyltr: Python learning to rank (LTR) toolkit | 65 | Established |
| 2 | evllabs/JGAAP: The Java Graphical Authorship Attribution Program | | Established |
| 3 | tensorflow/ranking: Learning to Rank in TensorFlow | | Established |
| 4 | Bibliome/alvisnlp: ALvisNLP corpus processing engine | | Emerging |
| 5 | rosette-api/rosette-elasticsearch-plugin: Document Enrichment plugin for Elasticsearch | | Emerging |
| 6 | bst-mug/n2c2: Support code for participation at the 2018 n2c2 Shared-Task Track 1 | | Experimental |
| 7 | ethanhezhao/MetaLDA: The code for MetaLDA in ICDM 2017 | | Experimental |
| 8 | lejon/PartiallyCollapsedLDA: Implementations of various fast parallelized samplers for LDA, including... | | Experimental |
| 9 | Gu-Youngfeng/EfficiencyMiner: This project explores the efficiency of CraTer in terms of time consumption... | | Experimental |
| 10 | ma2th/ecst: Embedded Classification Software Toolbox | | Experimental |
| 11 | AltA-Advisory/ReutersParser: Java parser for the "Reuters-21578, Distribution 1.0" Text Categorization data set. | | Experimental |
| 12 | digital-thinking/text-classification: Using the smile framework for text classification and additional preprocessing | | Experimental |
| 13 | EhsanMashhadi/ISSRE2023-BugSeverityPrediction: Code of our paper "Method-Level Bug Severity Prediction using Source Code... | | Experimental |
| 14 | tantikristanti/NERD_KID: NERD and wiKIData (NERD KID) is a machine learning application for... | | Experimental |
| 15 | tobiassteidle/NLP-Text-Classification-Webservice: Provide a web service that uses the trained model from the project NLP Text... | | Experimental |
| 16 | huangyueranbbc/Hadoop_MR_TFIDF: TF-IDF Hadoop | | Experimental |
| 17 | vijay033/Deeplearning4j: Word2Vec Porting On Android Using DeepLearning4j (On Device Machine Learning) | | Experimental |
| 18 | armdev/spring-stanford-corenlp: spring-stanford-corenlp simple app | | Experimental |
| 19 | doried-a-a/TDT-Java: Topic detection and tracking, by Tishreen University team, Doried - Alaa and Reham | | Experimental |
| 20 | CemalUnal/nlp-assignments: All experiments that I implemented while I'm taking the NLP course. | | Experimental |
| 21 | rmbelmont/Interactive-View-Interactive-Ontology-Matching-with-Deep-Learningg.-: Computer program in Java for Matching Cybersecurity Ontologies Using K-Means... | | Experimental |
| 22 | mouryallagadda/DocSimilaritySearch: This is a java application to find the nearest neighboring document using... | | Experimental |
| 23 | guangylegend/MEMM: MEMM for NLP | | Experimental |
| 24 | fermatjen/snappy: NLP and Ngram-based text classification system | | Experimental |
| 25 | dbracewell/gengoai: Mono Repository for GengoAI projects | | Experimental |
| 26 | cjmcgraw/ml-rescore-es-plugin: A plugin for dynamically rescoring documents in real time, using grpc... | | Experimental |
| 27 | lucabtz/DeepBlue: Java library to interact with Azure REST API | | Experimental |
| 28 | dcavar/J-JSON-NLP: Java JSON-NLP Maven module and validator | | Experimental |
| 29 | hterhors/SemanticMachineReading: Software Tool for Semantic Machine Reading based on Conditional Random Fields | | Experimental |
| 30 | azzuwan/Jasonium: Simple desktop based editor for Rasa NLU / MITIE training model (JSON) | | Experimental |
| 31 | matthewdargan/Semantic-Similarity: A command-line tool that finds the semantic usage of words from text | | Experimental |