N-Gram Language Models NLP Tools

Statistical language models built using n-gram approaches with smoothing techniques (Laplace, Kneser-Ney, Witten-Bell, Good-Turing) for probability estimation and text generation. This category does not include neural language models, word embeddings, or general language model applications such as chatbots or summarization.
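As an illustration of the simplest smoothing technique named above, here is a minimal sketch of a bigram model with Laplace (add-one) smoothing. It is not taken from any of the listed projects; the function names `count_bigrams` and `laplace_prob` and the toy sentence are hypothetical, chosen only for this example.

```python
from collections import Counter


def count_bigrams(tokens):
    """Return (context counts, bigram counts) for a list of tokens.

    A context is any token that has a successor, so the final token
    of the sequence is not counted as a context.
    """
    contexts = Counter(tokens[:-1])
    bigrams = Counter(zip(tokens, tokens[1:]))
    return contexts, bigrams


def laplace_prob(prev, word, contexts, bigrams, vocab_size):
    """Add-one estimate: P(word | prev) = (c(prev, word) + 1) / (c(prev) + V)."""
    return (bigrams[(prev, word)] + 1) / (contexts[prev] + vocab_size)


# Toy corpus (assumption, for illustration only).
tokens = "the cat sat on the mat".split()
contexts, bigrams = count_bigrams(tokens)
vocab = set(tokens)

# Seen bigram ("the", "cat"): (1 + 1) / (2 + 5) = 2/7
p_seen = laplace_prob("the", "cat", contexts, bigrams, len(vocab))
# Unseen bigram ("the", "sat"): (0 + 1) / (2 + 5) = 1/7, non-zero thanks to smoothing
p_unseen = laplace_prob("the", "sat", contexts, bigrams, len(vocab))
```

Adding one to every count keeps unseen bigrams from receiving zero probability while the estimates for each context still sum to one over the vocabulary. Kneser-Ney, Witten-Bell, and Good-Turing redistribute probability mass in more refined ways and are not shown here.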

There are 25 n-gram language model tools tracked. The highest-rated is nlx-group/overlapy at 36/100 with 10 stars.

Get all 25 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=n-gram-language-models&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
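The same request can be made from Python with the standard library. This is a sketch only: the helper names `build_url` and `fetch_projects` are hypothetical, the query parameters simply mirror the curl command above, and the structure of the returned JSON is not documented here, so the code decodes it without assuming any particular fields.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="nlp", subcategory="n-gram-language-models", limit=20):
    """Build the request URL with the same query parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch the URL and decode the JSON body; the response shape is not assumed."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


url = build_url()
# Uncomment to perform the request (counts toward the daily request limit):
# data = fetch_projects(url)
```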

| # | Tool | Description | Score (out of 100) | Tier |
|---|------|-------------|--------------------|------|
| 1 | nlx-group/overlapy | Python package developed to evaluate textual overlap (N-Grams) between two... | 36 | Emerging |
| 2 | joshualoehr/ngram-language-model | Python implementation of an N-gram language model with Laplace smoothing and... | 30 | Emerging |
| 3 | MannarAmuthan/kural-gen | KuralGen generates Thirukkural for a given English sentence | 25 | Experimental |
| 4 | phughesmcr/SimpleNGrams | The easiest way to get n-grams from strings! | 24 | Experimental |
| 5 | SpydazWebAI-NLP/BasicLanguageModelling2023 | Basic Language Models, Bag of Words, Ngram Models Etc NLP modelling and... | 22 | Experimental |
| 6 | jadessechan/Text-Prediction | A trigram language model using NLTK to predict the next word of a phrase | 21 | Experimental |
| 7 | simrann20/Hangman_Game_Project | Hangman Game implementation using n-gram language model in NLP, achieved an... | 21 | Experimental |
| 8 | shayneobrien/language-modeling | Language modeling on the Penn Treebank (PTB) corpus using a trigram model... | 19 | Experimental |
| 9 | burhanharoon/N-Gram-Language-Model | It's a Python-based n-gram language model which calculates bigrams,... | 18 | Experimental |
| 10 | sayarghoshroy/Language-Modelling | Language Modelling for various corpora, Natural Language Generation using... | 18 | Experimental |
| 11 | SOHAM-3T/sherlock-5gram-language-model | A statistical 5-gram language model implemented in Python, trained on... | 16 | Experimental |
| 12 | AutoComplete-sh/autocomplete-wordpress | AutoComplete.sh WordPress Plugin Beta | 14 | Experimental |
| 13 | vgherard/kgrams | k-grams, Language Models, and All That | 13 | Experimental |
| 14 | ngrams-dev/general | NGRAMS is a search engine for the Google Books Ngram Dataset. This... | 13 | Experimental |
| 15 | akhvorov/vgram | Feature extraction from sequential data | 13 | Experimental |
| 16 | Proteusiq/bigrams | Non-intrusive ngrams generations | 13 | Experimental |
| 17 | gustavecortal/ngram | Python implementation of n-gram language models from scratch and using NLTK... | 13 | Experimental |
| 18 | prigarg/Bigram-Language-Model-from-Scratch | A Bigram Language Model from scratch with no-smoothing and add-one... | 12 | Experimental |
| 19 | aankitdas/cs6320_nlp | This repository implements an n-gram-based language model for the CS6320 NLP... | 12 | Experimental |
| 20 | ArslanJajja1/bigram-language-model | Learn how language models work by building a Bigram model from scratch —... | 12 | Experimental |
| 21 | OlgaKrylova/language-models | Several training language models (text generation and classification) | 12 | Experimental |
| 22 | UCLL-RE-Gen-AI/clin32-entropy | This demo accompanies the poster presentation "Limitations of the entropy... | 11 | Experimental |
| 23 | capjamesg/linguist.link | Find the most surprising words and most common n-grams on a web page. | 11 | Experimental |
| 24 | phanxuanphucnd/ngram_language_model | n-gram language model. | 10 | Experimental |
| 25 | onekyn/n-gram | A simple C implementation of an n-gram language model | 10 | Experimental |