joshualoehr/ngram-language-model

Python implementation of an N-gram language model with Laplace smoothing and sentence generation.

/ 100

Emerging

Builds N-gram models with configurable smoothing (Laplace with adjustable lambda) and computes perplexity scores on test sets. The implementation prioritizes transparency by hand-coding core algorithms beyond NLTK's `ngrams` and `FreqDist`, while generating sentences with cumulative probability scores. Expects pre-tokenized, sentence-segmented input and provides a CLI for specifying model order, smoothing parameters, and generation count.

No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 20 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Related tools

nlx-group/overlapy

Python package developed to evaluate textual overlap (N-Grams) between two volumes of text.

MannarAmuthan/kural-gen

KuralGen generates Thirukkural for a given English sentence

phughesmcr/SimpleNGrams

The easiest way to get n-grams from strings!

SpydazWebAI-NLP/BasicLanguageModelling2023

Basic Language Models , Bag of Words, Ngram Models Etc NLP modelling and associated tasks

simrann20/Hangman_Game_Project

Hangman Game implementation using n-gram language model in NLP, achieved an accuracy of more than 50%

Explore NLP Tools

All categories Trending NLP directory Insights