NLP Education Courses Embedding Tools

Educational repositories, course materials, and learning resources for NLP and deep learning fundamentals. Does NOT include production tools, language-specific NLP toolkits, or deployed applications.

There are 74 NLP education course tools tracked. Two score above 50 (Established tier). The highest-rated is roshan-research/hazm at 66/100, with 1,381 stars and 3,129 monthly downloads.

Get all 74 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=nlp-education-courses&limit=20"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
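For scripted access, the same request can be sketched in Python with only the standard library. The endpoint and the three query parameters (`domain`, `subcategory`, `limit`) are taken from the curl command above. Everything else is an assumption: the helper names are hypothetical, the response schema is not documented here, and the `score` field name in `count_above` is a guess. The sample request uses `limit=20`; whether a larger value returns all 74 projects is not confirmed by this page.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="embeddings", subcategory="nlp-education-courses", limit=20):
    """Build the request URL from the three query parameters shown in the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_json(url, timeout=30):
    """Send a GET request and decode the body as JSON (no response shape is assumed)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


def count_above(records, threshold, score_key="score"):
    """Count records scoring above `threshold`.

    `score_key` is a hypothetical field name; adjust it to the real schema.
    """
    return sum(1 for record in records if record.get(score_key, 0) > threshold)
```

Calling `fetch_json(build_url())` would issue the same request as the curl command. Once the real field names are known, `count_above(records, 50)` could be used to check the "two score above 50" figure against the returned data.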

# | Tool | Description | Score | Tier
1 | roshan-research/hazm | Persian NLP Toolkit | 66 | Established
2 | Dadmatech/DadmaTools | DadmaTools is a Persian NLP toolkit developed by Dadmatech Co. | 51 | Established
3 | amirivojdan/shekar | Simplifying Persian NLP for Modern Applications | 47 | Emerging
4 | GlobalMaksimum/sadedegel | A General Purpose NLP library for Turkish | 42 | Emerging
5 | GKalliatakis/Keras-VGG16-places365 | Keras code and weights files for the VGG16-places365 and VGG16-hybrid1365... | 42 | Emerging
6 | NC0DER/KeyphraseExtraction | Keyphrase Extraction Review | 36 | Emerging
7 | giladodinak/mlinc | Machine Learning from scratch in C | 35 | Emerging
8 | GKalliatakis/Keras-Application-Zoo | Reference implementations of popular DL models missing from... | 33 | Emerging
9 | todd-cook/ML-You-Can-Use | Practical ML and NLP with examples. | 32 | Emerging
10 | ayaanzhaque/SDCNL | Deep Learning for Suicide and Depression Identification with Unsupervised... | 28 | Experimental
11 | siddk/entity-network | Tensorflow implementation of "Tracking the World State with Recurrent Entity... | 27 | Experimental
12 | bharatc9530/Machine-Learning | Data Visualization, EDA, Model Building and Deployment etc. | 26 | Experimental
13 | aakhundov/sequence-labeling | Accompanying repository of our NLP paper "Sequence Labeling: A Practical... | 26 | Experimental
14 | jukiewiczm/kaggle-predict-future-sales | Kaggle's Predict Future Sales competition project (TOP 15 solution as of March 2020) | 26 | Experimental
15 | uzairakbar/info-retrieval | Information Retrieval in High Dimensional Data (class deliverables) | 26 | Experimental
16 | divyanshj16/Image-Captioning | Image Captioning from scratch | 25 | Experimental
17 | GiorgiaAuroraAdorni/learning-personality | Learning Personality is a bachelor internship project that uses neural... | 25 | Experimental
18 | jonas-scholz123/msci-project | Master's project: What makes conversations interesting, an NLP approach. We... | 24 | Experimental
19 | bsantraigi/Tensorflow-RNN-Tutorials | Tutorial on English to Hindi Transliteration using Seq2Seq Architecture in Tensorflow | 24 | Experimental
20 | dl4nlp-rg/search-with-dense-vectors | Final project for course on deep learning for nlp (IA376E/1s2020 @ Unicamp) | 24 | Experimental
21 | mhdk1602/python_training | Training materials for data engineering concepts in python | 24 | Experimental
22 | Aliipou/culture-identifier | NLP-powered personality analyzer that matches your writing style to iconic... | 23 | Experimental
23 | greerviau/calcine | A source-agnostic, type-agnostic featurization pipeline framework for Python | 23 | Experimental
24 | lorenzobalzani/nlp_projects | This repository contains assignments, the final course project, and the... | 23 | Experimental
25 | K1Mo0/Sales_forecasting_m5 | 📈 Forecast retail sales using the M5 dataset to enhance inventory management... | 23 | Experimental
26 | airalcorn2/Color-Names | An improved version of the color name model described here:... | 23 | Experimental
27 | dmivilensky/Sampling-from-homotopy-groups-of-spheres | Sampling from π_n(S^2). Application of optimization and machine learning... | 23 | Experimental
28 | ecgResearch/HeartBert | repo for the HeartBERT model | 23 | Experimental
29 | data-datum/nlp_resources | Resources related to NLP | 22 | Experimental
30 | Xhst/data-engineering-projects | Projects for the course Data Engineering held by professor Paolo Merialdo at... | 22 | Experimental
31 | Hartorn/thc-net | Keras-based easy way to create NN and SNN | 22 | Experimental
32 | ashioyajotham/Natural-Language-Processing | NLP | 21 | Experimental
33 | Wildertrek/survey | A Computational Atlas of 44 Personality Models: standardized datasets,... | 21 | Experimental
34 | josuviteri/Clinic-Note-NLP | A comprehensive Natural Language Processing project focused on clinical... | 20 | Experimental
35 | dmeoli/OnlineRetail | Data Mining project 2020/2021 @ University of Pisa | 20 | Experimental
36 | nssharmaofficial/image-caption-generator | Image captioning model with Resnet50 encoder and LSTM decoder | 19 | Experimental
37 | rusnlp/rusnlp | Map of Russian NLP papers | 19 | Experimental
38 | AdityaDutt/MultiColor-Shapes-Database | A small database to test different machine learning tasks. It contains... | 18 | Experimental
39 | ljanyst/deep-learning | My musings related to deep learning | 17 | Experimental
40 | kgautam01/Automated-Essay-Scorer | Deep learning & NLP based project, using word2vec model for creating word... | 17 | Experimental
41 | donfaq/legal-space-research | Russian law meets language modelling | 16 | Experimental
42 | SEANIMALMOVE/CetaceanWhistleDetection_StraitOfGibraltar | This repository contains an iterative deep learning pipeline for cetacean... | 16 | Experimental
43 | caramel2001/FewShotLearning | Flower Recognition: Dealing with Less Data via Few-Shot Learning | 15 | Experimental
44 | rezakhosro/categorized-english-words | 154K domain-specific English words across 34 academic fields | 15 | Experimental
45 | Code-Trees/END-GAME | It's all about NLP. From End-To-End. We will create Deep NLP beyond... | 15 | Experimental
46 | KishManani/financial_forecasting_competition_g_research | My submission to the G Research Financial Forecasting Competition of 2018 | 15 | Experimental
47 | ilyesrezgui/Attention-based-Method-for-Design-Pattern-Detection | Implementation of the paper Attention-based Method for Design Pattern Detection | 15 | Experimental
48 | bogwi/rookeen | spaCy-based CLI for web linguistic analysis with embeddings, sentiment,... | 15 | Experimental
49 | ghostfr1end/the-blueprint-nlp | NLP analysis of the Telegram channel The Blueprint (March–May 2025):... | 15 | Experimental
50 | rasyosef/text-embedding-models-training | Notebooks to train and evaluate Amharic Text Embedding Models based on BERT... | 15 | Experimental
51 | agoor97/NLP_tasks | This Repo collects Notebooks for NLP | 14 | Experimental
52 | devmount/neural-network-pos-tagger | Train and evaluate neural network language models for POS tagging, tag input... | 14 | Experimental
53 | davidzyx/HinDroid-with-Embeddings | Experiments on improving the HinDroid model | 14 | Experimental
54 | windsuzu/Tensorflow2-Beginner | Notes from deeplearning.ai's Tensorflow 2 course. It includes the basic... | 14 | Experimental
55 | GalinaDaub/YaPracticum | This will collect the most interesting projects I completed in the process of... | 14 | Experimental
56 | yaassonn/wsd_project | Word Sense Disambiguation with DeBERTa-v3-base, centroid classification,... | 14 | Experimental
57 | anu-gtb/evalbot | This is an automated scoring system that leverages a trained BERT model to... | 14 | Experimental
58 | anubhavmaity/wattbot | My implementation for a kaggle competition:... | 13 | Experimental
59 | z-aqib/embedded-echoes-ml | A complete ML pipeline for the Kaggle “Embedded Echoes” challenge using... | 12 | Experimental
60 | SkywardAI/cecilia | EDA tools and datasets generator for ML projects | 12 | Experimental
61 | vla6/Blog_gnn_naics | Exploring categorical features with various encodings and models | 12 | Experimental
62 | victor7246/Notebooks | This repository contains notebooks on different topics across - linear... | 11 | Experimental
63 | SERGI0HERREROS/NLP_ProgramasElectorales | Electoral Analysis System based on Natural Language Processing (NLP) for the... | 11 | Experimental
64 | mozartfish/mozartfish | About Me | 11 | Experimental
65 | usbt0p/NLP_course | Simple NLP projects implemented from-scratch in Numpy. BPE, Skip-gram... | 11 | Experimental
66 | mmarouen/marabou | Natural language processing and computer vision use cases for non-technical users | 11 | Experimental
67 | turgut090/nlp_az_R_Py_Global_vectors_tr_Keras | NLP for Azerbaijani language | 11 | Experimental
68 | JocelynVelarde/FinanceAI | Code repository for Atrato Financial Hackathon | 11 | Experimental
69 | Balazs-Nagy/actuarial-loss-prediction | Actuarial loss prediction of workers’ compensation claims using natural... | 11 | Experimental
70 | DJRamosA/AC-PLT | This project presents a novel algorithm that uses natural language... | 10 | Experimental
71 | michimichiamo/pos-tagging | Training three different RNN models on a portion of Penn Treebank data to... | 10 | Experimental
72 | LNshuti/national-bank-of-Rwanda | Applied AI Embeddings with Python using National Economic Reports | 10 | Experimental
73 | najafmurtaza/Developing-Machine-Learning-Models-in-Flask | Flask for training/testing Watson, FastText, Gensen Embeddings and hDBScan.... | 10 | Experimental
74 | arkeodev/nlp | Natural Language Processing (NLP) | 10 | Experimental