Arabic NLP Tools
Comprehensive NLP processing libraries, toolkits, and resources specifically for Arabic and Arabic dialects (including Modern Standard Arabic, Moroccan Darija, Tunisian Derja, Sudanese Arabic). Includes tokenization, POS tagging, stemming, diacritization, syntax analysis, and dialect-specific datasets. Does NOT include general multilingual NLP tools, non-Arabic language resources, or downstream applications (sentiment analysis, translation, etc.) unless Arabic processing is the primary focus.
There are 35 arabic nlp tools tracked. 1 score above 70 (verified tier). The highest-rated is CAMeL-Lab/camel_tools at 77/100 with 538 stars and 18,865 monthly downloads. 1 of the top 10 are actively maintained.
Get all 35 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=arabic-nlp-tools&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
CAMeL-Lab/camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL... |
|
Verified |
| 2 |
PetrKorab/Arabica
Python package for text mining of time-series data |
|
Established |
| 3 |
markuskiller/textblob-de
German language support for TextBlob. |
|
Established |
| 4 |
01walid/awesome-arabic
A curated list of awesome projects and dev/design resources for supporting... |
|
Emerging |
| 5 |
MagedSaeed/farasapy
A Python implementation of Farasa toolkit |
|
Emerging |
| 6 |
adhaamehab/textblob-ar
Arabic support for textblob |
|
Emerging |
| 7 |
ARBML/tkseem
Arabic Tokenization Library. It provides many tokenization algorithms. |
|
Emerging |
| 8 |
AnwarCS/Sudanese-Arabic-LLM
Building a Sudanese Arabic dataset and fine-tuning LLMs to improve... |
|
Emerging |
| 9 |
CompLin/nheengatu
Tools and resources for the computational processing of Nheengatu (Modern Tupi) |
|
Emerging |
| 10 |
Ruqyai/Ruqia-Library
Python library used for Arabic NLP to process, prepare and clean the Arabic text |
|
Emerging |
| 11 |
ARBML/tnkeeh
Arabic cleaning, normalization and segmentation library. |
|
Emerging |
| 12 |
linuxscout/arabicnlptoolslist
Arabic NLP tools List inventory |
|
Emerging |
| 13 |
Seen-Arabic/Arabic-Services
بعض الخدمات البرمجية على نصوص اللغة العربية |
|
Emerging |
| 14 |
AsoSoft/AsoSoft-Library-py
AsoSoft's Library for Kurdish language processing tasks in python |
|
Experimental |
| 15 |
mohabmes/Arabycia
Arabic NLP tool used to perform Text Search, POS tagging, Translation,... |
|
Experimental |
| 16 |
sudaverse/sudaverse
The Sudaverse ecosystem - Building Sudanese Arabic into the Heart of AI |
|
Experimental |
| 17 |
OussamaBenSlama/safwaText
safwaText is a Python package designed to clean, stem, and transform Arabic... |
|
Experimental |
| 18 |
iamjazzar/matn
A shared space for Arabic text processors. |
|
Experimental |
| 19 |
bahaeddinmselmi/derja-smart-scraper
A lightweight CLI tool for collecting Tunisian Derja text snippets from the... |
|
Experimental |
| 20 |
ARBML/nmatheg
A simple strategy for training and finetuning NLP models for Arabic. Specify... |
|
Experimental |
| 21 |
Rashidbm/pysarf
Python-native Arabic morphology engine powered by NumPy — root extraction,... |
|
Experimental |
| 22 |
gtoffoli/spacy-ar_core_news_md
Unofficial Arabic language model for spaCy |
|
Experimental |
| 23 |
AliOsamaHassan/Quran-and-Arabic-Language-Repository
Projects & Libraries related to Quran & Arabic Language |
|
Experimental |
| 24 |
Kwimoad/ToDarija
Automatic translation application into Moroccan Darija. This project... |
|
Experimental |
| 25 |
ayzem88/syntactic-selector
أداة متقدمة لتحليل التراكيب اللغوية العربية |
|
Experimental |
| 26 |
SssiiiSssiii/ArabicTextCleaner
Arabic Text Cleaner |
|
Experimental |
| 27 |
MujtabaMohsin/Syntactic-Positioning-for-Short-Arabic-Sentences
Irab Al-Ishraf (إعراب الأشراف) is a java application for syntactic... |
|
Experimental |
| 28 |
sinaahmadi/ScriptNormalization
Script Normalization for Unconventional Writing of Perso-Arabic scripts (ACL2023) |
|
Experimental |
| 29 |
sudaverse/sudaverse-normalizer
Sudanese Arabic text normalization and cleaning toolkit |
|
Experimental |
| 30 |
jerbarnes/nordial
NorDial is a project that aims to create resources and collect knowledge... |
|
Experimental |
| 31 |
abjed/Arabic-NLP-resources
📚 This project holds an inventory of NLP resources for Arabic. |
|
Experimental |
| 32 |
gtoffoli/spacy-cameltokenizer
Tokenizer extension for the Arabic language (MSA), integrating the... |
|
Experimental |
| 33 |
bahaeddinmselmi/tunisian-arabic-ai-dataset
The largest open-source dataset for Tunisian Arabic (Derja) NLP, featuring... |
|
Experimental |
| 34 |
wa3dbk/Barcha
Open source NLP resources for the Tunisian arabic dialect. |
|
Experimental |
| 35 |
theRealProHacker/dmg
An application that provides automatic transliteration to orientalists,... |
|
Experimental |