Arabic NLP Tools

Comprehensive NLP processing libraries, toolkits, and resources specifically for Arabic and Arabic dialects (including Modern Standard Arabic, Moroccan Darija, Tunisian Derja, Sudanese Arabic). Includes tokenization, POS tagging, stemming, diacritization, syntax analysis, and dialect-specific datasets. Does NOT include general multilingual NLP tools, non-Arabic language resources, or downstream applications (sentiment analysis, translation, etc.) unless Arabic processing is the primary focus.

There are 35 arabic nlp tools tracked. 1 score above 70 (verified tier). The highest-rated is CAMeL-Lab/camel_tools at 77/100 with 538 stars and 18,865 monthly downloads. 1 of the top 10 are actively maintained.

Get all 35 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=arabic-nlp-tools&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 CAMeL-Lab/camel_tools

A suite of Arabic natural language processing tools developed by the CAMeL...

77
Verified
2 PetrKorab/Arabica

Python package for text mining of time-series data

53
Established
3 markuskiller/textblob-de

German language support for TextBlob.

50
Established
4 01walid/awesome-arabic

A curated list of awesome projects and dev/design resources for supporting...

40
Emerging
5 MagedSaeed/farasapy

A Python implementation of Farasa toolkit

39
Emerging
6 adhaamehab/textblob-ar

Arabic support for textblob

38
Emerging
7 ARBML/tkseem

Arabic Tokenization Library. It provides many tokenization algorithms.

37
Emerging
8 AnwarCS/Sudanese-Arabic-LLM

Building a Sudanese Arabic dataset and fine-tuning LLMs to improve...

36
Emerging
9 CompLin/nheengatu

Tools and resources for the computational processing of Nheengatu (Modern Tupi)

34
Emerging
10 Ruqyai/Ruqia-Library

Python library used for Arabic NLP to process, prepare and clean the Arabic text

31
Emerging
11 ARBML/tnkeeh

Arabic cleaning, normalization and segmentation library.

31
Emerging
12 linuxscout/arabicnlptoolslist

Arabic NLP tools List inventory

31
Emerging
13 Seen-Arabic/Arabic-Services

بعض الخدمات البرمجية على نصوص اللغة العربية

30
Emerging
14 AsoSoft/AsoSoft-Library-py

AsoSoft's Library for Kurdish language processing tasks in python

28
Experimental
15 mohabmes/Arabycia

Arabic NLP tool used to perform Text Search, POS tagging, Translation,...

26
Experimental
16 sudaverse/sudaverse

The Sudaverse ecosystem - Building Sudanese Arabic into the Heart of AI

25
Experimental
17 OussamaBenSlama/safwaText

safwaText is a Python package designed to clean, stem, and transform Arabic...

24
Experimental
18 iamjazzar/matn

A shared space for Arabic text processors.

24
Experimental
19 bahaeddinmselmi/derja-smart-scraper

A lightweight CLI tool for collecting Tunisian Derja text snippets from the...

24
Experimental
20 ARBML/nmatheg

A simple strategy for training and finetuning NLP models for Arabic. Specify...

22
Experimental
21 Rashidbm/pysarf

Python-native Arabic morphology engine powered by NumPy — root extraction,...

21
Experimental
22 gtoffoli/spacy-ar_core_news_md

Unofficial Arabic language model for spaCy

20
Experimental
23 AliOsamaHassan/Quran-and-Arabic-Language-Repository

Projects & Libraries related to Quran & Arabic Language

20
Experimental
24 Kwimoad/ToDarija

Automatic translation application into Moroccan Darija. This project...

19
Experimental
25 ayzem88/syntactic-selector

أداة متقدمة لتحليل التراكيب اللغوية العربية

16
Experimental
26 SssiiiSssiii/ArabicTextCleaner

Arabic Text Cleaner

16
Experimental
27 MujtabaMohsin/Syntactic-Positioning-for-Short-Arabic-Sentences

Irab Al-Ishraf (إعراب الأشراف) is a java application for syntactic...

15
Experimental
28 sinaahmadi/ScriptNormalization

Script Normalization for Unconventional Writing of Perso-Arabic scripts (ACL2023)

15
Experimental
29 sudaverse/sudaverse-normalizer

Sudanese Arabic text normalization and cleaning toolkit

15
Experimental
30 jerbarnes/nordial

NorDial is a project that aims to create resources and collect knowledge...

14
Experimental
31 abjed/Arabic-NLP-resources

📚 This project holds an inventory of NLP resources for Arabic.

13
Experimental
32 gtoffoli/spacy-cameltokenizer

Tokenizer extension for the Arabic language (MSA), integrating the...

13
Experimental
33 bahaeddinmselmi/tunisian-arabic-ai-dataset

The largest open-source dataset for Tunisian Arabic (Derja) NLP, featuring...

12
Experimental
34 wa3dbk/Barcha

Open source NLP resources for the Tunisian arabic dialect.

12
Experimental
35 theRealProHacker/dmg

An application that provides automatic transliteration to orientalists,...

10
Experimental