islamAndAi/QURAN-NLP
Quran, Hadith, Translations, Tafaseer, Corpus Linguistics. Everything for NLP
Provides structured datasets across 190K+ Quranic corpus entries (dictionary, morphology, lemmas) plus 700K+ hadiths with web scrapers for authenticated sources like altafsir.com and thaqalayn.net. Implements semantic search via Google Universal Sentence Encoder, sentiment analysis per surah, and text summarization pipelines. Data exposed as CSV formats and hosted on Kaggle for accessibility across NLP frameworks.
133 stars. No commits in the last 6 months.
Stars
133
Forks
25
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Apr 09, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/islamAndAi/QURAN-NLP"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
motazsaad/comparable-text-miner
Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram...
yonatanlou/QumranNLP
Modern computational linguistics for the Dead Sea Scrolls
prakhar21/Automatic-Glossary-Generation
The projects lets you extract glossary words and their definitions from a given piece of text...
ronenh24/bible_search_engine
Bible search engine incorporating natural language processing, deep learning, and machine learning.
Elysian01/Codify
Codify enables data scientists to perform all the tedious and time-consuming tasks such as EDA...