amirivojdan/shekar
Simplifying Persian NLP for Modern Applications
Provides modular preprocessing operators (normalizers, filters, maskers) that can be chained with the `|` operator, with normalization following Academy of Persian Language guidelines. Also covers tokenization, POS tagging, NER, and embeddings. Inference runs on ONNX Runtime (CPU or GPU) with a lightweight footprint, and an optional web UI supports interactive exploration. Test coverage is above 95% across hundreds of test cases, and Windows, Linux, and macOS (including Apple Silicon) are supported.
Available on PyPI.
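The `|` composition mentioned in the description can be illustrated with a small, self-contained sketch. This is not shekar's actual API: the `Step` and `Pipeline` classes and the three example operators are hypothetical names invented here, and the sketch only shows how Python's `__or__` hook lets text operators be chained left to right.

```python
import re
from typing import Callable, List


class Step:
    """Hypothetical text-to-text operator that can be chained with `|`."""

    def __init__(self, fn: Callable[[str], str]):
        self.fn = fn

    def __call__(self, text: str) -> str:
        return self.fn(text)

    def __or__(self, other: "Step") -> "Pipeline":
        # `a | b` builds a pipeline that runs a, then b.
        return Pipeline([self, other])


class Pipeline(Step):
    """Hypothetical container that applies its steps in order."""

    def __init__(self, steps: List[Step]):
        self.steps = list(steps)

    def __call__(self, text: str) -> str:
        for step in self.steps:
            text = step(text)
        return text

    def __or__(self, other: Step) -> "Pipeline":
        # Extending an existing pipeline keeps a flat list of steps.
        return Pipeline(self.steps + [other])


# Normalizer: map Arabic yeh/kaf to their Persian counterparts.
_ARABIC_TO_PERSIAN = str.maketrans({"\u064a": "\u06cc", "\u0643": "\u06a9"})
unify_letters = Step(lambda t: t.translate(_ARABIC_TO_PERSIAN))

# Masker: replace URLs with a placeholder token.
mask_urls = Step(lambda t: re.sub(r"https?://\S+", "<URL>", t))

# Filter: collapse runs of whitespace into single spaces.
collapse_spaces = Step(lambda t: re.sub(r"\s+", " ", t).strip())

pipeline = unify_letters | mask_urls | collapse_spaces

if __name__ == "__main__":
    print(pipeline("\u0643\u062a\u0627\u0628   \u0639\u0644\u064a  https://example.com"))
```

Overloading `|` keeps each operator independently testable while the chained expression reads in execution order. The library's real class names and import paths should be taken from its own documentation.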
Stars
61
Forks
4
Language
Python
License
MIT
Category
Last pushed
Mar 12, 2026
Commits (30d)
0
Dependencies
5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/amirivojdan/shekar"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
roshan-research/hazm
Persian NLP Toolkit
Dadmatech/DadmaTools
DadmaTools is a Persian NLP toolkit developed by Dadmatech Co.
GlobalMaksimum/sadedegel
A General Purpose NLP library for Turkish
GKalliatakis/Keras-VGG16-places365
Keras code and weights files for the VGG16-places365 and VGG16-hybrid1365 CNNs for scene classification
NC0DER/KeyphraseExtraction
Keyphrase Extraction Review