GlobalMaksimum/sadedegel
A General Purpose NLP library for Turkish
Combines unsupervised extractive summarization with modular NLP components including ML-based sentence boundary detection, multiple embedding strategies (BERT, TF-IDF, Word2Vec), and diverse summarization algorithms (TextRank, LexRank, BM25). Provides sklearn-compatible feature extraction APIs and domain-specific datasets covering e-commerce, social media, and news domains, with experimental prebuilt classifiers for common tasks like news categorization.
No commits in the last 6 months. Available on PyPI.
Stars
95
Forks
14
Language
Python
License
MIT
Category
Last pushed
Apr 12, 2023
Commits (30d)
0
Dependencies
14
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/GlobalMaksimum/sadedegel"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
roshan-research/hazm
Persian NLP Toolkit
Dadmatech/DadmaTools
DadmaTools is a Persian NLP tools developed by Dadmatech Co.
amirivojdan/shekar
Simplifying Persian NLP for Modern Applications
GKalliatakis/Keras-VGG16-places365
Keras code and weights files for the VGG16-places365 and VGG16-hybrid1365 CNNs for scene classification
GKalliatakis/Keras-Application-Zoo
Reference implementations of popular DL models missing from keras-applications & keras-contrib