SudhirGadhvi/open-vernacular-ai-kit
Clean Indian code-mixed text before it reaches your LLM.
53 / 100
Established
Available on PyPI.
Maintenance: 13 / 25
Adoption: 10 / 25
Maturity: 18 / 25
Community: 12 / 25
Stars: 5
Forks: 1
Language: Python
License: MIT
Category
Last pushed: Mar 13, 2026
Monthly downloads: 353
Commits (30d): 0
Dependencies: 3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/SudhirGadhvi/open-vernacular-ai-kit"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
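The same request can be made from Python. This is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the path layout (category, then owner, then repo) is inferred from the single curl example above, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, as inferred
    from the one example URL shown on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record and decode it, assuming a JSON body (unverified)."""
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to make the network request.
    print(build_quality_url("nlp", "SudhirGadhvi", "open-vernacular-ai-kit"))
```

Unauthenticated calls count against the 100 requests/day limit, so caching the result locally is a reasonable choice when polling several repositories.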
Related tools
acl-org/acl-anthology
Data and software for building the ACL Anthology.
76
anoopkunchukuttan/indic_nlp_library
Resources and tools for Indian language Natural Language Processing
74
CLUEbenchmark/CLUECorpus2020
Large-scale Pre-training Corpus for Chinese (100 GB)
53
Separius/awesome-sentence-embedding
A curated list of pretrained sentence and word embedding models
47
KennethEnevoldsen/scandinavian-embedding-benchmark
A Scandinavian Benchmark for sentence embeddings
47