Lexical Semantic Resources NLP Tools

Tools and APIs for accessing structured lexical databases, wordnets, and semantic networks across languages. Includes synonym/antonym/hypernym lookup and semantic relationship repositories. Does NOT include word embeddings, word sense disambiguation systems, or semantic parsing tools.
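To make the scope concrete, here is a minimal sketch of what synonym, antonym, and hypernym lookup over a semantic network looks like. It uses a tiny hand-written in-memory graph, not any of the listed libraries; the `ToyLexicon` class, its method names, and the sample words are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


class ToyLexicon:
    """A tiny in-memory semantic network: (word, relation) -> set of words.

    Hypothetical illustration only; real wordnets group words into
    synsets and carry far richer relation inventories.
    """

    # Relations stored in both directions when added.
    SYMMETRIC = {"synonym", "antonym"}

    def __init__(self):
        self._edges = defaultdict(set)

    def add(self, word, relation, target):
        self._edges[(word, relation)].add(target)
        if relation in self.SYMMETRIC:
            self._edges[(target, relation)].add(word)

    def lookup(self, word, relation):
        """Return the related words in sorted order (empty list if none)."""
        return sorted(self._edges.get((word, relation), set()))

    def hypernym_chain(self, word):
        """Follow the first hypernym link upward until none remains."""
        chain = []
        seen = {word}
        current = word
        while True:
            parents = self.lookup(current, "hypernym")
            if not parents or parents[0] in seen:
                break
            current = parents[0]
            seen.add(current)
            chain.append(current)
        return chain


# Toy data, invented for this example.
lexicon = ToyLexicon()
lexicon.add("big", "synonym", "large")
lexicon.add("big", "antonym", "small")
lexicon.add("dog", "hypernym", "canine")
lexicon.add("canine", "hypernym", "mammal")
lexicon.add("mammal", "hypernym", "animal")
```

With this toy data, `lexicon.lookup("large", "synonym")` returns `["big"]` and `lexicon.hypernym_chain("dog")` returns `["canine", "mammal", "animal"]`. The tools listed below expose comparable lookups over real lexical databases, each with its own API.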

This directory tracks 83 lexical semantic resource tools. Two score above 70 (Verified tier). The highest-rated is chatopera/Synonyms at 76/100, with 5,104 stars and 723 monthly downloads. One of the top 10 is actively maintained.

Get all 83 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=lexical-semantic-resources&limit=20"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
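The same request can be made from Python with only the standard library. This is a sketch under assumptions: the endpoint and query parameters are copied from the curl command above, but the shape of the JSON response is not documented on this page, so the code returns the decoded payload without interpreting it. The helper names `build_url` and `fetch_projects` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the query URL with the same parameters as the curl example."""
    params = {
        "domain": "nlp",
        "subcategory": "lexical-semantic-resources",
        "limit": limit,
    }
    return BASE_URL + "?" + urllib.parse.urlencode(params)


def fetch_projects(limit=20, timeout=30):
    """Perform the GET request and return the decoded JSON payload.

    The response schema is an unknown here; inspect the payload
    before relying on any particular field.
    """
    request = urllib.request.Request(
        build_url(limit), headers={"Accept": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` issues one request against the daily quota. The curl example passes `limit=20`; whether a larger `limit` returns all 83 projects in one response is not stated on this page.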

# Tool Score Tier
1 chatopera/Synonyms

Chinese synonyms: a toolkit for chatbots and intelligent question answering

76
Verified
2 isaacus-dev/semchunk

A fast, lightweight and easy-to-use Python library for splitting text into...

74
Verified
3 goodmami/wn

A modern, interlingual wordnet interface for Python

68
Established
4 CUNY-CL/wikipron

Massively multilingual pronunciation mining

67
Established
5 UCREL/pymusas

Python Multilingual Ucrel Semantic Analysis System

59
Established
6 jacksonllee/pylangacq

Language Acquisition Research Tools

58
Established
7 thunlp/OpenHowNet

Core Data of HowNet and OpenHowNet Python API

58
Established
8 gutfeeling/word_forms

Accurately generate all possible forms of an English word, e.g. "election" -->...

57
Established
9 Lilykos/pyphonetics

A Python 3 phonetics library.

55
Established
10 kakaobrain/word2word

Easy-to-use word-to-word translations for 3,564 language pairs.

55
Established
11 natasha/slovnet

Deep Learning based NLP modeling for Russian language

54
Established
12 chrislit/abydos

Abydos NLP/IR library for Python

50
Established
13 mideind/GreynirServer

The greynir.is Icelandic natural language processing API and website.

49
Emerging
14 wroberts/pygermanet

GermaNet API for Python

48
Emerging
15 soumendrak/openodia

A package of various tools for the Odia language.

48
Emerging
16 nlpaueb/gr-nlp-toolkit

The Greek NLP toolkit for Python. Supports NER/DP/POS...

46
Emerging
17 bjascob/pyInflect

A python module for word inflections designed for use with spaCy.

46
Emerging
18 meta-toolkit/meta

A Modern C++ Data Sciences Toolkit

44
Emerging
19 wetneb/pynif

A small Python library for NLP Interchange Format (NIF) for NER(D) systems

44
Emerging
20 murray-z/text_analysis_tools

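Chinese text analysis toolkit (including text classification, text clustering, text similarity, keyword extraction, key phrase extraction, sentiment analysis, text error correction, text summarization, ...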

43
Emerging
21 maxadamski/plwordnet

Unofficial Python library for using the Polish Wordnet (plWordNet / Słowosieć)

43
Emerging
22 open-language/en-wordnet

En-Wordnet is a node.js module which makes Princeton University's Wordnet...

42
Emerging
23 open-language/en-dictionary

En-Dictionary is a node.js module which makes words and their relations...

42
Emerging
24 bureaucratic-labs/revizor

Ecommerce product title recognition package

42
Emerging
25 howl-anderson/Chinese_models_for_SpaCy

Models for SpaCy that support Chinese

41
Emerging
26 johnbumgarner/wordhoard

This Python module can be used to obtain antonyms, synonyms, hypernyms,...

41
Emerging
27 TakeLab/spacy-udpipe

spaCy + UDPipe

39
Emerging
28 tasdikrahman/vocabulary

[Not Maintained anymore] Python Module to get Meanings, Synonyms and what...

39
Emerging
29 mideind/GreynirEngine

A fast, efficient natural language processing engine for Icelandic.

38
Emerging
30 Lambda-3/Indra

Indra is a Web Service which allows easy access to different distributional...

35
Emerging
31 nltk/wordnet

Stand-alone WordNet API

35
Emerging
32 sdam-au/LAGT

ETL repo for ancient Greek texts

32
Emerging
33 dmeoli/WS4J

WordNet Similarity for Java provides an API for several Semantic...

32
Emerging
34 medzuslovjansky/database

Information about the Interslavic language, for both computers and people

31
Emerging
35 web64/norwegian-nlp-resources

Norwegian NLP Resources

31
Emerging
36 open-language/wordnets

Wordnets is a gzip package which makes Princeton University's Wordnet and...

31
Emerging
37 avidale/encodechka

The tiniest sentence encoder for Russian language

30
Emerging
38 slgero/receipt_parser

Allow parsing Russian receipts

29
Experimental
39 mideind/BinPackage

The vocabulary of modern Icelandic, encapsulated in a Python package.

29
Experimental
40 techiaith/lecsicon-cymraeg-bangor

A comprehensive lexicon of Welsh word forms based on checker data...

27
Experimental
41 nepalibhasha/varnavinyas

वर्णविन्यास — Open-source Nepali orthography toolkit based on Nepal Academy...

27
Experimental
42 iis-research-team/wiki-synonyms

Python library to search for synonyms in Russian

26
Experimental
43 yweweler/c-t9

A T9 typing system written in C11

26
Experimental
44 techiaith/lecsicon-cymraeg-bangor-enghreifftiau

Examples of using Lecsicon Cymraeg Bangor // Examples of code...

25
Experimental
45 ogpetrov/sakha-nlp

Various tools and data for Sakha language NLP.

25
Experimental
46 ShihabYasin/Extracting-Semantic-Relatedness-For-Bangla-Words

Semantic Relatedness For Bangla Words.

24
Experimental
47 wjbmattingly/spacyex

SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.

24
Experimental
48 melaniab/spacy-pipeline-bg

Bulgarian spaCy natural language processing pipeline

24
Experimental
49 Salah-Sal/arabic-wordnet-v4

Arabic WordNet 4.0 - 109,823 synsets translated from Open English WordNet

24
Experimental
50 petra-viola/wortsalat

Python NLP library for the German language

23
Experimental
51 open-language/id-en-dictionary

Id-En-Dictionary is a node.js module which makes Indonesian words, their...

22
Experimental
52 Nousheen0329/daimones-community

Explore AI-driven philosophical dialogues with Aristotle using Ancient...

22
Experimental
53 e-gun/HipparchiaGoServer

Front end to Greek and Latin corpora: searching, browsing, concordances,...

22
Experimental
54 Aatlantise/prosody-syntax-interface

Measuring syntactic information content in prosodic features

22
Experimental
55 ispasic/idiometry

An idiom search engine

21
Experimental
56 mattlianje/loquax

NLP framework for phonology

21
Experimental
57 theodm/gender-assistenz

Application for detecting the generic masculine in German texts and...

20
Experimental
58 BirdsAreFlyingCameras/WordLists

A repo containing wordlists I've compiled over time. 215,184 Names, ...

20
Experimental
59 tech4germany/bam-inclusify

INCLUSIFY is a tool to support the practical use of diversity-sensitive...

20
Experimental
60 wimarka-uic/WiMarka

Python library and CLI tool designed for evaluating machine translations...

17
Experimental
61 diyclassics/la_core_web_lg

spaCy-compatible sm/md/lg/trf core models for Latin, i.e. pipeline with POS...

16
Experimental
62 snizio/italian-wiktionary-parser

This repository contains a python script for parsing an xml dump of the...

16
Experimental
63 cadia-lvl/icelandic-NLP-resources

Overview of Icelandic NLP resources at a glance

16
Experimental
64 nanguoshun/StatNLP-Framework

C++ based implementation of StatNLP framework

16
Experimental
65 ayzem88/data-analyzer

An advanced tool for comprehensive analysis of Arabic texts, with multiple capabilities for analysis...

16
Experimental
66 giannirizzola/database-italiano-enigmistica-e-linguistica

database-italiano-enigmistica-e-linguistica

15
Experimental
67 pagesjaunes/spacy-french-models

French models for spacy

15
Experimental
68 anka335/information-theory

Implementations of information theory algorithms

14
Experimental
69 theodm/gtagger

Companion project to the gender-assistenz project for manual tagging of...

14
Experimental
70 hinrikur/IceNLPy

A Python wrapper for the Java-based IceNLP toolkit for Icelandic

13
Experimental
71 b05102139/acoustic_distance

Python implementation of acoustic distance in the paper "A New...

13
Experimental
72 shirayu/ita-corpus-chuwa

Chunked word annotation for ITA corpus

13
Experimental
73 latincy/latincy-guidelines

Annotation guidelines for LatinCy Latin NLP models

12
Experimental
74 osama-ata/siwar-api

📚 Non-official Python wrapper for the Siwar Arabic Lexicon API...

12
Experimental
75 techiaith/geiriau-mwyaf-aml

Lists of the most frequent Welsh and English words // Wordlists of the most...

12
Experimental
76 sakelariev/bulgarian-spacy-models

Bulgarian models for spaCy – tokenizer, trainable lemmatizer, POS tagger,...

12
Experimental
77 Urdatorn/grc-macronizer

Automatic annotation of Ancient Greek vowel length

12
Experimental
78 ayushmukati08/tcp-dictionary-server

A multi-threaded TCP client–server dictionary implemented in Python using...

11
Experimental
79 Sion1225/sorpus

Sentence OpeRations Processing UtilitieS.

11
Experimental
80 open-language/id-wordnet

Id-Wordnet is a node.js module which makes Bahasa Wordnet available as a package.

11
Experimental
81 imgeyuez/TafIn--Tagger-for-Intensifiers

A model which can be used for an automatic identification of intensifiers of...

11
Experimental
82 Popravljam/serbian-word-explorer

Language exploration tool for Serbian linguistics with dictionary search and...

10
Experimental
83 perfah/RSR

Refined Semantic Relatedness (RSR), a distributional semantics model.

10
Experimental
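
The tier labels in the listing line up with score bands. The sketch below encodes cutoffs inferred from the scores shown above (for example, 50 is Established while 49 is Emerging, and 30 is Emerging while 29 is Experimental). These thresholds are assumptions read off this listing, not a published scoring rule, and the Verified floor in particular is a guess: the page says only "above 70", and no listed score falls between 68 and 74.

```python
# Score floors inferred from the listing above; all values are assumptions.
# The Verified floor is the least certain (observed: 74 and 76 are Verified,
# 68 is Established).
TIER_FLOORS = [
    (70, "Verified"),
    (50, "Established"),
    (30, "Emerging"),
    (0, "Experimental"),
]


def tier_for(score):
    """Map a 0-100 score to the tier whose floor it meets first."""
    if not 0 <= score <= 100:
        raise ValueError(f"score out of range: {score}")
    for floor, tier in TIER_FLOORS:
        if score >= floor:
            return tier


# A few (score, tier) pairs copied from the listing, used as a sanity check.
SAMPLE = {
    "chatopera/Synonyms": (76, "Verified"),
    "goodmami/wn": (68, "Established"),
    "chrislit/abydos": (50, "Established"),
    "mideind/GreynirServer": (49, "Emerging"),
    "avidale/encodechka": (30, "Emerging"),
    "slgero/receipt_parser": (29, "Experimental"),
    "perfah/RSR": (10, "Experimental"),
}

# Names of sampled projects whose listed tier disagrees with the inferred bands.
mismatches = [name for name, (score, tier) in SAMPLE.items() if tier_for(score) != tier]
```

For the sampled rows, `mismatches` is empty, so the inferred bands agree with the listed tiers at every observed boundary.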