Automatic Term Extraction NLP Tools

Tools for automatically identifying and extracting domain-specific terms, technical terminology, and named entities from unstructured text documents. Does NOT include general named entity recognition (NER), keyword extraction for intent analysis, or fact extraction.

There are 39 automatic term extraction tools tracked. 4 score above 50 (established tier). The highest-rated is ziqizhang/jate at 66/100 with 84 stars and 473 monthly downloads.

Get all 39 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=automatic-term-extraction&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 ziqizhang/jate

JATE - Just Automatic Term Extraction (in Python)

66
Established
2 mcs07/ChemDataExtractor

Automatically extract chemical information from scientific documents

58
Established
3 mmmaurer/elfen

A python package to efficiently extract linguistic features for text/NLP datasets

52
Established
4 brucewlee/lftk

[BEA @ ACL 2023] General-purpose tool for linguistic features extraction;...

52
Established
5 strangetom/ingredient-parser

A tool to parse recipe ingredients into structured data

48
Emerging
6 pryndor/Lixplore_cli

A powerful Unix-inspired command-line tool for searching scientific...

45
Emerging
7 explosion/projects

🪐 End-to-end NLP workflows from prototype to production

44
Emerging
8 swabhs/open-sesame

A frame-semantic parsing system based on a softmax-margin SegRNN.

42
Emerging
9 FACTSlab/glazing

Unified data models and interfaces for syntactic and semantic frame ontologies.

37
Emerging
10 kevinlu1248/pyate

PYthon Automated Term Extraction

36
Emerging
11 zjunlp/OntoProtein

[ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding

35
Emerging
12 gorgitko/molminer

Python library and command-line tool for extracting compounds from...

35
Emerging
13 bafgreat/fairmofsyncondition

A robust Python module for predicting the synthesis conditions of MOFs. It...

34
Emerging
14 brucewlee/lingfeat

[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction...

33
Emerging
15 robinvanschaik/flair-on-gcp

This repository adds examples on how to train Flair on Google Cloud Platform...

29
Experimental
16 aoldoni/tetre

TETRE: a Toolkit for Exploring Text for Relation Extraction

29
Experimental
17 ljvmiranda921/spacy-span-analyzer

Simple tool to analyze spans in your dataset. Implementation of Papay et...

28
Experimental
18 buaaliuming/Awesome-Resources-for-Scholarly-Big-Data

Tools, datasets, Corpus and Venue Challenge for scholarly big data——Pick up...

27
Experimental
19 CederGroupHub/text2chem

RegEx-based text parser that converts chemical terms and material entities...

27
Experimental
20 peter-grajcar/clause-extraction

Utility for clause extraction from complex sentences

24
Experimental
21 prit2596/NLP-Template-Extraction

Template Extraction from unstructured Wikipedia text using NLP techniques.

23
Experimental
22 dnaaun/openFraming

Tools for automatic frame discovery and labeling based on topic modeling and...

23
Experimental
23 hexuandeng/NewTerm

Implementation for our paper “NewTerm: Benchmarking Real-Time New Terms for...

22
Experimental
24 Fendidrip/design-resources-project

📂 Discover open-access databases for design researchers to find journals and...

22
Experimental
25 k-kaundal/sah-kse

Semantic Adaptive Hash & Knowledge Seed Engine — compress knowledge into...

22
Experimental
26 michaelmml/NLP-Information-Extraction

Automated PDF and text processing with Spacy and NLTK; information...

20
Experimental
27 ParvaShah/Template_Extraction_NLP

This project is about Template Extraction from a document using NLP Techniques

20
Experimental
28 theseekersutd/Research-Paper-Template-Extractor

Given repository extracts the templates from research papers using natural...

18
Experimental
29 RaziehZare/Speech-Processing-Ontology

A formal OWL ontology representing 79 core concepts in speech processing....

16
Experimental
30 Abhinand20/AUTO-ONTO

Tool to automatically extract keyphrases from text spanning across vast...

16
Experimental
31 seanox/seanox-ai-nlp

Modular NLP tools for domain-specific semantic matching and structured data

15
Experimental
32 hay/wiki-text-nlp

Extract 'Did you know?' facts from Wikipedia articles

14
Experimental
33 honghanhh/terminology-extraction

Terminology extraction on ACTER using Transformer-based language models

13
Experimental
34 ispasic/FlexiTerm-Python

Repository for FlexiTerm: a software tool to automatically recognise...

12
Experimental
35 harshit158/paper-dots

Automatic insights extraction and annotation tool from research papers

11
Experimental
36 alipascal/extract-rdf-from-text

Projet universitaire d'extraction d'entités triplets (sujet-prédicat-objet)...

11
Experimental
37 jeredhiggins/KeyIntentNER-T

KeyIntentNER-T is a Keyword Intent, Named Entity Recognition (NER), & Google...

11
Experimental
38 scarandriy/NeuroMine

LLM-powered pipeline for mining and structuring neuroprotective compound...

11
Experimental
39 nicolaCirillo/termdomain

A domain-aware automatic term extraction tool.

10
Experimental