Legal Document Processing NLP Tools
Tools for extracting structured information from legal documents, parsing legal text, identifying legal concepts/citations, and organizing legal data. Does NOT include general contract analysis, legal research databases, or law-specific knowledge bases without document processing components.
There are 39 legal document processing tools tracked. 1 score above 50 (established tier). The highest-rated is discopy/discopy at 68/100 with 406 stars and 4,054 monthly downloads.
Get all 39 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=legal-document-processing&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
discopy/discopy
The Python toolkit for computing with string diagrams. |
|
Established |
| 2 |
jblake1965/eluciDoc
Screens legal text and extracts sentences containing user input party... |
|
Emerging |
| 3 |
LexPredict/lexpredict-lexnlp
LexNLP by LexPredict |
|
Emerging |
| 4 |
Liquid-Legal-Institute/Legal-Text-Analytics
A list of selected resources, methods, and tools dedicated to Legal Text Analytics. |
|
Emerging |
| 5 |
Neplex/ArchiTXT
ArchiTXT is an open source Python library that transforms unstructured text... |
|
Emerging |
| 6 |
openlegaldata/awesome-legal-data
A collection of datasets and other resources for legal text processing. |
|
Emerging |
| 7 |
Legilibre/legi.py
Outils de manipulation des archives LEGI (lois françaises) |
|
Emerging |
| 8 |
LukaVuli/Entity_Neutering
A methodology to pre-process text data for preventing lookahead bias in... |
|
Emerging |
| 9 |
maastrichtlawtech/bsard
🔍 A statutory article retrieval dataset in French. (ACL 2022) |
|
Emerging |
| 10 |
yinhao0214/ParseLawDocuments
对收集的法律文档进行一系列分析,包括根据规范自动切分、案件相似度计算、案件聚类、法律条文推荐等(试验目前基于婚姻类案件,可扩展至其它领域)。 |
|
Emerging |
| 11 |
nokia/codesearch
Models and datasets for annotated code search. |
|
Emerging |
| 12 |
ondata/normattiva_2_md
Trasforma i testi delle leggi italiane in formato leggibile e pronto per... |
|
Emerging |
| 13 |
DerwenAI/arxiv-trends
Analyze trends in articles published on arXiv |
|
Emerging |
| 14 |
medelman17/eyecite-ts
TypeScript legal citation extraction library with zero dependencies.... |
|
Emerging |
| 15 |
zeeuws-archief/ArchiveTextMiner
Transform textual information to structured metadata in MDTO-format. |
|
Experimental |
| 16 |
Starscream-11813/MathBot
MathBot is a transformer-based Math Word Problem (MWP) solver made as the... |
|
Experimental |
| 17 |
MI2DataLab/HADES
A powerful tool for comparing similarly structured documents |
|
Experimental |
| 18 |
liamcripwell/disco_split
Code and data for discourse-based sentence splitting experiments. |
|
Experimental |
| 19 |
tvhahn/arxiv-code-search
Do authors on arXiv make their code and data available? We're building text... |
|
Experimental |
| 20 |
george-gca/ai_papers_cleaner
Extract text from papers PDFs and abstracts, and remove uninformative words. |
|
Experimental |
| 21 |
mastaal/uitspraken
a simple Python program to easily load in Dutch court decision XML-files as... |
|
Experimental |
| 22 |
mastaal/nllegalcit
A Python library to find citations to Dutch legal documents in natural... |
|
Experimental |
| 23 |
organvm-i-theoria/linguistic-atomization-framework
LingFrame — computational rhetoric platform: hierarchical text atomization,... |
|
Experimental |
| 24 |
openeventdata/PLOVER
Next generation event data ontology |
|
Experimental |
| 25 |
thejeswi/BobGoesToJail
A semantic law interpreter for the English translations for the German... |
|
Experimental |
| 26 |
bflashcp3f/textlabs-xwlp-code
EACL 2021 "Process-Level Representation of Scientific Protocols with... |
|
Experimental |
| 27 |
MUSC-TBIC/etude-engine
ETUDE (Evaluation Tool for Unstructured Data and Extractions) is a... |
|
Experimental |
| 28 |
chigwell/legalysis
legalysis extracts parties, issues, outcomes, and lessons from case texts... |
|
Experimental |
| 29 |
DaBr01/AGB-DE
A corpus and models for the automated legal assessment of clauses in German... |
|
Experimental |
| 30 |
fanta-mnix/nlp-contract-analysis
NLP-based Contract Analysis |
|
Experimental |
| 31 |
phHartl/eu-judgement-analyse
Quantitative analysis of judgments of the European Court of Justice |
|
Experimental |
| 32 |
justmars/citation-utils
Docket citation regexes from Philippine Supreme Court decisions |
|
Experimental |
| 33 |
TLP-COI/tlp-coi-docs
Governance, contribution guidance, and project-planning documentation for the TLP-CoI |
|
Experimental |
| 34 |
ssciwr/argumentation-management
Annotator combining different NLP pipelines. |
|
Experimental |
| 35 |
justmars/citation-report
Parse legal citations having the publisher format - i.e. SCRA, PHIL, OFFG -... |
|
Experimental |
| 36 |
justmars/citation-date
This is a dependency: a regex date formula and decoder for dates referenced... |
|
Experimental |
| 37 |
berkearda/croissantminer
Automated metadata extraction from ML dataset papers using LLMs and the... |
|
Experimental |
| 38 |
innerNULL/monoml
Mono Implementations' Archive |
|
Experimental |
| 39 |
askmuhsin/legal_maxims
legal maxims dataset |
|
Experimental |