spark-nlp and nlu

spark-nlp and nlu are sibling projects in the John Snow Labs NLP ecosystem: NLU provides a simplified, high-level API for the NLP models and functionality implemented by Spark NLP.

spark-nlp: 70 (Verified)
Maintenance 20/25, Adoption 10/25, Maturity 16/25, Community 24/25
Stars: 4,116
Forks: 739
Downloads:
Commits (30d): 19
Language: Scala
License: Apache-2.0
No Package, No Dependents

nlu: 48 (Emerging)
Maintenance 0/25, Adoption 10/25, Maturity 16/25, Community 22/25
Stars: 961
Forks: 140
Downloads:
Commits (30d): 0
Language: Python
License:
Stale 6m, No Package, No Dependents
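
The page does not state how the overall score is computed, but for both projects the listed total matches the sum of the four category scores, each out of 25. The sketch below checks that arithmetic under that assumption. The function name `total_score` and the dictionary layout are illustrative, not part of the scoring site.

```python
# Category scores as listed on this page; each category is out of 25.
MAX_PER_CATEGORY = 25

scores = {
    "spark-nlp": {"Maintenance": 20, "Adoption": 10, "Maturity": 16, "Community": 24},
    "nlu": {"Maintenance": 0, "Adoption": 10, "Maturity": 16, "Community": 22},
}


def total_score(subscores):
    """Sum the category scores (assumed formula: a plain, unweighted sum)."""
    for category, value in subscores.items():
        if not 0 <= value <= MAX_PER_CATEGORY:
            raise ValueError(f"{category} score {value} is outside 0..{MAX_PER_CATEGORY}")
    return sum(subscores.values())


totals = {name: total_score(subscores) for name, subscores in scores.items()}
print(totals)  # {'spark-nlp': 70, 'nlu': 48}
```

Under this reading, the 22-point gap between the two projects comes almost entirely from Maintenance (20 versus 0), with Community contributing the remaining 2 points.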

About spark-nlp

JohnSnowLabs/spark-nlp

State of the Art Natural Language Processing

Builds on Apache Spark for distributed NLP at scale, supporting 100,000+ pretrained pipelines and models across 200+ languages. Enables transformer architectures (BERT, RoBERTa, GPT-2, Llama, etc.) natively on JVM ecosystems (Java, Scala, Kotlin) while supporting model imports from TensorFlow, ONNX, OpenVINO, and GGUF formats. Covers end-to-end tasks including tokenization, embeddings, NER, machine translation, question answering, image captioning, and speech recognition.
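
As a rough illustration of the pretrained pipelines mentioned above, the following sketch uses Spark NLP's Python bindings to run a pretrained pipeline on a single string and return the recognized entities. It assumes the `spark-nlp` and `pyspark` packages are installed and that the `explain_document_dl` English pipeline can be downloaded on first use. The helper name `extract_entities` is hypothetical.

```python
def extract_entities(text, pipeline_name="explain_document_dl", lang="en"):
    """Annotate `text` with a pretrained Spark NLP pipeline and return its entities.

    `extract_entities` is an illustrative helper, not part of Spark NLP.
    """
    # Imported lazily so this module loads even where Spark NLP is not installed.
    import sparknlp
    from sparknlp.pretrained import PretrainedPipeline

    # Start (or reuse) a Spark session configured for Spark NLP.
    sparknlp.start()

    # Download the pipeline if needed, then load it.
    pipeline = PretrainedPipeline(pipeline_name, lang=lang)

    # For a single string, annotate() returns a dict mapping output columns to lists.
    annotations = pipeline.annotate(text)
    return annotations.get("entities", [])
```

Calling `extract_entities("Google was founded in California.")` would start a local Spark session and download the pipeline on the first run, so the first call can take a while.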

About nlu

JohnSnowLabs/nlu

1 line for thousands of State of The Art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
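
To make the "1 line" claim concrete, here is a minimal sketch of NLU's load-and-predict pattern for sentiment analysis. It assumes the `nlu` and `pyspark` packages are installed and that the `sentiment` model reference resolves and downloads on first use. The wrapper name `predict_sentiment` is hypothetical.

```python
def predict_sentiment(texts):
    """Run NLU's pretrained 'sentiment' pipeline on a string or a list of strings.

    `predict_sentiment` is an illustrative wrapper, not part of NLU.
    """
    # Imported lazily so this module loads even where NLU is not installed.
    import nlu

    # The "one line": load a pretrained pipeline by name and predict on raw text.
    return nlu.load("sentiment").predict(texts)
```

Calling `predict_sentiment("I love NLU!")` returns a pandas DataFrame of predictions. Spark NLP still does the work underneath, which is why the model download and Spark startup costs apply here as well.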

Scores updated daily from GitHub, PyPI, and npm data.