freelawproject/inception

Our microservice for generating embeddings from blocks of text

/ 100

Established

Combines SentenceTransformers with intelligent sentence-boundary chunking to embed legal documents efficiently—queries run on CPU for speed, while longer court opinions leverage GPU acceleration with configurable token limits and overlap ratios. Built as a FastAPI service with Prometheus metrics, Sentry error tracking, and batch processing capabilities, it targets legal document similarity and semantic search workflows.

No Package No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

BSD-2-Clause

Featured in

Embeddings Are Easier Than Whatever You're Doing Instead

Related tools

FlagOpen/FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Blaizzy/mlx-embeddings

MLX-Embeddings is the best package for running Vision and Language Embedding models locally on...

qdrant/fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Merck/Sapiens

Sapiens is a human antibody language model based on BERT.

amansrivastava17/embedding-as-service

One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques

Explore Embedding Tools

All categories Trending Embeddings directory Insights