langextract and lextract
About langextract
google/langextract
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Supports multiple LLM providers (Google Gemini, OpenAI, local Ollama models) with pluggable custom providers, and uses optimized chunking and parallel processing to handle long documents efficiently. Enforces schema-compliant outputs through controlled generation on supported models, while automatically detecting hallucinated extractions that don't ground to source text. Generates interactive HTML visualizations that map each extracted entity back to its precise character position in the original document for verification and review.
About lextract
ycastorium/lextract
LLM-powered text extraction library for Elixir
Scores updated daily from GitHub, PyPI, and npm data. How scores work