bukosabino/justicio

Building an assistant for Boletin Oficial del Estado (BOE) using Retrieval Augmented Generation (RAG)

47
/ 100
Emerging

Embeds BOE documents into a vector database using Spanish-tuned sentence transformers, then retrieves semantically similar articles via approximate nearest neighbor search to provide LLM context. Built on FastAPI, Langchain, and Qdrant, with daily ETL pipelines that chunk documents, generate embeddings, and store metadata for hybrid search capabilities (semantic, keyword, or combined). Supports multiple similarity metrics and operates a free public service while maintaining deployment flexibility for self-hosted instances.

138 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 21 / 25

How are scores calculated?

Stars

138

Forks

43

Language

HTML

License

MIT

Last pushed

Jul 17, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/bukosabino/justicio"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.