MultiX0/last-archive

A local-first RAG engine for web archival and semantic search. Crawl, embed, and query your own knowledge base entirely offline.

/ 100

Emerging

Implements a microservices architecture with a Go-based high-concurrency crawler, Python embedding service, and Node.js orchestration layer communicating over Docker's internal network, combined with Qdrant vector storage and SQLite metadata. Integrates Ollama for local LLM inference via an OpenAI-compatible Go bridge, enabling fully offline RAG with semantic search across archived content. Performance and accuracy are directly tied to crawl volume—the system requires substantial indexed data to generate meaningful responses.

No Package No Dependents

Maintenance 10 / 25

Adoption 6 / 25

Maturity 9 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

License

MIT

Higher-rated alternatives

ConardLi/easy-dataset

A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval

ItzCrazyKns/Vane

Vane is an AI-powered answering engine.

DS4SD/deepsearch-toolkit

Interact with the Deep Search platform for new knowledge explorations and discoveries

xuwei95/ezdata

基于python和llm大模型开发的数据处理和任务调度系统。...

ModelEngine-Group/DataMate

DataMate is an enterprise-level data processing platform designed for model fine-tuning and RAG...

Explore RAG Tools

All categories Trending RAG directory Insights