MultiX0/last-archive
A local-first RAG engine for web archival and semantic search. Crawl, embed, and query your own knowledge base entirely offline.
Implements a microservices architecture with a Go-based high-concurrency crawler, a Python embedding service, and a Node.js orchestration layer that communicate over Docker's internal network, backed by Qdrant for vector storage and SQLite for metadata. Integrates Ollama for local LLM inference through an OpenAI-compatible Go bridge, enabling fully offline RAG with semantic search across archived content. Performance and accuracy depend directly on crawl volume: the system needs a substantial amount of indexed data to generate meaningful responses.
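To make the semantic search step concrete, here is a minimal sketch of embedding-based retrieval in Go. It is not the project's code: in last-archive this ranking is delegated to Qdrant, and the `Doc` type, the `cosine` and `topK` helpers, and the two-dimensional toy vectors are all assumptions made for illustration.

```go
package main

import (
	"fmt"
	"math"
	"sort"
)

// Doc is a hypothetical archived page with a precomputed embedding.
type Doc struct {
	URL       string
	Embedding []float64
}

// cosine returns the cosine similarity of a and b.
// It returns 0 for empty, mismatched, or zero-length vectors.
func cosine(a, b []float64) float64 {
	if len(a) != len(b) || len(a) == 0 {
		return 0
	}
	var dot, na, nb float64
	for i := range a {
		dot += a[i] * b[i]
		na += a[i] * a[i]
		nb += b[i] * b[i]
	}
	if na == 0 || nb == 0 {
		return 0
	}
	return dot / (math.Sqrt(na) * math.Sqrt(nb))
}

// topK returns the k documents most similar to the query embedding.
// The input slice is copied, so the caller's ordering is preserved.
func topK(query []float64, docs []Doc, k int) []Doc {
	ranked := make([]Doc, len(docs))
	copy(ranked, docs)
	sort.SliceStable(ranked, func(i, j int) bool {
		return cosine(query, ranked[i].Embedding) > cosine(query, ranked[j].Embedding)
	})
	if k < 0 {
		k = 0
	}
	if k > len(ranked) {
		k = len(ranked)
	}
	return ranked[:k]
}

func main() {
	// Toy corpus; real embeddings have hundreds of dimensions.
	docs := []Doc{
		{URL: "https://example.com/a", Embedding: []float64{1, 0}},
		{URL: "https://example.com/b", Embedding: []float64{0, 1}},
		{URL: "https://example.com/c", Embedding: []float64{0.9, 0.1}},
	}
	for _, d := range topK([]float64{1, 0}, docs, 2) {
		fmt.Println(d.URL)
	}
}
```

A brute-force scan like this only works for small corpora. A vector database such as Qdrant replaces it with an approximate nearest-neighbor index, which is what makes large crawls practical.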
Stars: 22
Forks: 8
Language: Go
License: MIT
Category:
Last pushed: Jan 31, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/MultiX0/last-archive"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000 requests/day.
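The same request can be made from Go. The sketch below assumes the endpoint follows the pattern `/api/v1/quality/<category>/<owner>/<repo>`, which is inferred only from the single URL shown above. The response schema is not documented here, so the body is returned as raw bytes instead of being decoded into a struct. The helper names `qualityURL` and `fetchQuality` are hypothetical.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/url"
	"strings"
	"time"
)

// baseURL is taken from the curl example; the path layout after it is an assumption.
const baseURL = "https://pt-edge.onrender.com/api/v1/quality"

// qualityURL builds the endpoint for a category and an "owner/name" repository,
// escaping each path segment separately so the slash in the repo name is kept.
func qualityURL(category, repo string) string {
	parts := strings.Split(repo, "/")
	for i, p := range parts {
		parts[i] = url.PathEscape(p)
	}
	return baseURL + "/" + url.PathEscape(category) + "/" + strings.Join(parts, "/")
}

// fetchQuality performs the GET request and returns the raw response body.
func fetchQuality(client *http.Client, endpoint string) ([]byte, error) {
	resp, err := client.Get(endpoint)
	if err != nil {
		return nil, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return nil, fmt.Errorf("unexpected status: %s", resp.Status)
	}
	return io.ReadAll(resp.Body)
}

func main() {
	// A timeout keeps the call from hanging if the service is slow to respond.
	client := &http.Client{Timeout: 10 * time.Second}
	body, err := fetchQuality(client, qualityURL("rag", "MultiX0/last-archive"))
	if err != nil {
		fmt.Println("request failed:", err)
		return
	}
	fmt.Println(string(body))
}
```

With the keyless limit of 100 requests/day, it is worth caching responses locally instead of calling the endpoint on every run.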
Higher-rated alternatives
ConardLi/easy-dataset
A powerful tool for creating datasets for LLM fine-tuning, RAG, and Eval
ItzCrazyKns/Vane
Vane is an AI-powered answering engine.
DS4SD/deepsearch-toolkit
Interact with the Deep Search platform for new knowledge explorations and discoveries
xuwei95/ezdata
A data processing and task scheduling system built with Python and large language models (LLMs). ...
ModelEngine-Group/DataMate
DataMate is an enterprise-level data processing platform designed for model fine-tuning and RAG...