teamunitlab/rag-document-app

This FastAPI-based RAG service processes OCR output, generates embeddings with OpenAI, and uses Pinecone as the vector database for search. It then uses OpenAI to answer questions from the retrieved results.

/ 100

Experimental

Supports multi-format document ingestion (PDFs, images) with AWS S3 storage, implements rate limiting and Redis caching for performance, and includes API key authentication. The pipeline chains OCR extraction → tokenization → OpenAI embeddings → Pinecone vector storage → semantic search with LLM-generated answers, all deployable via Docker Compose.
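The pipeline described above can be sketched end to end in a few lines. This is a minimal illustration only, not the repository's code: the embedding step is replaced by a toy normalized term-count vector instead of an OpenAI embedding call, the vector store is a hypothetical in-memory `InMemoryIndex` class instead of Pinecone, and the final LLM call is left out, so the sketch stops at building the prompt that would be sent to the model. All function and class names here are assumptions made for illustration.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass, field


def tokenize(text: str) -> list[str]:
    """Lowercase OCR text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def embed(text: str) -> dict[str, float]:
    """Toy stand-in for an embedding API: L2-normalized term counts."""
    counts = Counter(tokenize(text))
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {tok: c / norm for tok, c in counts.items()}


@dataclass
class InMemoryIndex:
    """Hypothetical stand-in for a vector database, ranked by cosine similarity."""
    rows: list[tuple[str, dict[str, float], str]] = field(default_factory=list)

    def upsert(self, doc_id: str, text: str) -> None:
        # Store the id, its vector, and the raw text for later prompting.
        self.rows.append((doc_id, embed(text), text))

    def query(self, question: str, top_k: int = 1) -> list[tuple[str, float, str]]:
        q = embed(question)
        scored = [
            (doc_id, sum(w * vec.get(tok, 0.0) for tok, w in q.items()), text)
            for doc_id, vec, text in self.rows
        ]
        return sorted(scored, key=lambda row: row[1], reverse=True)[:top_k]


def build_prompt(question: str, hits: list[tuple[str, float, str]]) -> str:
    """Assemble the retrieved passages and the question into one LLM prompt."""
    context = "\n".join(text for _, _, text in hits)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Pretend these strings came out of the OCR step.
index = InMemoryIndex()
index.upsert("invoice-1", "Invoice total is 420 euros, due on 1 March.")
index.upsert("memo-7", "The office will be closed on public holidays.")

question = "What is the invoice total?"
hits = index.query(question, top_k=1)
prompt = build_prompt(question, hits)
```

In a real deployment the `embed` and `InMemoryIndex` pieces would be swapped for calls to the embedding provider and the vector database, and `prompt` would be passed to a chat model to produce the answer.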

No commits in the last 6 months.

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 1 / 25

Community 16 / 25

Language: Python

License: none

Higher-rated alternatives

QmiAI/Qmedia

An open-source AI content search engine designed specifically for content creators. Supports...

mazzasaverio/fastapi-langchain-rag

(Let's start with a) Scalable question-answering system utilizing FastAPI, LangChain (LCEL), and...

charliewei0716/on-your-data-with-streamlit

Showcase the use of Azure OpenAI's native On Your Data feature and integrates it with Streamlit,...

ben-ogden/pinecone-rag

Using Pinecone, LangChain + OpenAI for Generative Q&A with Retrieval Augmented Generation (RAG).

thevladdo/rag-backend

Retrieval-Augmented Generation server with Pinecone and OpenAI
