fredsiika/huxley-pdf

Upload personal docs and Chat with your PDF files with this GPT4-powered app. Built with LangChain, Pinecone Vector Database, deployed on Streamlit

/ 100

Emerging

Implements semantic search over PDF documents using FAISS vector indexing with OpenAI embeddings, enabling similarity-based retrieval before passing context to GPT-4 for question-answering. The architecture chunks PDFs with configurable overlap (400-char chunks, 80-char overlap) using LangChain's text splitters, then constructs a retrieval-augmented generation (RAG) pipeline that surfaces the most relevant document segments to answer user queries while tracking token usage via OpenAI callbacks.

No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 9 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

athrael-soju/Snappy

🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for...

aakashsharan/research-vault

AI research assistant that extracts structured patterns from papers using RAG, LangGraph, and...

roberto729a/OllamaRAG

🤖 Build a smart AI assistant that learns from any website using a Retrieval-Augmented Generation...

waterpare833/Novel-Assistant

로컬 또는 클라우드 LLM을 사용해 문서 폴더 기반으로 검색·답변하는 RAG 어시스턴트

yousefmohtady1/CorpGuideAI-HR-Policy-Assistant

CorpGuide AI Backend: An intelligent HR Policy Assistant powered by RAG, Groq, and LangChain....

Explore Vector Databases

All categories Trending Vector Database directory Insights