fredsiika/huxley-pdf
Upload personal docs and chat with your PDF files with this GPT-4-powered app. Built with LangChain and the Pinecone vector database, deployed on Streamlit.
Implements semantic search over PDF documents using FAISS vector indexing with OpenAI embeddings, enabling similarity-based retrieval before passing context to GPT-4 for question-answering. The architecture chunks PDFs with configurable overlap (400-char chunks, 80-char overlap) using LangChain's text splitters, then constructs a retrieval-augmented generation (RAG) pipeline that surfaces the most relevant document segments to answer user queries while tracking token usage via OpenAI callbacks.
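The chunking step described above can be illustrated with a minimal sketch. It uses plain Python rather than LangChain's text splitters, so it cuts at fixed character offsets and does not look for separators such as paragraph or sentence breaks. The function name `chunk_text` is hypothetical and is not taken from the repository; only the 400-character chunk size and 80-character overlap come from the description.

```python
def chunk_text(text: str, chunk_size: int = 400, overlap: int = 80) -> list[str]:
    """Split text into fixed-size character windows that overlap.

    Hypothetical helper for illustration only; the defaults mirror the
    400-char chunks and 80-char overlap mentioned in the description.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # each window starts this far after the previous one
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once a window has reached the end of the text, so the tail
        # is not emitted again as a shorter duplicate chunk.
        if start + chunk_size >= len(text):
            break
    return chunks


# Example input: 1000 characters of repeating lowercase letters.
sample = "".join(chr(97 + i % 26) for i in range(1000))
chunks = chunk_text(sample)
```

With these defaults each window starts 320 characters after the previous one, so the last 80 characters of one chunk are repeated at the start of the next. The overlap keeps a sentence that straddles a boundary retrievable from at least one chunk.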
No commits in the last 6 months.
Stars: 37
Forks: 9
Language: Python
License: MIT
Category:
Last pushed: Dec 15, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/fredsiika/huxley-pdf"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
athrael-soju/Snappy
Snappy's unique approach unifies vision-language late interaction with structured OCR for...
aakashsharan/research-vault
AI research assistant that extracts structured patterns from papers using RAG, LangGraph, and...
roberto729a/OllamaRAG
Build a smart AI assistant that learns from any website using a Retrieval-Augmented Generation...
waterpare833/Novel-Assistant
A RAG assistant that searches and answers questions over a document folder using a local or cloud LLM
yousefmohtady1/CorpGuideAI-HR-Policy-Assistant
CorpGuide AI Backend: An intelligent HR Policy Assistant powered by RAG, Groq, and LangChain....