leoneversberg/llm-chatbot-rag

A local LLM chatbot with RAG for PDF input files

/ 100

Emerging

Leverages Hugging Face models with optional bitsandbytes GPU quantization for inference optimization, and provides a Streamlit-based UI for interactive chat sessions. Supports multiple LLM architectures (including Gemma) via configurable model loading, with RAG implementation that processes PDF documents into searchable context for prompt augmentation. Designed for local deployment with flexible hardware requirements—quantization targets NVIDIA GPUs, but falls back to CPU inference with adjusted configuration.

No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

Cinnamon/kotaemon

An open-source RAG-based tool for chatting with your documents.

BastinFlorian/RAG-Chatbot-with-Confluence

RAG Chatbot with Confluence

olioDuan/Domain-Specific-RAG-Chat-Course-Helper

This project is a Local RAG system built for the NYU Machine Learning course.

antoinelrnld/discord-rag

Easily create a RAG based on your Discord messages

Mohannadcse/DepsRAG

Interactive LLM Chatbot that constructs direct and transitive software dependencies as a...

Explore RAG Tools

All categories Trending RAG directory Insights