LLM-Implementation/private-rag-embeddinggemma
🔒 100% Private RAG Stack with EmbeddingGemma, SQLite-vec & Ollama - Zero Cost, Offline Capable
Implements semantic search through vector embeddings stored in SQLite with sub-millisecond query performance, then routes results to a local LLM via Ollama's API for context-aware generation. Uses EmbeddingGemma's configurable 256/768-dimension embeddings to balance speed and quality, with UV for dependency management and support for Jupyter notebooks in isolated virtual environments.
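The storage-and-search half of such a stack can be sketched with only the Python standard library. This is a hedged illustration, not the repo's code: toy 8-dimension vectors stand in for real EmbeddingGemma output, a brute-force cosine scan stands in for sqlite-vec's indexed query, and names like `search` and the `chunks` schema are invented for the example.

```python
import math
import sqlite3
import struct

def normalize(vec):
    """Scale to unit length so a plain dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]

def truncate(vec, dim):
    """Matryoshka-style truncation: keep the leading `dim` components, re-normalize."""
    return normalize(vec[:dim])

def serialize(vec):
    """Pack floats into a BLOB for storage in SQLite."""
    return struct.pack(f"{len(vec)}f", *vec)

def deserialize(blob):
    return list(struct.unpack(f"{len(blob) // 4}f", blob))

def search(con, query_vec, k=3):
    """Brute-force cosine scan over all rows; sqlite-vec's virtual table
    would replace this linear pass with an indexed vector query."""
    rows = con.execute("SELECT text, embedding FROM chunks").fetchall()
    scored = sorted(
        ((sum(q * d for q, d in zip(query_vec, deserialize(blob))), text)
         for text, blob in rows),
        reverse=True,
    )
    return [text for _, text in scored[:k]]

# Toy 8-dim "embeddings" truncated to 4 dims; a real pipeline would call
# EmbeddingGemma (e.g. through Ollama) to embed each chunk instead.
DIM = 4
corpus = {
    "Cats are small felines.": [1, 0, 0, 0, 0, 0, 0, 0],
    "Dogs are loyal canines.": [0, 1, 0, 0, 0, 0, 0, 0],
    "SQLite is an embedded database.": [0, 0, 1, 0, 0, 0, 0, 0],
}

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE chunks (id INTEGER PRIMARY KEY, text TEXT, embedding BLOB)")
for text, vec in corpus.items():
    con.execute("INSERT INTO chunks (text, embedding) VALUES (?, ?)",
                (text, serialize(truncate(vec, DIM))))

query = truncate([0.9, 0.1, 0, 0, 0, 0, 0, 0], DIM)
print(search(con, query, k=1))  # → ['Cats are small felines.']
```

The retrieved chunks would then be interpolated into a prompt and sent to the local LLM via Ollama's API for generation; truncating 768-dimension vectors to 256 trades a little retrieval quality for smaller storage and faster scans.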
No commits in the last 6 months.
Stars: 11
Forks: 9
Language: Jupyter Notebook
License: —
Category: —
Last pushed: Sep 10, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/LLM-Implementation/private-rag-embeddinggemma"
Open to everyone: 100 requests/day with no key required; a free key raises the limit to 1,000/day.
Higher-rated alternatives
kreuzberg-dev/kreuzberg-surrealdb
Extract, chunk, and embed documents from 88+ formats directly into SurrealDB.
sudhanshug16/chromadb-cli
CLI to interact with ChromaDB (https://github.com/chroma-core/chroma)
jmiba/zotero-redisearch-rag
An Obsidian plugin that synchronizes selected Zotero full-text items with your vault in...
sanketvagal/rag-notes
RAG system that lets you chat with your Obsidian/Markdown notes — chunks by headers, embeds with...
Vatsal-Founder/Hybrid-Search-with-LangChain-and-Pinecone
Hybrid search RAG system combining BM25 sparse + dense embeddings via LangChain and Pinecone 35%...