jasonacox/TinyLLM

Setup and run a local LLM and Chatbot using consumer grade hardware.

Score: 49 / 100 (Emerging)

Supports multiple inference backends (Ollama, vLLM, llama-cpp-python) with OpenAI API compatibility, enabling flexible deployment across different hardware constraints. The chatbot layer adds RAG capabilities, including URL summarization, news aggregation, stock and weather lookups, and vector-database integration for knowledge retrieval. The architecture uses containerized services with persistent model caching and multi-session support, with a FastAPI frontend querying the inference backend.
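Because all three backends expose an OpenAI-compatible API, a client only needs to build a standard chat-completions payload and point it at the local endpoint. The sketch below shows that request shape; the base URL and model name are placeholders, not values taken from the project, so substitute whatever your backend actually serves.

```python
import json

# Minimal sketch of a chat request against an OpenAI-compatible local
# backend (Ollama, vLLM, or llama-cpp-python, as described above).
# BASE_URL and the model id are assumptions -- adjust for your setup.
BASE_URL = "http://localhost:8000/v1"  # assumed local endpoint

payload = {
    "model": "local-model",  # placeholder model id
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Summarize the latest headlines."},
    ],
    "temperature": 0.7,
}

# POST this body to f"{BASE_URL}/chat/completions" with any OpenAI client
body = json.dumps(payload)
print(body[:60])
```

The same payload works unchanged across backends, which is what lets the chatbot layer swap inference engines without code changes.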


No package; no dependents.

Maintenance: 6 / 25
Adoption: 10 / 25
Maturity: 16 / 25
Community: 17 / 25


Stars: 319
Forks: 37
Language: JavaScript
License: MIT
Last pushed: Nov 23, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/jasonacox/TinyLLM"

Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000/day.
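The same endpoint can be called from code. The helper below builds the request for any owner/repo pair; the `Authorization: Bearer <key>` header is an assumption about the auth scheme, not something stated above, so verify it against the API's documentation before relying on it.

```python
import urllib.request

# Base URL taken from the curl example above.
BASE = "https://pt-edge.onrender.com/api/v1/quality/rag"

def build_request(owner, repo, api_key=None):
    """Build a GET request for a repo's quality data.

    api_key is optional; the bearer-token header used here is an
    assumed auth scheme for the 1,000/day keyed tier.
    """
    url = f"{BASE}/{owner}/{repo}"
    headers = {"Accept": "application/json"}
    if api_key:
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(url, headers=headers)

# Construct (but do not send) the request from the curl example:
req = build_request("jasonacox", "TinyLLM")
print(req.full_url)
```

Pass the request to `urllib.request.urlopen(req)` to actually fetch the JSON payload.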