r2d4/react-llm
Easy-to-use headless React Hooks to run LLMs in the browser with WebGPU. Just useLLM().
Leverages Apache TVM and MLC Relax compiled to WebAssembly to execute Vicuna models entirely on-device, with model weights and the SentencePiece tokenizer cached in browser storage for faster subsequent loads. Inference is offloaded to a Web Worker to avoid blocking the main thread, while conversation state persists to localStorage. The modular monorepo includes companion packages for pre-built retro UI components and a Chrome extension, supporting both headless integration and turnkey deployment.
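The localStorage persistence mentioned above can be sketched as follows. This is an illustrative sketch only: the `Conversation` shape, the storage key, and the helper names are assumptions for this example, not react-llm's actual schema or API.

```typescript
// Sketch of conversation-state persistence in the style react-llm uses with
// localStorage. All type shapes and names here are illustrative assumptions.

interface Message {
  role: "user" | "assistant";
  text: string;
}

interface Conversation {
  id: string;
  messages: Message[];
}

// Hypothetical key; react-llm's real key may differ.
const STORAGE_KEY = "conversation";

// A Storage-like interface so the sketch also runs outside the browser.
interface KVStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

function saveConversation(store: KVStore, conv: Conversation): void {
  // Serialize the whole conversation; cheap for chat-sized payloads.
  store.setItem(STORAGE_KEY, JSON.stringify(conv));
}

function loadConversation(store: KVStore): Conversation | null {
  const raw = store.getItem(STORAGE_KEY);
  return raw ? (JSON.parse(raw) as Conversation) : null;
}

// In-memory stand-in for window.localStorage.
const memoryStore: KVStore = (() => {
  const data = new Map<string, string>();
  return {
    getItem: (k) => data.get(k) ?? null,
    setItem: (k, v) => void data.set(k, v),
  };
})();

saveConversation(memoryStore, {
  id: "c1",
  messages: [{ role: "user", text: "Hello" }],
});
const restored = loadConversation(memoryStore);
console.log(restored?.messages[0].text); // prints "Hello"
```

In the browser, `memoryStore` would simply be `window.localStorage`, which already satisfies the `KVStore` shape used here.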
702 stars. No commits in the last 6 months.
Stars: 702
Forks: 32
Language: TypeScript
License: MIT
Last pushed: Jun 27, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/r2d4/react-llm"
Open to everyone: 100 requests/day with no key needed; a free key raises the limit to 1,000/day.
Higher-rated alternatives
mlc-ai/web-llm
High-performance In-browser LLM Inference Engine
e2b-dev/desktop
E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can...
geekjr/quickai
QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art...
chrisrobison/textweb
A text-grid web renderer for AI agents — see the web without screenshots
Azure-Samples/llama-index-javascript
This sample shows how to quickly get started with LlamaIndex.ai on Azure 🚀