avilum/minrlm
Token-efficient Recursive Language Model. 3.6x fewer tokens than vanilla LLMs. Data never enters the prompt.
Implements a REPL-based execution model in which the LLM generates Python code to query the data directly, keeping the raw context out of the prompt entirely. Uses entropy profiling via zlib compression to identify relevant sections, and task-specific routing to optimize code patterns for structured-data, search, math, and code-retrieval tasks. Execution is wrapped in a DockerREPL sandbox (seccomp, stdlib-only), and smaller sub-tasks can optionally be delegated to a secondary LLM over filtered evidence. Reports accuracy gains of 30 percentage points over vanilla prompting on frontier models while keeping token cost flat regardless of document size.
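The two core ideas above can be sketched roughly as follows. This is an illustrative reconstruction, not minrlm's actual code: `entropy_score`, `relevant_sections`, and `run_generated_code` are hypothetical names, and the bare `exec` stands in for the real seccomp-restricted Docker sandbox.

```python
import zlib


def entropy_score(chunk: bytes) -> float:
    # Poorly compressible text is information-dense; repetitive filler
    # compresses well and therefore scores low.
    return len(zlib.compress(chunk)) / max(len(chunk), 1)


def relevant_sections(doc: str, window: int = 512, k: int = 4) -> list[str]:
    # Rank fixed-size windows of the document by compression ratio and
    # keep only the k most information-dense ones.
    chunks = [doc[i:i + window] for i in range(0, len(doc), window)]
    ranked = sorted(chunks, key=lambda c: entropy_score(c.encode("utf-8")),
                    reverse=True)
    return ranked[:k]


def run_generated_code(code: str, doc: str) -> str:
    # Stand-in for the sandboxed REPL: run model-written code against the
    # document and return only its small textual result. The document itself
    # never enters the prompt; only this capped string does. In minrlm the
    # execution happens inside an isolated container, not a bare exec().
    scope = {"doc": doc, "relevant_sections": relevant_sections}
    exec(code, scope)  # hypothetical; the real sandbox is locked down
    return str(scope.get("result", ""))[:2000]
```

A query over a large document then costs the model only the generated code plus the truncated result string, which is why token usage stays flat as the document grows.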
Used by 1 other package. Available on PyPI.
Stars: 31
Forks: 3
Language: Python
License: MIT
Category:
Last pushed: Mar 18, 2026
Monthly downloads: 202
Commits (30d): 0
Dependencies: 1
Reverse dependents: 1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/avilum/minrlm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
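The same endpoint can also be called from Python's standard library. The URL is the one shown in the curl command above; the response is assumed here, not verified, to be JSON.

```python
import json
import urllib.request

# Endpoint from this page; open access is rate-limited to 100 requests/day,
# or 1,000/day with a free API key.
URL = "https://pt-edge.onrender.com/api/v1/quality/transformers/avilum/minrlm"


def fetch_quality(url: str = URL) -> dict:
    # Assumes the endpoint returns a JSON body; adjust parsing if it does not.
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)
```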
Related models
hassancs91/SimplerLLM
Simplify interactions with Large Language Models
kyegomez/SingLoRA
This repository provides a minimal, single-file implementation of SingLoRA (Single Matrix...
tylerelyt/LLM-Workshop
🌟 Learn Large Language Model development through hands-on projects and real-world implementations
NetEase-Media/grps_trtllm
Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM...
gtausa197-svg/-Project-Nord-Spiking-Neural-Network-Language-Model
The first pure SNN language model trained from scratch with a fully original architecture. 144M...