microsoft/aici

AICI: Prompts as (Wasm) Programs

/ 100

Emerging

Implements constrained decoding through lightweight WebAssembly modules that execute on the inference engine's CPU during token generation, enabling token-by-token control over LLM output. Controllers written in Rust, C++, Python, or JavaScript maintain state and implement diverse strategies—from programmatic decoding to multi-agent coordination—while the sandbox architecture ensures security without filesystem or network access. Integrates with llama.cpp, HuggingFace Transformers, and rLLM, designed as a portable layer for higher-level control libraries like Guidance and LMQL.

2,064 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

2,064

Forks

Language

Rust

License

MIT

Higher-rated alternatives

trymirai/uzu

A high-performance inference engine for AI models

justrach/bhumi

⚡ Bhumi – The fastest AI inference client for Python, built with Rust for unmatched speed,...

lipish/llm-connector

LLM Connector - A unified interface for connecting to various Large Language Model providers

keyvank/femtoGPT

Pure Rust implementation of a minimal Generative Pretrained Transformer

ShelbyJenkins/llm_client

The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from...

Explore LLM Tools

All categories Trending LLM Tool directory Insights