Maximilian-Winter/llama-cpp-agent

The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It lets users chat with LLMs, execute structured function calls, and get structured output, and it also works with models that were not fine-tuned for JSON output or function calling.

Quality score: 72 / 100 (Verified)

Leverages guided sampling with JSON schema grammars to constrain model outputs, enabling function calling and structured output even on models not fine-tuned for these tasks. Integrates with multiple inference backends including llama.cpp, TGI, and vLLM servers, and supports agentic workflows through conversational, sequential, and mapping chain patterns with tool integration from Pydantic, llama-index, and OpenAI schemas.
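To make the guided-sampling idea concrete: the framework compiles a JSON schema into a grammar so the model can only emit conforming output. The sketch below does not use llama-cpp-agent's actual API; it is a stdlib-only illustration of the contract such a grammar enforces, checking a reply against a schema after the fact. All names (`BOOK_SCHEMA`, `matches_schema`) are hypothetical.

```python
import json

# Illustrative only: llama-cpp-agent constrains generation with a grammar built
# from a JSON schema; this sketch instead validates a finished reply, which
# shows the same contract. Schema and function names are made up for the demo.
BOOK_SCHEMA = {
    "type": "object",
    "required": ["title", "author", "year"],
    "properties": {
        "title": {"type": "string"},
        "author": {"type": "string"},
        "year": {"type": "integer"},
    },
}

def matches_schema(reply: str, schema: dict) -> bool:
    """Return True if `reply` parses as JSON and has the required typed keys."""
    type_map = {"string": str, "integer": int, "object": dict}
    try:
        data = json.loads(reply)
    except json.JSONDecodeError:
        return False
    if not isinstance(data, type_map[schema["type"]]):
        return False
    for key in schema.get("required", []):
        expected = type_map[schema["properties"][key]["type"]]
        if not isinstance(data.get(key), expected):
            return False
    return True

reply = '{"title": "Dune", "author": "Frank Herbert", "year": 1965}'
print(matches_schema(reply, BOOK_SCHEMA))       # True
print(matches_schema("not json", BOOK_SCHEMA))  # False
```

Grammar-constrained decoding moves this check into the sampler itself, which is why it works even on models never fine-tuned to produce JSON.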

620 stars and 8,620 monthly downloads. Maintained, with 1 commit in the last 30 days. Available on PyPI.

Maintenance 16 / 25
Adoption 19 / 25
Maturity 18 / 25
Community 19 / 25
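The four 25-point subscores appear to account for the overall score. Assuming an unweighted sum (an assumption; the weighting is not documented on this page):

```python
# Assumption: the overall quality score is the plain sum of the four
# 25-point subscores listed above.
subscores = {"Maintenance": 16, "Adoption": 19, "Maturity": 18, "Community": 19}
overall = sum(subscores.values())
print(overall)  # 72, matching the 72/100 shown above
```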


Stars: 620
Forks: 69
Language: Python
License: (not listed)
Last pushed: Mar 09, 2026
Monthly downloads: 8,620
Commits (30d): 1
Dependencies: 5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Maximilian-Winter/llama-cpp-agent"

Open to everyone: 100 requests/day with no key needed; a free key raises the limit to 1,000/day.
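The same request can be made from Python with the standard library. The URL below mirrors the curl example exactly; the response's field layout is not documented on this page, so the fetch is left commented and the parsed structure should be inspected before use.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-API URL for a repo (mirrors the curl example above)."""
    return f"{BASE}/{quote(category)}/{quote(owner)}/{quote(repo)}"

url = quality_url("llm-tools", "Maximilian-Winter", "llama-cpp-agent")
print(url)

# Uncomment to fetch live data (counts against the daily rate limit):
# with urlopen(url) as resp:
#     data = json.load(resp)
```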