giskard-oss and MASEval

giskard-oss

Verified

MASEval

Established

Maintenance 25/25

Adoption 10/25

Maturity 16/25

Community 19/25

Maintenance 13/25

Adoption 12/25

Maturity 18/25

Community 16/25

Stars: 5,158

Forks: 406

Downloads: —

Commits (30d): 57

Language: Python

License: Apache-2.0

Stars: 18

Forks: 7

Downloads: 222

Commits (30d): 0

Language: Python

License: MIT

No Package No Dependents

No risk flags

About giskard-oss

Giskard-AI/giskard-oss

🐢 Open-Source Evaluation & Testing library for LLM Agents

This tool helps AI application developers test and evaluate their Large Language Model (LLM) agents and applications. It allows you to define specific scenarios and checks to ensure your AI behaves correctly, even with varied, non-deterministic outputs. Data scientists, machine learning engineers, and AI product managers can use this to validate and improve the reliability of their LLM-powered systems.

LLM evaluation AI agent testing prompt engineering AI safety RAG systems

About MASEval

parameterlab/MASEval

Multi-Agent LLM Evaluation

Related comparisons

giskard-oss and lmms-eval

Scores updated daily from GitHub, PyPI, and npm data. How scores work