giskard-oss and MASEval

giskard-oss
70
Verified
MASEval
59
Established
Maintenance 25/25
Adoption 10/25
Maturity 16/25
Community 19/25
Maintenance 13/25
Adoption 12/25
Maturity 18/25
Community 16/25
Stars: 5,158
Forks: 406
Downloads:
Commits (30d): 57
Language: Python
License: Apache-2.0
Stars: 18
Forks: 7
Downloads: 222
Commits (30d): 0
Language: Python
License: MIT
No Package No Dependents
No risk flags

About giskard-oss

Giskard-AI/giskard-oss

🐢 Open-Source Evaluation & Testing library for LLM Agents

This tool helps AI application developers test and evaluate their Large Language Model (LLM) agents and applications. It allows you to define specific scenarios and checks to ensure your AI behaves correctly, even with varied, non-deterministic outputs. Data scientists, machine learning engineers, and AI product managers can use this to validate and improve the reliability of their LLM-powered systems.

LLM evaluation AI agent testing prompt engineering AI safety RAG systems

About MASEval

parameterlab/MASEval

Multi-Agent LLM Evaluation

Related comparisons

Scores updated daily from GitHub, PyPI, and npm data. How scores work