giskard-oss and MASEval
giskard-oss
Maintenance: 25/25
Adoption: 10/25
Maturity: 16/25
Community: 19/25
Stars: 5,158
Forks: 406
Downloads: n/a
Commits (30d): 57
Language: Python
License: Apache-2.0

MASEval
Maintenance: 13/25
Adoption: 12/25
Maturity: 18/25
Community: 16/25
Stars: 18
Forks: 7
Downloads: 222
Commits (30d): 0
Language: Python
License: MIT

No Package
No Dependents
No risk flags
About giskard-oss
Giskard-AI/giskard-oss
🐢 Open-Source Evaluation & Testing library for LLM Agents
This tool helps AI application developers test and evaluate their Large Language Model (LLM) agents and applications. It allows you to define specific scenarios and checks to ensure your AI behaves correctly, even with varied, non-deterministic outputs. Data scientists, machine learning engineers, and AI product managers can use this to validate and improve the reliability of their LLM-powered systems.
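As a rough illustration of what "scenarios and checks" over non-deterministic outputs can look like, here is a minimal plain-Python sketch. It does not use the giskard-oss API: `Check`, `Scenario`, `fake_agent`, and `pass_rates` are hypothetical names invented for this example, and the "agent" is a stand-in that picks a canned answer at random. The idea is to assert properties of the output rather than exact strings, and to run each scenario several times.

```python
import random
from dataclasses import dataclass
from typing import Callable

# Hypothetical names throughout: this is NOT the giskard-oss API.


@dataclass
class Check:
    name: str
    predicate: Callable[[str], bool]  # returns True when the output is acceptable


@dataclass
class Scenario:
    prompt: str
    checks: list[Check]


def fake_agent(prompt: str, rng: random.Random) -> str:
    """Stand-in for an LLM agent: the wording varies from call to call."""
    templates = [
        "Refunds are accepted within 30 days of purchase.",
        "You can return the item for a refund within 30 days.",
        "We offer a 30-day refund window.",
    ]
    return rng.choice(templates)


def pass_rates(scenario: Scenario, agent, runs: int = 20, seed: int = 0) -> dict[str, float]:
    """Run the scenario `runs` times and report the pass rate of each check."""
    rng = random.Random(seed)
    passed = {check.name: 0 for check in scenario.checks}
    for _ in range(runs):
        output = agent(scenario.prompt, rng)
        for check in scenario.checks:
            if check.predicate(output):
                passed[check.name] += 1
    return {name: count / runs for name, count in passed.items()}


scenario = Scenario(
    prompt="What is your refund policy?",
    checks=[
        # Property checks tolerate different phrasings of the same answer.
        Check("mentions_30_days", lambda out: "30" in out),
        Check("mentions_refund", lambda out: "refund" in out.lower()),
        Check("no_apology", lambda out: "sorry" not in out.lower()),
    ],
)

rates = pass_rates(scenario, fake_agent)
for name, rate in rates.items():
    print(f"{name}: {rate:.0%}")
```

Reporting a pass rate per check, instead of a single pass/fail, is one way to make flaky behavior visible when the same prompt can produce different answers.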
LLM evaluation
AI agent testing
prompt engineering
AI safety
RAG systems
About MASEval
parameterlab/MASEval
Multi-Agent LLM Evaluation
Scores updated daily from GitHub, PyPI, and npm data.