antrixsh/trusteval
Enterprise LLM Evaluation & Responsible AI Framework — Benchmark bias, hallucination, PII leakage, and toxicity across Healthcare, BFSI, Retail & Legal industries. Supports OpenAI, Anthropic, Gemini & HuggingFace. Python SDK + CLI + Web Dashboard. 191 tests. Compliance-ready reports.
Stars: 4
Forks: 4
Language: Python
License: MIT
Category:
Last pushed: Mar 18, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/antrixsh/trusteval"
Open to everyone: 100 requests/day with no key required. A free key raises the limit to 1,000/day.
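The curl call above can also be made from Python. A minimal sketch, assuming only the endpoint pattern shown in the listing (`/api/v1/quality/nlp/{owner}/{repo}`); the shape of the JSON response is not documented here, so this helper only builds the request URL:

```python
# Build the pt-edge quality-API URL for a given GitHub repo.
# Base URL taken from the curl example above; response schema is an
# assumption and is not parsed here.
BASE = "https://pt-edge.onrender.com/api/v1/quality/nlp"

def quality_url(owner: str, repo: str) -> str:
    """Return the API endpoint URL for an owner/repo pair."""
    return f"{BASE}/{owner}/{repo}"

print(quality_url("antrixsh", "trusteval"))
# → https://pt-edge.onrender.com/api/v1/quality/nlp/antrixsh/trusteval
```

The URL could then be fetched with any HTTP client (e.g. `urllib.request` or `requests`) and decoded as JSON.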
Related tools
nityansuman/marvin
Web app to automatically generate subjective or objective tests and evaluate user responses...
shibing624/judger
Automatic essay-scoring tool; supports intelligent scoring of Chinese and English essays, self-training of scoring models, WEKA-based model-data processing, and custom scoring algorithms. Developed in Java.
shubhpawar/Automated-Essay-Scoring
Automated Essay Scoring on The Hewlett Foundation dataset on Kaggle
samiali12/debateai-server
A FastAPI-powered backend that manages structured debates, analyzes arguments, and generates...
usnistgov/KAIROS
Scoring and analysis software for the evaluation of Knowledge Directed Artificial Intelligence...