opencompass and COMPASS
These projects are unrelated and share only a similar name. One is a comprehensive LLM evaluation framework supporting 100+ benchmarks across many models; the other is a specialized domain application that uses LLMs to catalog energy infrastructure regulations. They are neither competitors, complements, nor ecosystem siblings.
About opencompass
open-compass/opencompass
OpenCompass is an LLM evaluation platform supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
It provides flexible evaluation pipelines built from composable evaluators (including LLM-as-judge and mathematical reasoning assessments) and supports specialized benchmarks for long-context, reasoning, and scientific tasks. Model backends are configurable (HuggingFace, vLLM, LMDeploy), and answers can be post-processed by models such as XFinder for more accurate capability assessment. It integrates with ModelScope for on-demand dataset loading and includes CompassHub and CompassRank for centralized benchmark results and model ranking.
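To make the idea of composable evaluation with answer post-processing concrete, here is a minimal, generic sketch in plain Python. It does not use the OpenCompass API: the names `extract_final_number`, `exact_match`, and `evaluate` are hypothetical, and the regex-based extraction is only a simple stand-in for a model-based extractor such as XFinder.

```python
import re
from typing import Callable, Sequence


def extract_final_number(raw: str) -> str:
    """Hypothetical post-processor: return the last number in a model output, or ''."""
    matches = re.findall(r"-?\d+(?:\.\d+)?", raw)
    return matches[-1] if matches else ""


def exact_match(prediction: str, reference: str) -> bool:
    """Hypothetical scorer: whitespace-insensitive string equality."""
    return prediction.strip() == reference.strip()


def evaluate(
    outputs: Sequence[str],
    references: Sequence[str],
    postprocess: Callable[[str], str],
    scorer: Callable[[str, str], bool],
) -> float:
    """Compose a post-processor and a scorer, returning accuracy in [0, 1]."""
    pairs = list(zip(outputs, references))
    if not pairs:
        return 0.0
    hits = sum(scorer(postprocess(out), ref) for out, ref in pairs)
    return hits / len(pairs)


# Made-up raw model outputs and gold answers, for illustration only.
outputs = ["The answer is 42.", "So x = 7", "I am not sure."]
references = ["42", "8", "3"]
accuracy = evaluate(outputs, references, extract_final_number, exact_match)
print(f"accuracy = {accuracy:.3f}")
```

Because the post-processor and scorer are passed in as functions, either can be swapped independently, for example replacing `exact_match` with a judge-model call, which is the kind of flexibility the description above refers to.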
About COMPASS
NatLabRockies/COMPASS
INFRA-COMPASS is a tool that leverages Large Language Models (LLMs) to create and maintain an inventory of state and local codes and ordinances applicable to energy infrastructure.