chembench and macbench
About chembench
lamalab-org/chembench
How good are LLMs at chemistry?
ChemBench helps chemists and materials scientists evaluate how well large language models (LLMs) and multimodal models perform on chemistry-related tasks. You provide a language model (or a vision-language model), and ChemBench outputs detailed reports on the model's accuracy across various chemistry topics. It is aimed at researchers and developers working with AI in chemistry who need to assess model capabilities.
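The workflow above (provide a model, get per-topic accuracy reports) can be sketched as a minimal evaluation loop. This is an illustrative sketch only: the `Task` record, `evaluate` function, and `toy_model` are hypothetical names, not the chembench API.

```python
# Hypothetical sketch of a ChemBench-style evaluation loop.
# `Task`, `evaluate`, and `toy_model` are illustrative, not chembench's API.
from dataclasses import dataclass
from collections import defaultdict

@dataclass
class Task:
    topic: str     # e.g. "inorganic chemistry"
    question: str
    answer: str    # expected answer string

def evaluate(model, tasks):
    """Score a callable model (prompt -> answer) and report accuracy per topic."""
    correct, total = defaultdict(int), defaultdict(int)
    for task in tasks:
        total[task.topic] += 1
        if model(task.question).strip().lower() == task.answer.lower():
            correct[task.topic] += 1
    return {topic: correct[topic] / total[topic] for topic in total}

# Toy stand-in for an LLM that only knows one fact, for illustration.
def toy_model(prompt):
    return "NaCl" if "table salt" in prompt else "unknown"

tasks = [
    Task("inorganic chemistry", "Formula of table salt?", "NaCl"),
    Task("inorganic chemistry", "Formula of water?", "H2O"),
]
print(evaluate(toy_model, tasks))  # {'inorganic chemistry': 0.5}
```

A real run would swap `toy_model` for a call into an LLM client and load the benchmark's task corpus instead of the two toy questions.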
About macbench
lamalab-org/macbench
Probing the limitations of multimodal language models for chemistry and materials research
This tool helps chemistry and materials science researchers evaluate how well multimodal language models understand and answer questions that combine text and images from these fields. You provide a multimodal language model and a set of chemistry/materials research tasks, and it reports the model's performance across the various stages of scientific work. It is designed for scientists, engineers, and researchers who want to assess or compare AI models for scientific discovery workflows.
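The text-plus-image tasks and per-stage reporting described above can be sketched with a simple record layout and scoring pass. Everything here is hypothetical: the field names, the stage labels, and `stub_mllm` are illustrative, not the macbench schema or API.

```python
# Hypothetical sketch of MaCBench-style multimodal tasks scored per
# workflow stage; field and stage names are illustrative, not macbench's.
from collections import defaultdict

tasks = [
    {"stage": "data extraction", "text": "Which peak is the carbonyl stretch?",
     "image": "ir_spectrum.png", "answer": "1715 cm-1"},
    {"stage": "lab observation", "text": "What colour is the precipitate?",
     "image": "beaker.png", "answer": "white"},
]

def stub_mllm(text, image):
    # A real multimodal model would consume the image bytes alongside the
    # text prompt; this stub returns canned answers for illustration.
    return "white" if image == "beaker.png" else "3000 cm-1"

def per_stage_accuracy(model, tasks):
    """Report the fraction of correct answers for each workflow stage."""
    hits, totals = defaultdict(int), defaultdict(int)
    for task in tasks:
        totals[task["stage"]] += 1
        hits[task["stage"]] += model(task["text"], task["image"]) == task["answer"]
    return {stage: hits[stage] / totals[stage] for stage in totals}

print(per_stage_accuracy(stub_mllm, tasks))
# {'data extraction': 0.0, 'lab observation': 1.0}
```

Grouping scores by stage rather than reporting one global number is what lets such a report show where in the scientific workflow a model breaks down.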