chembench and macbench

chembench
50
Established
macbench
36
Emerging
Maintenance 10/25
Adoption 10/25
Maturity 16/25
Community 14/25
Maintenance 10/25
Adoption 6/25
Maturity 16/25
Community 4/25
Stars: 134
Forks: 16
Downloads:
Commits (30d): 0
Language: Python
License: MIT
Stars: 23
Forks: 1
Downloads:
Commits (30d): 0
Language: Python
License: MIT
No Package No Dependents
No Package No Dependents

About chembench

lamalab-org/chembench

How good are LLMs at chemistry?

ChemBench helps chemists and materials scientists evaluate how well large language models (LLMs) and multimodal models perform on chemistry-related tasks. You provide a language model (or a vision-language model) and it outputs detailed reports on the model's accuracy across various chemistry topics. This is for researchers and developers working with AI in chemistry who need to assess model capabilities.

computational chemistry materials science AI model evaluation chemical informatics drug discovery

About macbench

lamalab-org/macbench

Probing the limitations of multimodal language models for chemistry and materials research

This tool helps chemistry and materials science researchers evaluate how well advanced AI models (multimodal language models) understand and respond to questions using both text and images in your field. You input a multimodal language model and a set of chemistry/materials research tasks, and it provides a report on the model's performance across various stages of scientific work. This is designed for scientists, engineers, and researchers who want to assess or compare AI models for scientific discovery workflows.

chemistry-research materials-science AI-model-evaluation scientific-discovery computational-chemistry

Related comparisons

Scores updated daily from GitHub, PyPI, and npm data. How scores work