Skip to content

About MLCommons® Science

The MLCommons Science Working Group is dedicated to advancing the frontier of scientific machine learning. Our primary goal is to establish standardized benchmarks that enable the community to evaluate, compare, and improve AI models applied to complex scientific problems.

By creating a transparent and reproducible framework for benchmarking, we aim to accelerate scientific discovery and ensure that AI tools are developed with the rigor required for scientific research.

Official Resources

Stay connected with our official governance and guidelines:

Benchmark Catalog & Tools

We provide centralized resources to help researchers discover and utilize scientific benchmarks:

  • Benchmark Catalog: A filterable and searchable index of scientific benchmarks. Explore the Catalog
  • SABATH: The Scientific AI Benchmark and Tool Hub for accessing shared benchmarking tools. Visit SABATH on GitHub

Current Benchmarks

The following benchmarks are currently available, including descriptions and source code:

Community & Contribution

We welcome contributions from the global scientific and AI communities.