February 22, 2023

Present

Gregg Barrett, Geoffrey Fox, Juri Papay, Christine Kirkpatrick, Wesley Brewer,

Apologies

Jeyan Thiyagalingam, Tony Hey, Gregor von Laszewski
Nhan V Tran, Benjamin Hawks, Christian Herwig, Piotr Luszczek, Mallikarjun Shankar, (This was a mistake of Geoffrey who logged in with the wrong email so he did not get messages that people were in waiting room)

Tentative Agenda

Any new members
AI Readiness of MLCommons Science (Continued) https://docs.google.com/document/d/1NbL-VdkrY9jzPxveOys2RCK8TdEJ7O5wgnxjAgzK-rE/edit?usp=sharing
Using Benchmarking Data to Inform Decisions Related to Machine Learning Resource Efficiency (Continued) https://docs.google.com/document/d/1gOKA8BnlJnsTAELWFSmL7Fl7kJej_UrNH-FVXbZFxGI/edit?usp=sharing
Discussion of new Benchmarks (Continued)
AOB

General

Most of the meeting was devoted to further interactive discussion of the papers
We suggested sending both papers to HPC working group for comments and/or collaboration

Discussion of “AI Readiness of MLCommons Science”

In the first paper, Christine asked for comments on table 1
We asked for volunteers to complete sections identified as gaps
Use email; Gregg and JUri indicated willingness to volunteer
We discussed ontologies for computer systems
Geoffrey noted MLIR (Multi-level Intermediate Representation aimed at software)
And NML aimed at computer networking
We noted Cerebras with a mesh connection
Wes mentioned regarding this paper that he had performed some energy benchmarks of inference in our "Inference Benchmarking on HPC Systems" paper and sent the link in the chat (https://ieeexplore.ieee.org/abstract/document/9286138).
Also, he mentioned if it would be of interest to have some power utilization metrics from Frontier, he may be able to get some of those numbers -- actually he is in the process of getting some of that data, but he is not sure how soon he could have the numbers.

This paper is further along
Gregg has a long comment on abstract
Geoffrey needs to add comments on efficiency from MLCommons data
Juri will add section on power use per chip and role of software and power usage.
Should Commercial clouds be discussed
Need to add OpenFold in HPC benchmarks
Need to discuss Dataperf MLCommons working group

Discussion with David Kanter

The MLCommons board still find our use of Science Discovery and Performance metrics confusing. Geoffrey discussed this with David Kanter on March 3, 2023. One idea that emerged was to only have an open division in Science which clearly differentiates us from mainstream benchmarks which mainly have a closed division. I stressed that a significant role of science benchmarks was in education, where often students find it interesting just to measure performance., I think David will arrange a further meeting.

February 22, 2023

February 22, 2023

Present

Apologies

Tentative Agenda

General

Discussion of “AI Readiness of MLCommons Science”

Discussion of “Using Benchmarking Data to Inform Decisions Related to Machine Learning Resource Efficiency”

Discussion with David Kanter