Skip to content

September 6, 2023

September 6, 2023

Present

Geoffrey Fox, Piotr Luszczek, Juri Papay, Wes Brewer, Gregor von Laszewski, Christine Kirkpatrick, Gregg Barrett, Tom Gibbs, Xiaoxuan Yang, Ying-Jung Chen, Manil Maskey

Apologies

Mallikarjun Shankar

Tentative Agenda

  • Any new members
  • Foundation Models
  • What are the initial Activities
  • White Papers
  • Other Benchmarks
  • AOB

New Members

  • Manil Maskey https://www.linkedin.com/in/manilmaskey/ is a Senior Research Scientist in NASA HQ Science Mission Directorate and is AI Lead and Project Manager. He is located at NASA’s Marshall Space Flight Center in Alabama. He is a colleague of Yuhan (Douglas) Rao, who joined earlier this summer. He has a Ph.D. from the University of Alabama in Huntsville.
  • Ying-Jung Chen https://www.linkedin.com/in/yj-elizabeth-chen/ is an Applied Scientist / Machine Learning Engineer at Descartes Labs https://descarteslabs.com/ focussing on geospatial intelligence. She obtained a Ph.D. from UC Santa Barbara in 2018

Foundation Models

  • Geoffrey updated his presentation on Foundation models AI4Science Foundation Models Overview August 20 2023 slides 20-27, including a new MLCommons slide from Piotr and Vijay Janapa Reddi.
  • He emphasized “way stations” if full-blown Foundation models are not possible initially.
  • He noted Foundation models are new infrastructure components for all organizations, including Science Research groups and communities.
  • It was agreed that ScienceGPT isn't ChatGPT
  • There was concern that we would be unfocused. Tom Gibbs noted that one needs to define classes of problems to limit the objective, determine what data would be needed, and whether a foundational model is needed
  • Gregg noted that in Africa, we are trying to build a foundation model to determine which are the priority drugs to manufacture for Africa. An effective model requires a huge amount of diverse data inputs. It's a priority in Washington, but aggregating the data and then having a model to provide a output that will shape policy for drug manufacturing is proving to be challenging.
  • and Consensus
  • Manil noted the IBM-NASA Foundation model NASA and IBM Openly Release Geospatial AI Foundation Model for NASA Earth Observation Data | Earthdata
  • They Are Building On Top Of This And Adding New Data Such As SAR Data
  • Gregor noted BLOOM from Huggingface
  • Arjun (in absentia) At OLCF, we are currently building/scaling up LLMs oriented at science downstream tasks: scientific questions in the context of scientific publications and abstract texts.
  • Extending the notion to broader categories of data will be key and also perhaps identifying canonical “modes of scientific discovery”. I’ve added a bullet to tFeatures of a Science Foundation Model: List and map to “discovery” modes in science.
  • Wes Brewer discussed Foundation models of turbulence
  • One more paper POF23_055129_1_5.0146456.pdf that I mentioned on the topic of developing foundation models for turbulence. We are working towards developing a generalized wall-bounded turbulence model by training on various geometric configurations -- see Table I. Here, we are training on nine different DNS and LES simulations with different periodic hill geometries and conditions.
  • PU0174_VFS_VCGI_Paper_A.pdf Figure 4 illustrates the mixture of experts method to address heterogeneity which is a general feature of Foundation models

Papers