Abstract: Many scientific and engineering challenges, such as the design of materials or molecules or the control of experimental systems, rely on the existence of fast predictive models that can evaluate potential designs or control policies. Traditionally this has been accomplished through numerical simulation; more recently data-driven machine learning methods have been applied. However, both approaches leave gaps: physical modeling can be accurate and extrapolates well to previously-unstudied conditions, but it is often computationally expensive and relies on physics approximations that may not be valid. Machine learning can generalize from massive amounts of real-world or simulation data, but suffers from physical grounding and extrapolation into new regimes, as well as in settings where large data sets do not exist.
In this talk I explore an intermediate regime, which is hybrid reduced order models: fast simplified physics approximations where some of the unknown or approximated equations are replaced with data-driven machine learning components. Examples include coarse-grained models where the full macroscopic equations cannot be derived from first-principles microscopic equations, multiscale models with unknown closure terms or sub-grid parameterization schemes, and low-order or latent dynamical systems that learn governing equations on a low-dimensional reduced state space. I discuss how such reduced systems can be identified from very limited data, much less than is often needed in traditional machine learning but at much lower time-to-solution than traditional numerical modeling. This facilitates not only system design and control but also uncertainty quantification approaches that search the space of possible equations for predictive models that can explain the data. I will focus on an example from materials science concerning the design of self-assembling block copolymer nanomaterials.

Speaker: Dr. Nathan Urban, Applied Mathematics Department, Brookhaven National Laboratory

Location: Laufer 101

Zoom: https://stonybrook.zoom.us/j/96090260834?pwd=mw8QTHbMOw9oeU9hazZeoq8bN4VIfH.1
Meeting ID: 960 9026 0834 Passcode: 374969




Time: 04/28 Wed 3pm-4pm

Remote Access
Join Zoom Meeting https://stonybrook.zoom.us/j/95617197636?pwd=KytzZ2pVRG9SZGpKZUtpNXJISjNjZz09 
Meeting ID: 956 1719 7636 Passcode: 924293

Title: Brain imaging genetics for Alzheimer's disease: integrated analysis and machine learning

Li Shen, Ph.D.
Professor of Informatics
Department of Biostatistics, Epidemiology and Informatics 
Perelman School of Medicine
University of Pennsylvania

Bio: Li Shen, Ph.D., is a Professor of Informatics in the Department of Biostatistics, Epidemiology and Informatics at the Perelman School of Medicine in the University of Pennsylvania. He is an elected fellow of the American Institute for Medical and Biological Engineering (AIMBE). He obtained his Ph.D. degree in Computer Science from Dartmouth College. The central theme of his lab is focused on developing computational and informatics methods for integrative analysis of multimodal imaging data, high throughput omics data, cognitive and other biomarker data, electronic health record (EHR) data, and rich biological knowledge such as pathways and networks, with applications to complex disorders. His research interests include medical image computing, biomedical informatics, machine learning, network science, imaging genomics, Alzheimer's disease, and big data science in biomedicine. He has authored over 280 peer-reviewed articles (h-index 57) in these fields. Dr. Shen's work has been continuously supported by the NIH and NSF, and he is presently the PI of multiple NIH and NSF grants on developing computational methods for various biomedical applications including brain imaging genomics, genetics of Alzheimer's disease, genetics of human connectome, mining drug effects from the EHR data, and big data mining in brain science. He is co-leading the NIA Alzheimer's Disease Sequencing Project AI4AD Consortium and oversees the imaging genomics aspect of this landmark project. Dr. Shen served as the Executive Director of the Medical Image Computing and Computer Assisted Intervention (MICCAI) Society Board of Directors during 2016-2019. He has chaired and co-chaired various professional meetings in medical image computing and bioinformatics. He is an Associate Editor of BioData Mining and Frontiers in Radiology (Section of AI in Radiology), and serves on the Editorial Board of Medical Image Analysis and Brain Imaging and Behavior.

Abstract: Brain imaging genetics is an emerging data science field, where integrated analysis of brain imaging and genetics data, often combined with other biomarker, clinical and environmental data, is performed to gain new insights into the genetic, molecular and phenotypic characteristics of the brain as well as their impact on normal and disordered brain function and behavior. Many methodological advances in brain imaging genetics are attributed to large-scale landmark biobank projects such as the Alzheimer's Disease Sequencing Project, the Alzheimer's Disease Neuroimaging Initiative, and the UK Biobank. Using the study of Alzheimer's disease as an example, we will discuss fundamental concepts, state-of-the-art statistical and machine learning methods, and innovative applications in this rapidly evolving field. We show that the wide availability of brain imaging genetics data from various large-scale biobanks, coupled with advances in biomedical statistics, informatics and computing, provides enormous opportunities to contribute significantly to biomedical discoveries in brain science and to impact the development of new diagnostic, therapeutic and preventative approaches for complex brain disorders such as Alzheimer's disease.

More details:
https://bmi.stonybrookmedicine.edu/sites/default/files/shen_li_04_28_flyer.pdf

You are cordially invited to attend the biweekly Brookhaven AI Mixer (BAM). BAM includes three short talks on AI research happening at BNL, followed by an open mixer over coffee and snacks for everyone to network and discuss all things AI. The first half hour will consist of presentations that will be available via ZOOM, and the second half hour will be for in person only networking.

Join us every other Tuesday at noon in CDSD's Training Room (building 725, 2nd floor) to learn about interesting AI methods and applications, engage with potential collaborators, prepare for pending FASST funding calls, and build a community of AI for Science at BNL.

Tuesday, January 7, 2025, 12:00 pm -- CDS, Bldg. 725, Training Room

Speakers

Maria Zawadowicz, EBNN--ML for Atmospheric Aerosol Research

Mohammad Atif, CDS--An Extensible Digital Twin Framework

Guang Zhao, CDS--Pareto Prompt Optimization

Join ZoomGov Meeting: https://bnl.zoomgov.com/j/1615289117?pwd=Hqkbj9itxWrFnkhZ8rQXHPInO2gxdF.1

Meeting ID: 161 528 9117
Passcode: 991382

Are you tired of drowning in a sea of resumes and losing top talent in the hiring whirlwind? Transform your hiring process through a different lens and learn about AI in the Workplace and the Applicant Tracking System (ATS). Whether you're a recent graduate seeking your first job or an undergraduate student looking to delve into more career-oriented opportunities, this workshop by SBU Career Center is designed to equip you with the knowledge and strategies needed to succeed.

Register here: https://stonybrook.joinhandshake.com/stu/events/1568133?

University Libraries Presents: The Library AI Club is a welcoming space for students, faculty, and staff to explore AI in a supportive, low-pressure environment. Meeting every two weeks, the club features discussions, collaborative projects, guest speakers, and hands-on experiments. Join us to learn, share ideas, and engage with AI responsibly and creatively. We'd love to see you at an upcoming meeting! Location: Melville Library, Scholarly Communication Seminar Room
Discover how U.S. Census Bureau Tools can help you find free data for your research projects, community, and more. See how to access the latest American Community Survey and 2020 Census data for various geographies including New York City and Long Island at data.census.gov. Learn about Community Resilience Estimates and how to navigate My Community Explorer; an interactive map-based tool which highlights demographic and socioeconomic data that measure inequality. This session will involve live demonstrations and hands-on exercises for participants. Registrants will receive the Zoom link one day prior to the event.

Please Register for SBU Libraries' AI Club: Exploring Census Data here.
Title: Building foundation models for scientific data Seminar

Speaker: Ruben Ohana, Ph.D. and Michael McCabe, Ph.D - Flatiron Institute, New York

Abstract: Foundation models are very large architectures trained on large-scale datasets and can be used to transfer knowledge from a domain to another. Scientific data, particularly numerical simulations of partial differential equations (PDEs), presents unique challenges due to its complexity and the need for domain expertise to assess prediction quality, complicating the building of the first foundation models in this field. In this talk, we will develop our approach of building foundation models for scientific data, highlighting the requirements and expectations for achieving meaningful results. We will also introduce The Well, a comprehensive collection of datasets encompassing multi-scale simulations of fluid dynamics, astrophysics, and biological systems. The Well serves as a foundation for developing models that generalize across diverse physical phenomena, aiming to accelerate scientific discovery through large-scale learning.

Join Zoom Meeting: https://bnl.zoomgov.com/j/1606898802?pwd=GbbPiLGHlEokDskxjeFheMFWfuboxO.1
Meeting ID: 160 689 8802
Passcode: 281575
The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision. To enroll in this course, you must either: (1) be in the PhD program or (2) receive permission from the instructors.

Each seminar will consist of multiple short talks (around 10 minutes) by multiple people. Students can register for 1 credit for CSE 656. Registered students must attend and present a minimum of 2 or 3 talks. Everyone else is welcome to attend. Fill in https://forms.gle/pCVXovgfMfQwGqG38 to subscribe to our mailing list for further announcement.
Abstract: Machine learning (ML) systems fueled by neural networks have entered our daily lives and led to scientific breakthroughs, but many open questions remain. After a nod toward the question of rigor with ML and recent progress, I'll turn to the theory of neural networks. I will argue that understanding neural networks inevitably leads to ideas from field theory (FT), which was already realized in the simplest case in the 1990s, and I will review some essential FT-for-NN results. I will then propose that the connection might be more general, an NN-FT correspondence of sorts, with neural networks providing a way to define a field theory. I'll end with comments on known results including the origin of interactions and various symmetries, but I will also list some open questions. The apparent non-sequitur in the title will be used as a rhetorical device to explore where we are and where we'd like to go.

https://scgp.stonybrook.edu/calendar/full-calendar
Abstract: As we enter the AI era, domain scientists face a critical question: What can we do to harness AI effectively for scientific discovery? AI has demonstrated remarkable capabilities, from accelerating simulations to uncovering hidden patterns in complex datasets. While these advancements offer unprecedented opportunities, they also raise concerns--AI models often function as black boxes, making it difficult to connect their outputs to established scientific principles. This lack of interpretability can undermine trust and limit adoption, particularly in fields like meteorology where physical understanding is critical.
In this talk, I will explore how interpretable AI can bridge this gap, highlighting its potential to generate explicit, physically meaningful equations rather than opaque neural networks. Through four case studies from my lab, I will showcase how interpretable AI can enhance scientific understanding:
  1. Satellite Precipitation Retrieval: Using AI-based approaches to interpret precipitation retrieval algorithms from AMSU data, we identified critical microwave channels (89 and 150 GHz) that directly link to physical processes in the atmosphere.
  2. Quantitative Precipitation Estimation (QPE): By applying symbolic regression models to polarimetric radar data, we derived mathematical expressions that outperform traditional Z-R relationships and existing QPE algorithms, offering new insights into rainfall microphysics.
  3. Tornado Probability Prediction: Leveraging reinforcement learning-based symbolic deep learning models, we developed interpretable equations that outperform the traditional Significant Tornado Parameter (STP) index, providing a clearer understanding of the relationships between key atmospheric variables and tornado risk.
  4. Domain-Aware Symbolic Regression for Scientific Equations: In our latest work, we introduced a symbolic regression framework that incorporates domain-specific symbol priors extracted from thousands of scientific publications. By encoding common mathematical structures--such as the prevalence of trigonometric functions in physics or logarithmic forms in biology--into a tree-structured reinforcement learning model, we improved both the accuracy and interpretability of discovered equations. This approach accelerates convergence, enforces physical plausibility, and reveals new governing relationships in climate and geophysical data.
Through these examples, I hope to spark discussion on the evolving role of domain scientists in the AI era and inspire new ways to integrate AI with physical understanding in atmospheric research.

IACS Seminar Speaker: Yixin Wen, University of Florida

Location: IACS Seminar Room or Zoom

Join Zoom Meeting: https://stonybrook.zoom.us/j/97596399106?pwd=0PBvElFLqov3biO6OlQxSWLWudkIuH.1
Meeting ID: 975 9639 9106
Passcode: 096213