Abstract: Sub-grid turbulence is challenging to resolve in climate models; therefore, it is parameterized. Traditionally, turbulent parameterizations have relied on physics-based and equation-based approaches. However, ad hoc and uncertain components in these parameterizations introduce uncertainty in future climate predictions. Recently, data-driven techniques have emerged as an alternative for modeling sub-grid fluxes. I will demonstrate the use of machine learning to model vertical turbulent fluxes in the ocean surface boundary layer and its impact on reducing biases in NOAA's Geophysical Fluid Dynamics Laboratory ocean climate model.

I will show how neural networks, trained to predict the eddy diffusivity profile from high-fidelity yet computationally expensive turbulence schemes, enhance the vertical mixing scheme in the climate model. These networks replace ad hoc components while maintaining the conservation principles of the standard ocean model equations. The enhanced scheme outperforms its predecessor by reducing biases in the mixed-layer depth and modestly improving tropical upper-ocean stratification in ocean-only global simulations. Furthermore, simplified equations that can replace the neural networks show similar improvements but with lower computational cost and better interpretability. They point to structural deficiencies in the baseline parameterization. This work is one of the first successful applications of machine learning to improve a sub-grid parameterization of turbulent mixing in ocean climate models.

IACS Seminar Speaker: Aakash Sane, Princeton University

Location: IACS Seminar Room or Zoom

Join Zoom Meeting: https://stonybrook.zoom.us/j/97764942108?pwd=MzCWupCe3L9mKdrgfO2bJg3GBbvXuf.1
Meeting ID: 977 6494 2108
Passcode: 519324
18th Annual Engineering Ball Flowerfield, St. James, NY Thursday April, 2nd, 7:00 to 10:00 pm Pick up your tickets in 231 Engineering (Monday - Friday, 10:00 am to 4 pm) Presenting Partner: L3Harris
Abstract: Humans perceive the world around them by recognizing global patterns and structures such as object parts, branches, their spatial arrangement, and so on. Most deep learning models, however, take a fundamentally local approach. They process images pixel-by-pixel rather than focusing on structures as a whole. While these models indeed perform well on many tasks, the local (pixel-level) versus global (structure-level) disconnect makes them harder to interpret and control.

Topology, in a general sense, is a mathematical language for describing structure. It delineates how different parts of an image relate to one another, capturing both individual structures and their overall layout. Preserving topology enforces structural correctness and, by extension, semantic validity.

In this thesis, we investigate how topological constraints can be used to bridge the gap between local and global understanding. We use topology to inform the design of deep learning models that are explicitly structure-aware. Our thesis focuses on dense prediction tasks, which include image segmentation, uncertainty estimation, and generative modeling. First, we introduce a topological interaction module for semantic segmentation that encodes containment and exclusion constraints directly into the learning process. This preserves anatomical hierarchies and improves multi-class consistency. Next, since segmentation models can never be truly perfect, we address the need for reliable uncertainty estimation to identify error-prone regions. Unlike conventional pixel-wise uncertainty maps, which tend to be noisy and difficult to interpret, we propose reasoning at the level of structural units--branches and connections--which are more visually discernible and actionable. Finally, we leverage topology for generative modeling. We propose a topology-guided diffusion framework that can be controlled using structural attributes like object count and connectivity.

Together, these contributions establish a unified approach to topology-informed, structure-preserving dense prediction models. By integrating topological reasoning with deep networks, this thesis advances models that are not only accurate, but also structurally consistent, interpretable, and controllable. The results from this thesis have been published in ECCV, NeurIPS, and ICLR.

Speaker: Saumya Gupta

Location: New Computer Science (NCS) 120


Zoom: https://stonybrook.zoom.us/j/93643318604?pwd=kv8DagpbayzizivU29UCYItnlzlYRM.1&jst=2
Zoom Link: https://github.com/giorgianb/spdhackspring2021/blob/main/bit.ly/spdhack2021

ΣΦΔ Hack Spring 2021 is ΣΦΔ's first annual machine learning hackathon. ΣΦΔ Hack Spring 2021 aims to introduce Stony Brook students to the rich and challenging field of machine learning, and develop the skills necessary to build sophisticated machine learning models on their own.
 
More info here: https://github.com/giorgianb/spdhackspring2021/blob/main/README.md
Abstract: Graphs are a universal language of science. Molecules, materials, quantum systems, and knowledge bases can all be naturally represented as graphs. This talk explores how graph-based artificial intelligence is emerging as a powerful engine for scientific discovery. Using molecular design as a guiding example, we examine how modern graph AI enables machines not only to analyze complex scientific structures but also to generate new ones. We will discuss graph neural networks for learning predictive models of molecular properties, graph generative models for constructing novel chemical structures, and emerging multimodal graph-language models that support inverse design and synthesis planning. Together, these advances make graph AI more scalable, interpretable, and data-efficient--key capabilities for real-world scientific discovery. As artificial intelligence enters the era of foundation models, the next frontier lies in multimodal reasoning. Scientific knowledge is not purely textual; it is expressed through structures, code, and experimental data. By integrating graph representations with large language models, we move toward AI systems that can reason across multiple modalities and engage with scientific knowledge in its native forms. Looking ahead, we envision AI systems that behave less like tools and more like collaborators in the scientific process--generating hypotheses, designing candidate structures, planning experiments, interpreting results, and iteratively refining ideas through cycles of success and failure. In this vision, multimodal and agentic AI will enable scientists to explore vast and previously inaccessible design spaces, accelerating breakthroughs across domains ranging from drug discovery and materials innovation to software systems and quantum technologies.

Bio: Jie Chen is an interdisciplinary researcher working at the intersection of computing and mathematics, with a current focus on foundation models and AI agents for scientific discovery. His research integrates machine learning, statistics, scientific computing, and numerical linear algebra, with contributions spanning graph neural networks, multimodal graph LLMs, graph structure learning, scalable Gaussian processes, graph coarsening, and matrix functions. He is widely recognized for transformative contributions to graph-based deep learning and large-scale statistical modeling, and for bridging theory with real-world scientific and engineering applications. Dr. Chen has led externally funded, multi-institutional research programs supported by Shell, Evonik, and the U.S. Department of Energy, with applications in materials discovery, financial forensics, and power system resilience. He previously served as a Senior Research Scientist and Manager at IBM Research and the MIT-IBM Watson AI Lab, and as a Postdoctoral Fellow at Argonne National Laboratory. He has published extensively in top-tier AI, statistics, and applied mathematics venues, and his work has been recognized by multiple IBM Outstanding Technical Achievement Awards and the SIAM Student Paper Prize. He earned his Ph.D. in Computer Science from the University of Minnesota and his B.S. in Mathematics with honors from Zhejiang University.

Location: NCS 120

As AI drives rapid change across professional fields, how do you bring these developments into your classroom? The CELT AI Panel Discussion will gather academic thought leaders to explore how generative AI is reshaping teaching, learning, and the knowledge students need for today's world. Our panelists will share practical strategies for integrating AI-related advancements into course content, highlight both opportunities and challenges, and discuss how educators can help students build critical thinking, ethical awareness, and hands-on experience with emerging AI technologies. Join us to examine how teaching can evolve alongside an AI-transformed society.

Register here.

Discover how U.S. Census Bureau Tools can help you find free data for your research projects, community, and more. See how to access the latest American Community Survey and 2020 Census data for various geographies including New York City and Long Island at data.census.gov. Learn about Community Resilience Estimates and how to navigate My Community Explorer; an interactive map-based tool which highlights demographic and socioeconomic data that measure inequality. This session will involve live demonstrations and hands-on exercises for participants. Registrants will receive the Zoom link one day prior to the event.

Please Register for SBU Libraries' AI Club: Exploring Census Data here.