Please join us on Zoom for our next event in the Fall 2025 Stony Brook School of Nursing Research Seminar Series presented by our Office of Research and Innovation.

Topic: Responsible Artificial Intelligence: Promoting Health Equity for All

Speaker: Michael P. Cary, Jr., PhD, RN, FAAN.

Dr. Cary is a tenured Associate Professor at the Duke University School of Nursing. Dually trained as a health services researcher and applied health data scientist, Dr. Cary utilizes AI to investigate health disparities in aging populations, thereby promoting health equity and improving healthcare delivery. He co-directs HUMAINE™, an initiative dedicated to equipping nurses and healthcare professionals with the knowledge and skills necessary for the responsible use of AI in clinical practice.

Register: https://web.cvent.com/event/057978a5-a770-4de5-aca5-ad00287e4902/summary

The University's Main Commencement Ceremony will take place on Friday, May 23, 2025 at 11 am at Kenneth P. LaValle Stadium. Gates open at 10 am.

All guests need a valid ticket to enter LaValle Stadium - no exceptions. Children age 1 and older require a ticket. Seating is first-come, first-served.

Register here.

Recently, large-scale language data combined with modern machine learning techniques have shown strong value as means for studying human psychology and behavior. For example, language alone has been shown predictive in mental health, personality, and health behaviors. However, many applications for such language-based assessments have readily available and important data beyond language (i.e. extra-linguistics), such as predicting the subjective well-being of a community using tweets, where one can take into account their age, education, and demographic attributes. Language may capture some characteristics while extra-linguistic variables captures others. We believe that effectively integrating linguistic and extra-linguistic data can yield benefits beyond either independently. In this thesis, we develop methods which effectively integrate extra-linguistic data with language data focused primarily on social scientific applications. The central challenge is dealing with the size and heterogeneity of, often sparse and noisy, language data versus the, often low-dimensional and non-sparse, extra-linguistic variables. First, we consider structured extra-linguistics, like socioeconomic (income and education rates) and demographics (age, gender, etc.), and propose two integration methods, named residualized controls (RC) and residualized factor adaptation (RFA), to be used in county-wise prediction tasks. Demonstrating techniques that integrate information at both the model-level and data-level, we found consistently strong improvement over naively combining features, for example, increasing county level well-being predictions by over 12%. Next, we consider unstructured extra-linguistic data. In the first part, we incorporate social network connections and language over time to propose a novel metric for quantifying the stickiness of words - their ability to spread across friendship connections in a social network over time (or in other words, stick in ones vocabulary after seeing friends use it). We obtain which language features are more probable to disseminate through friendship and show such a metric is useful for predicting who will be friends and what content will spread. In addition, we analyze language content over time by proposing a novel dynamic content-specific topic modeling technique that can help to identify different sub-domains of a thematic scope and can be used to track societal shifts in concerns or views over time.
A talk by Jerome Zhengrong Liang entitled, Machine Learning from Original Images to Texture Patterns: A Paradigm Shift from Non-Medical Application to Medical Diagnosis. Abstract: Artificial intelligence (AI) research for medical diagnosis started soon after human began to use computer, initially called artificial neural network (ANN) and now convolutional neural network (CNN). ANN has been mainly explored to classify the experts' handcrafted features from the original (or raw) images, while CNN has been mainly explored directly on the raw images for both tasks of extracting abstract features and classifying the features. Experimental evidences have been shown that CNN can be trained by a large number of the raw images with experts' scores (or labels) to match or even surpass the experts' performance for both non-medical and medical diagnosis applications. However, the performances of the CNN models as well as the experts on medical diagnosis dropped dramatically when the labels of the raw images were replaced by the corresponding medical pathological reports. Accumulated medical knowledge reveals that the lesion heterogeneity is a footprint of lesion evolution and ecology, and the heterogeneity is an indicator of lesion progress and response to medical intervention. The heterogeneity can be reflected by the image contrast distribution (or texture patterns) across the lesion volume. Image textures have been shown as an effective descriptor of the lesion heterogeneity for computer-aided diagnosis. Can we map the raw images into texture patterns (or images) and train CNN to learn from the texture images? This question is the central theme of this presentation with application to CT Colonography or virtual colonoscopy, a game from AlphaGo to PolypGo. Bio: Jerome Zhengrong Liang, PhD, IEEE Fellow Imaging Research and Informatics Laboratory Department of Radiology, Stony Brook University

You are cordially invited to attend the biweekly Brookhaven AI Mixer (BAM). BAM includes one short talk on AI research happening at BNL, followed by an open mixer over coffee and snacks for everyone to network and discuss all things AI. The first half hour will consist of presentations that will be available via ZOOM, and the second half hour will be for in person only networking.

Join us every other Tuesday at noon in CDSD's Training Room (building 725, 2nd floor) to learn about interesting AI methods and applications, engage with potential collaborators, prepare for pending FASST funding calls, and build a community of AI for Science at BNL.

At our Oct 7 Mixer, BNL's newly minted interim director, John Hill will be present to give opening remarks and kick us off on a new year of impactful scientific AI collaborations.

Abstract: Weather extremes and strong seasonal-to- subseasonal variability pose growing challenges to urban populations, infrastructure, and energy systems. Yet, most cities remain data deserts: routine weather observations are sparse, with stations concentrated at airports rather than within the urban core. This lack of coverage limits our ability to monitor and predict fine-scale urban weather patterns precisely where they matter most. We present a new AI-driven framework for optimal sensor placement and urban weather monitoring. Unlike traditional approaches, our method leverages physics- based simulations together with Bayesian experimental design principles, but does so using a computationally efficient variational inference strategy that makes large-scale optimization tractable. This allows us to guide sensor networks in a way that minimizes information loss while capturing spatiotemporal variability at city scales. Applied to Phoenix, Arizona, our framework outperforms random sensor placement strategies, especially when only a limited number of sensors can be deployed. Importantly, the same AI models that guide sensor placement also function as a real-time nowcasting tool, providing urban weather information over the entire domain, beyond sensor locations. Together, these capabilities offer a scalable pathway to reduce urban data deserts, enhance monitoring of weather extremes, and improve resilience planning for energy, transportation, and public health systems.

Biography: Dr. Katia Lamer is an atmospheric scientist and the Director of the Center for Multiscale Applied Sensing at Brookhaven National Laboratory. Originally from Canada, she earned her B.S. and M.S. in Atmospheric and Oceanic Sciences from McGill University and a Ph.D. in Meteorology from Penn State University. Her research focuses on atmospheric boundary layer processes and remote sensing technologies, with a strong emphasis on data science. At Brookhaven, she is known for her work with the CMAS mobile observatories and its facility that connect fundamental atmospheric science to real-world applications, improving weather prediction, environmental monitoring, and urban climate resilience. Her work has been featured in public outlets such as New Scientist and Wired. Dr. Lamer also serves as an invited member of the World Meteorological Organization's Data Assimilation and Observing Systems Working Group, and the American Meteorological Society's Boundary Layer and Turbulence Committee. puting, communications and sensing, all enabled by AI.

Location: CDS, Bldg. 725, Training Room

Join ZoomGov Meeting: https://bnl.zoomgov.com/j/1604383624?pwd=ffQ5cUPNxTI7nzClKQO6cnsNbhF9Vf.1

Meeting ID: 160 438 3624 | Passcode: 558449

Join Stony Brook University's Center for Excellence in Learning and Teaching (CELT) for a bootcamp on how to use AI to enhance your teaching and courses. This event will demonstrate how ChatGPT, Microsoft Copilot, and other generative AI platforms can support you in crafting learning objectives, writing exam questions, composing rubrics, and designing course content such as lesson plans, in-class activities, instructional videos, and more.

Register here.
Abstract: Humans perceive the world around them by recognizing global patterns and structures such as object parts, branches, their spatial arrangement, and so on. Most deep learning models, however, take a fundamentally local approach. They process images pixel-by-pixel rather than focusing on structures as a whole. While these models indeed perform well on many tasks, the local (pixel-level) versus global (structure-level) disconnect makes them harder to interpret and control.

Topology, in a general sense, is a mathematical language for describing structure. It delineates how different parts of an image relate to one another, capturing both individual structures and their overall layout. Preserving topology enforces structural correctness and, by extension, semantic validity.

In this thesis, we investigate how topological constraints can be used to bridge the gap between local and global understanding. We use topology to inform the design of deep learning models that are explicitly structure-aware. Our thesis focuses on dense prediction tasks, which include image segmentation, uncertainty estimation, and generative modeling. First, we introduce a topological interaction module for semantic segmentation that encodes containment and exclusion constraints directly into the learning process. This preserves anatomical hierarchies and improves multi-class consistency. Next, since segmentation models can never be truly perfect, we address the need for reliable uncertainty estimation to identify error-prone regions. Unlike conventional pixel-wise uncertainty maps, which tend to be noisy and difficult to interpret, we propose reasoning at the level of structural units--branches and connections--which are more visually discernible and actionable. Finally, we leverage topology for generative modeling. We propose a topology-guided diffusion framework that can be controlled using structural attributes like object count and connectivity.

Together, these contributions establish a unified approach to topology-informed, structure-preserving dense prediction models. By integrating topological reasoning with deep networks, this thesis advances models that are not only accurate, but also structurally consistent, interpretable, and controllable. The results from this thesis have been published in ECCV, NeurIPS, and ICLR.

Speaker: Saumya Gupta

Location: New Computer Science (NCS) 120


Zoom: https://stonybrook.zoom.us/j/93643318604?pwd=kv8DagpbayzizivU29UCYItnlzlYRM.1&jst=2