Fall 2025, Mondays 2 to 3:20 pm, NCS 220 and Zoom link to be announced soon.

The seminar will be jointly taught by Prof. Dimitris Samaras samaras@cs.stonybrook.edu.

The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision.

To enroll in this course, you must either: (1) be in the Ph.D. program or (2) receive permission from the instructors.

Each seminar will consist of multiple short talks (around 15 minutes) by multiple students. Students can register for 1 credit for CSE656. Registered students must attend and present a minimum of 2 talks. Registered students must attend in person. Up to 3 absences will be excused. Everyone else is welcome to attend.
Abstract: Molecular learning has become an emerging field of AI, driving breakthroughs in drug discovery, protein design, and materials design. For high-stakes scientific tasks, however, predictive accuracy alone is not sufficient: models must also be interpretable and trustworthy. Our work aims to study molecular learning under a unified explainability perspective across two major model families: Graph Neural Networks (GNNs) and Large Language Models (LLMs).

GNNs are natural choices for molecular graphs and achieve strong performance on many molecular tasks. To enhance explainability, many GNN explanation methods have been proposed and work well for 2D GNNs. However, 3D GNNs introduce two key challenges: producing chemically meaningful substructures and reducing fidelity loss caused by dense geometric graphs. To address these challenges, I present two methods. 3DGraphX decomposes dense 3D graphs into chemically meaningful 3D motifs, enabling compact explanations that align with chemical intuition. EDMA introduces an energy-based discrete mask approximation approach to reduce the discrepancy between the soft mask optimized during training and the hard mask used for explanation, improving explanation fidelity.

LLMs present different characteristics and challenges compared with GNNs. LLMs can provide a certain level of explanation through step-by-step reasoning, and their natural-language outputs are easy for humans to understand and interpret. However, because LLMs are trained for general-purpose tasks, their performance on scientific tasks often lags behind specialized GNNs. To improve performance, existing methods guide LLMs by providing suggestions through brief feedback, retrieval-augmented generation (RAG), or planner agents. However, these approaches face several limitations, such as vague guidance, introduced bias problems, and high computational cost. To fill the gap, I propose RL-Guider, a lightweight reinforcement-learning agent that converts evaluation feedback into input-specific guidance for molecular optimization. RL-Guider improves over time by accumulating historical experience and transfers efficiently across different LLMs while preserving interpretability.

Together, these efforts aim to provide explanations that are scientifically meaningful and faithful, while also preserving or improving performance on molecular tasks to better meet real scientific needs.

Speaker: Xufeng Liu

Location: New Computer Science-1-Room 115

You are cordially invited to attend the biweekly Brookhaven AI Mixer (BAM). BAM includes one short talk on AI research happening at BNL, followed by an open mixer. The first half hour will consist of presentations that will be available via ZOOM, and the second half hour will be for in person only networking.

We meet every other Tuesday at noon in CDSD's Training Room (building 725, room 2-124) to learn about interesting AI methods and applications, engage with potential collaborators, prepare for pending FASST funding calls, and build a community of AI for Science at BNL.

In addition to our speaker, we will have a number of CDS staff in attendance with expertise in AI methods and applications including image analysis, foundation models development, and inverse problem solving.

AI-Driven Physics-Informed Phase Retrieval from a Single X-ray

Abstract: X-ray phase-contrast imaging enables the visualization of weakly absorbing or low-contrast structures and plays an important role in materials, biological, and energy research. Conventional X-ray holography and phase-retrieval techniques typically require multiple intensity measurements acquired at different propagation distances to recover phase information, increasing acquisition time, radiation dose, and experimental complexity. In this work, we present an AI-driven, physics-informed approach for phase retrieval using only a single X-ray intensity measurement. The method adapted a generative neural network as an inverse reconstruction engine, with physical models of X-ray wave propagation embedded directly into the optimization process. This allows phase and absorption information to be recovered from a single hologram without relying on paired, unpaired, or simulated training datasets. By combining physical constraints with self-supervised AI reconstruction, the approach achieves stable and quantitative results across a wide range of imaging conditions. The results demonstrate how physics-informed AI can reduce experimental requirements and enable data-efficient, automated phase retrieval for next-generation X-ray imaging workflows.

Biography: Xiaogang Yang is a computational scientist in the Data Analysis & Workflow Integration group at NSLS-II, focusing on AI development for X-ray imaging, data analysis, and automated workflows. He earned his PhD from Delft University of Technology, completed his postdoctoral research at Argonne National Laboratory, and previously held a tenured position at PETRA III (DESY).

Location: CDS, Bldg. 725, Training Room

Join ZoomGov Meeting: https://bnl.zoomgov.com/j/1604383624?pwd=ffQ5cUPNxTI7nzClKQO6cnsNbhF9Vf.1

Meeting ID: 160 438 3624
Passcode: 558449

Please Note: Due to a funding shortfall, we are for the time being no longer able to provide pizza and sodas for these events. We will have coffee though, and all are of course welcome to bring their lunch.

Abstract: Humans perceive the world around them by recognizing global patterns and structures such as object parts, branches, their spatial arrangement, and so on. Most deep learning models, however, take a fundamentally local approach. They process images pixel-by-pixel rather than focusing on structures as a whole. While these models indeed perform well on many tasks, the local (pixel-level) versus global (structure-level) disconnect makes them harder to interpret and control.

Topology, in a general sense, is a mathematical language for describing structure. It delineates how different parts of an image relate to one another, capturing both individual structures and their overall layout. Preserving topology enforces structural correctness and, by extension, semantic validity.

In this thesis, we investigate how topological constraints can be used to bridge the gap between local and global understanding. We use topology to inform the design of deep learning models that are explicitly structure-aware. Our thesis focuses on dense prediction tasks, which include image segmentation, uncertainty estimation, and generative modeling. First, we introduce a topological interaction module for semantic segmentation that encodes containment and exclusion constraints directly into the learning process. This preserves anatomical hierarchies and improves multi-class consistency. Next, since segmentation models can never be truly perfect, we address the need for reliable uncertainty estimation to identify error-prone regions. Unlike conventional pixel-wise uncertainty maps, which tend to be noisy and difficult to interpret, we propose reasoning at the level of structural units--branches and connections--which are more visually discernible and actionable. Finally, we leverage topology for generative modeling. We propose a topology-guided diffusion framework that can be controlled using structural attributes like object count and connectivity.

Together, these contributions establish a unified approach to topology-informed, structure-preserving dense prediction models. By integrating topological reasoning with deep networks, this thesis advances models that are not only accurate, but also structurally consistent, interpretable, and controllable. The results from this thesis have been published in ECCV, NeurIPS, and ICLR.

Speaker: Saumya Gupta

Location: New Computer Science (NCS) 120


Zoom: https://stonybrook.zoom.us/j/93643318604?pwd=kv8DagpbayzizivU29UCYItnlzlYRM.1&jst=2
Abstract: Theory-internal work on opacity in phonology has been focused on the challenges these interactions present for one theory (rules, constraints) versus another. But there has also been interest in studying the formal, invariant properties of opaque and other process interactions (Chandlee et al. 2018; Bakovic and Blumenfeld 2024), though these works crucially differ in their underlying assumptions. In this talk I will recontextualize Chandlee et al. (2018)'s result that opaque maps are ISL in light of Bakovic and Blumenfeld (2024)'s recent formal typology of process interactions, and this recontextualization will provide an answer to an open question about the k-value of an interaction map. I will then discuss the implications of this collective formal understanding of opacity for a recent model of lexicon and phonological grammar learning (i.e., Hua and Jardine 2021, Chandlee and Jardine to appear).


Speaker: Prof. Jane Chandlee, Associate Professor in the Department of Linguistics at Haverford College

Location: IACS Seminar room.
Spring 2026, Wednesdays 2 to 3:20 pm, NCS 220 and Zoom link to be announced soon.

The seminar will be jointly taught by Prof. Dimitris Samaras (samaras@cs.stonybrook.edu).

The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision.

To enroll in this course, you must either: (1) be in the Ph.D. program or (2) receive permission from the instructors.

Each seminar will consist of multiple short talks (around 15 minutes) by multiple students. Students can register for 1 credit for CSE656. Registered students must attend and present a minimum of 2 talks. Registered students must attend in person. Up to 3 absences will be excused. Everyone else is welcome to attend.

Please note: Exceptionally, the first meeting on 1/28 will be in NCS 120.

The Provost's Office is excited to invite you to join in responding to an extraordinary opportunity to enhance our academic and research capabilities in AI at Stony Brook. SUNY recently made funding available to support the creation of departments of AI and Society at its universities. Stony Brook is well-positioned to seize this opportunity to build upon our interdisciplinary strengths in AI.

The office is hosting a forum on Friday, Nov. 15, from 11:30 a.m. to 1:30 p.m., in Ballroom A, SAC. You are invited to attend to learn more about this opportunity and to help us generate ideas to build a compelling proposal for Stony Brook to submit to SUNY. Lunch will be provided.

Please click here to RSVP as soon as possible.

This funding will support innovation in our curriculum, allowing us to create programs that explore the social and societal impact of AI alongside the technological advancements led by researchers in engineering and scientific disciplines.

We believe we can make a significant impact through this SUNY program and look forward to your participation in this initiative.