Learn how to prompt AI to help clean datasets and write formulas in Google Sheets.

When you have a messy dataset, it can take a lot of time to clean it up before you can start analyzing. Can AI help? In this workshop, we'll collect live data and then use Gemini AI (the stand alone tool) to help clean up the data. Then, we'll use it to help do some analysis. Because we'll be working with live data live in Gemini, we don't know exactly what will happen, but that's the reality of data and data cleaning!

In this session, you will

  1. Craft effective AI prompts to generate Google Sheets formulas for data analysis and manipulation
  2. Utilize Gemini to develop regular expression formulas to extract, reformat, clean text-based data
  3. Develop formulas for numerical analysis using Gemini AI

https://stonybrookuniversity.co1.qualtrics.com/jfe/form/SV_dht1o3rNzlZhHka?source=event+manager&session=0815250900sheets

The University's Main Commencement Ceremony will take place on Friday, May 23, 2025 at 11 am at Kenneth P. LaValle Stadium. Gates open at 10 am.

All guests need a valid ticket to enter LaValle Stadium - no exceptions. Children age 1 and older require a ticket. Seating is first-come, first-served.

Register here.

Launching a University-Wide AI Innovation Institute:

Last spring, the Office of the Provost led a group of over 30 faculty, staff, and administrators to consider how we can expand and leverage our strengths in AI research and discovery. The resulting recommendation was to launch a university-wide AI Innovation Institute (AI3), which would expand the Institute for AI-driven Discovery and Innovation established in 2018 from a department-level institute within the College of Engineering and Applied Science (CEAS) to the university-wide AI Innovation Institute reporting to the provost.

As a university-wide enterprise, the AI Innovation Institute (AI3) is intended to accelerate, coordinate, and organize AI innovation and education across Stony Brook. The institute will serve to empower the entire university community and beyond, catalyzing core AI research, curriculum innovation, and societal change in the ever-evolving landscape of knowledge work.

The AI Town Hall, led by AI3 Interim Director Skiena, is an open house event that will provide an overview of the major AI initiatives on campus, including the new AI Seed Grant program and Stony Brook's role in New York State's Empire AI program. The session will include time for questions and discussion about the future of AI at Stony Brook.

Abstract: Humans perceive the world around them by recognizing global patterns and structures such as object parts, branches, their spatial arrangement, and so on. Most deep learning models, however, take a fundamentally local approach. They process images pixel-by-pixel rather than focusing on structures as a whole. While these models indeed perform well on many tasks, the local (pixel-level) versus global (structure-level) disconnect makes them harder to interpret and control.

Topology, in a general sense, is a mathematical language for describing structure. It delineates how different parts of an image relate to one another, capturing both individual structures and their overall layout. Preserving topology enforces structural correctness and, by extension, semantic validity.

In this thesis, we investigate how topological constraints can be used to bridge the gap between local and global understanding. We use topology to inform the design of deep learning models that are explicitly structure-aware. Our thesis focuses on dense prediction tasks, which include image segmentation, uncertainty estimation, and generative modeling. First, we introduce a topological interaction module for semantic segmentation that encodes containment and exclusion constraints directly into the learning process. This preserves anatomical hierarchies and improves multi-class consistency. Next, since segmentation models can never be truly perfect, we address the need for reliable uncertainty estimation to identify error-prone regions. Unlike conventional pixel-wise uncertainty maps, which tend to be noisy and difficult to interpret, we propose reasoning at the level of structural units--branches and connections--which are more visually discernible and actionable. Finally, we leverage topology for generative modeling. We propose a topology-guided diffusion framework that can be controlled using structural attributes like object count and connectivity.

Together, these contributions establish a unified approach to topology-informed, structure-preserving dense prediction models. By integrating topological reasoning with deep networks, this thesis advances models that are not only accurate, but also structurally consistent, interpretable, and controllable. The results from this thesis have been published in ECCV, NeurIPS, and ICLR.

Speaker: Saumya Gupta

Location: New Computer Science (NCS) 120


Zoom: https://stonybrook.zoom.us/j/93643318604?pwd=kv8DagpbayzizivU29UCYItnlzlYRM.1&jst=2
Join Stony Brook University's Center for Excellence in Learning and Teaching (CELT) for a bootcamp on how to use AI to enhance your teaching and courses. This event will demonstrate how ChatGPT, Microsoft Copilot, and other generative AI platforms can support you in crafting learning objectives, writing exam questions, composing rubrics, and designing course content such as lesson plans, in-class activities, instructional videos, and more.

Register here.
As generative AI tools become increasingly prevalent in education, their impact on collegiate writing raises important questions about creativity, academic integrity, and effective teaching practices. This panel brings together faculty and students to share perspectives on the opportunities and challenges that AI presents in an academic setting. Through an open dialogue, participants will engage in meaningful conversations, allowing for a deeper understanding of each other's viewpoints and fostering collaboration. Students and faculty will explore diverse ways AI can be used in teaching and learning and seek solutions to utilize AI writing tools ethically. This exchange aims to build a community of trust and shared knowledge, ensuring that AI's role in education is both innovative and responsible.

Register here: https://stonybrook.zoom.us/meeting/register/tJAqdOitpjIpHtDGAsGBfEb3ah0YIzhIJolN
Abstract: Large language models (LLMs) may exhibit unintended or undesirable behaviors. Recent works have concentrated on aligning LLMs to mitigate harmful outputs. Despite these efforts, some anomalies indicate that even a well-conducted alignment process can be easily circumvented, whether intentionally or accidentally. Does alignment fine-tuning yield have robust effects on models, or are its impacts merely superficial? In this work, we make the first exploration of this phenomenon from both theoretical and empirical perspectives. Empirically, we demonstrate the elasticity of post-alignment models, i.e., the tendency to revert to the behavior distribution formed during the pre-training phase upon further fine-tuning. Leveraging compression theory, we formally deduce that fine-tuning disproportionately undermines alignment relative to pre-training, potentially by orders of magnitude. We validate the presence of elasticity through experiments on models of varying types and scales. Specifically, we find that model performance declines rapidly before reverting to the pre-training distribution, after which the rate of decline drops significantly. Furthermore, we further reveal that elasticity positively correlates with the increased model size and the expansion of pre-training data. Our findings underscore the need to address the inherent elasticity of LLMs to mitigate their resistance to alignment.

Speaker: Huajian Zhang

Location: CS2311
The Provost's Lecture Series features talks by SUNY Distinguished Academy faculty members at Stony Brook University, showcasing the outstanding research and scholarship that is taking place at our institution.

Joe Mitchell

SUNY Distinguished Professor, Applied Mathematics and Statistics
Chair, Department of Applied Mathematics and Statistics, College of Engineering and Applied Sciences

A Case for Algorithms: A Computational Geometer's Perspective

Algorithms are all around us in every smart device and technology that has consumed our daily lives. As a computational geometer, I study algorithms to solve problems that involve a geometric perspective on data. I have observed that practically every technology and field of study has a need for effective algorithms involving geometric data. I reflect on some favorite algorithmic problems that are easy to visualize, but challenging to solve, and argue that the formal study of algorithms remains essential in the age of AI.

Reception to follow immediately after the talks.

Register here.

Are you concerned about AI issues with your asynchronous online courses? Is your fully online course vulnerable to AI plagiarism? Do you want to engage your online students using AI? Discover the future of education with our AI-powered solutions designed specifically for online asynchronous courses. This innovative approach uses artificial intelligence to transform the way courses are delivered, making learning more personalized, engaging, and effective.

Register here.
Join Zoom Meeting
https://stonybrook.zoom.us/j/98079526509?pwd=Wkt5eURhVDN5VE56TUloS2h5V1Jodz09

Meeting ID: 980 7952 6509
Passcode: 949941



Abstract Over the last decade, artificial neural networks have undergone a revolution, catalyzed by better
supervised learning algorithms. However, in stark contrast to young animals (including humans), training such
networks requires enormous numbers of labeled examples, leading to the belief that animals must rely instead
mainly on unsupervised learning. The reason is that most animal behavior is not the result of clever learning
algorithms--supervised or unsupervised-- but is encoded in the genome. Specifically, animals are born with
highly structured brain connectivity, which enables them to learn very rapidly. Because the wiring diagram is
far too complex to be specified explicitly in the genome, it must be compressed through a genomic
bottleneck. I will describe results showing how the genomic bottleneck algorithm can lead to dramatic
compression of networks and better generalization, particularly for rapid transfer in supervised and
reinforcement learning.

Anthony Zador is professor of neuroscience at CSHL.