You are cordially invited to attend the biweekly Brookhaven AI Mixer (BAM). BAM includes one short talk on AI research happening at BNL, followed by an open mixer over coffee and snacks for everyone to network and discuss all things AI. The first half hour will consist of presentations that will be available via ZOOM, and the second half hour will be for in person only networking.

Join us every other Tuesday at noon in CDSD's Training Room (building 725, 2nd floor) to learn about interesting AI methods and applications, engage with potential collaborators, prepare for pending FASST funding calls, and build a community of AI for Science at BNL.

Experiencing Machine Learning in Collider-Accelerator Control System

Abstract: The Relativistic Heavy Ion Collider (RHIC) at Collider-Accelerator Department (C-AD) of BNL provides the world's only high-energy polarized proton beam. It is in the unique position to study where nuclei obtain their spin. During 25 years of operation at RHIC, the C-AD controls group has developed its own control system to tune the accelerator performance, which contains millions of control points. The successful operation of this system will highly affect the machine performance. RHIC's successor, the Electron-Ion Collider (EIC), will be one of the most complex scientific instruments ever built, with the capability of colliding polarized proton and electron beams. The increasing complexity of instruments will require new, sophisticated control methods/tools to tune and optimize the accelerator performance. In this talk, I will summarize some projects developed in recent years that utilize machine learning in the C-AD controls group.

Biography: Dr. Yuan Gao is an assistant scientist at the Collider-Accelerator Department (C-AD) at Brookhaven, primarily working on developing new machine learning schemes in the control group to enhance system performance. His research interests include game theory, algorithm design, anomaly detection, and simulation modeling.

Location: CDS, Bldg. 725, Training Room

Join ZoomGov Meeting: https://bnl.zoomgov.com/j/1604302440?pwd=0x2I95PIvbkkzIi6rA0MNnon5k2sux.1

Meeting ID: 160 430 2440
Passcode: 478223

Title: Class visual similarity based noisy sample removal in generative Few Shot Learning
Time: Thursday, Feb 4, 11:30am - 1:00pm
Zoom:
https://stonybrook.zoom.us/j/8563646526?pwd=anJna1gzUStXNlNVSUIzdDRUSC9CUT09

Meeting ID: 856 364 6526
Passcode: 203791



Abstract:  

Over the past decade, larger datasets, hardware accelerations, and network architecture improvements have contributed to phenomenal achievements in many tasks of computer

vision. However, in the absence of large datasets, computer vision models struggle to learn

general representations which results in poor performance. Few-shot learning tries to address 

this problem by proposing models which learn from a few examples.


I first give an overall review of few-shot learning methods. I particularly focus on generative Few Shot Learning(FSL) methods, which augment the scarce categories in a dataset by generating samples for those rare categories. As the actual class distribution can be complex and lie very close to each other, the sample generated for one class can be noisy or lie close to another class.  However, none of the current FS generative methods perform any form of quality control of the generated samples.


In this work, I propose to identify and remove the generated samples that are less likely to be in the distribution of the few-shot class. Here I particularly deal with few-shot scenarios where the

prior information of the relationship between the classes based on visual  similarity is available. The main idea is to exploit these priors to better identify the unreliable generated samples.


Particularly, I have proposed two methods based on class relationship to detect noisy generated samples. In the first method, we assume that the embedding space of each class follows a Gaussian distribution.  From this assumption, I propose Gaussian Neighborhood (GN), a method to estimate how likely a generated sample is drawn from the estimated distribution of a few-shot class.  We evaluate this method on the Hematopoiesis dataset.  By simply eliminating samples based on thresholding our proposed GN scores, the few-shot  classification  performance  is  improved by 5% and 2% in five shot and one shot respectively, compared to the model trained on all generated images. 


The GN scores represent the similarity distances from the generated samples to their classes, based on the assumption that each class is a Gaussian distribution.  However, this assumption might be strict in many scenarios since the real distributions of data can be arbitrarily complex.  Thus in my second proposed method, I aim to learn such similarity distances directly from data via metric learning. I propose to train a deep-network to regress the similarity distance between a pair of samples.  This network is trained using both the class-level  visual  similarity  information  and  the  class  labels.   This method improves the 1-shot and 5-shot classification performances by 0.5% and 1% respectively, compared to GN.

Join us at the Center for Excellence in Learning and Teaching (CELT) for an interactive Zoom workshop on Generative AI designed for faculty and staff interested in enhancing teaching and assessment practices, increasing student engagement, and navigating the rapidly evolving landscape of AI tools. Participants will be introduced to common AI tools, explore potential instructional uses, and discuss key considerations such as academic integrity, transparency, and equity.

Register now: https://stonybrook.zoom.us/meeting/register/6js1eP64T1ys8tyU57EJ7Q#/registration
AI Seminar: Video Architecture Search - Michael Ryoo Abstract: Video understanding is a challenging problem. Because a video contains spatio-temporal data, its feature representation is required to abstract both appearance and motion information. This is not only essential for automated understanding of the semantic content of videos, such as Web-video classification or sport activity recognition, but is also crucial for robot perception and learning. Previously, convolutional neural networks (CNNs) for videos were normally built by manually extending known 2D architectures such as Inception and ResNet to 3D or by carefully designing two-stream CNN architectures that fuse together both appearance and motion information. However, designing an optimal video architecture to best take advantage of spatio-temporal information in videos still remains an open problem. In this talk, we discuss recent progress in neural architecture search for videos, obtaining more optimal network architectures for video understanding.

Are you concerned about AI issues with your asynchronous online courses? Is your fully online course vulnerable to AI plagiarism? Do you want to engage your online students using AI? Discover the future of education with our AI-powered solutions designed specifically for online asynchronous courses. This innovative approach uses artificial intelligence to transform the way courses are delivered, making learning more personalized, engaging, and effective.

Register here.
The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision. To enroll in this course, you must either: (1) be in the PhD program or (2) receive permission from the instructors.

Each seminar will consist of multiple short talks (around 10 minutes) by multiple people. Students can register for 1 credit for CSE 656. Registered students must attend and present a minimum of 2 or 3 talks. Everyone else is welcome to attend. Fill in https://forms.gle/pCVXovgfMfQwGqG38 to subscribe to our mailing list for further announcement.
Abstract : Humans reason about everyday situations by making commonsense-based inferences, derived both from explicitly stated information and implicit, unstated knowledge. In this thesis, I investigate whether NLP models have different aspects of causal knowledge about events and how to improve their understanding of narratives and plans.
Answering questions about why people perform actions in a narrative can test whether NLP systems contain and can effectively apply causal knowledge about events. I introduce TellMeWhy, a dataset concerning why characters in short narratives perform the actions described. An evaluation of then SOTA finetuned models show that they are far worse than humans. To improve models, it is important to understand what aspects of causal knowledge they need and how to best use external sources to inject this knowledge. In KnowWhy, I analyze different ways of injecting knowledge into models, which is difficult since we do not know apriori what type of knowledge will be needed to answer a question, hence requiring a ranking model to pick the most important inference. Results show that this retrieved knowledge helps models of all sizes, thereby improving their understanding of narratives.
Next, I study whether models can reason about causal aspects of plans. I focus on testing whether they understand the underlying causal dependencies reflected in the temporal order of a plan's steps. I introduce CAT-Bench, and find that SOTA models are underwhelming, and that model answers are not consistent across questions about the same step pairs. In their current state, these models cannot yet reliably be used for complex user-facing tasks. I then measure contemporary models' ability to perform user-facing and user-centric plan customization. I introduce the use of semi-symbolic edits in large language model (LLM) based agents and test several multi-LLM-agent architectures for plan customization. While LLMs still lack the ability to understand complex customization hints, my results suggest that LLM-based architectures may be worth exploring further for other customization applications. Finally, I distill complex reasoning capabilities into small language models (SLMs) using synthetic data that reflects a decomposition-then-editing process for plan customization. I demonstrate that explicitly teaching this latent causal reasoning significantly improves the quality of SLM-generated customizations. Overall, my work has improved how well NLP models understand complex reasoning associated with events in different contexts.

Speaker: Yash Kumar Lal

Location: NCS 220 or Zoom https://stonybrook.zoom.us/j/95849648243?pwd=dgPpZtDpgwQrK9z1SaPpNbBifaorzk.1
Presented by Stony Brook University Department of Biomedical Informatics and Long Island Network for Clinical and Translational Science (LINCATS).

The seminar aims to empower participants with the knowledge and skills necessary to harness AI effectively in clinical practice and research. It will equip attendees with practical insights, case studies, and interactive discussions led by experts in both AI and medicine, fostering a collaborative environment where attendee can explore how to overcome barriers and maximize the potential of AI in transforming modern healthcare delivery.

All Stony Brook Audiences Welcome.
Please note: This exciting event is open to all Stony Brook Faculty/Staff/Students. While the overarching theme for this event is the application of AI in medicine, the event is designed to bridge the professional practice gap that exists between cutting-edge AI research and its practical implementation in clinical settings, While AI holds immense promise for transforming healthcare delivery, many physicians and researchers lack the foundational knowledge and practical skills needed to effectively integrate AI into their daily practices.

THIS CONFERENCE IS FOR STONY BROOK UNIVERSITY & HOSPITAL FACULTY/STAFF & STUDENTS ONLY.


Registration link: https://cme.stonybrookmedicine.edu/continuing-medical-education/conferences/235/bench-to-bedside-understanding-the-practical-application-of-ai-in-medicine-2024/10/17/2024

FOR QUESTIONS
joseph.cesaria@stonybrookmedicine.edu
mary.saltz@stonybookmedicine.edu
Artificial Intelligence is rapidly reshaping research, education, and industry--but its growth carries important environmental implications. From the energy demands of large-scale computing to AI's potential to advance climate modeling, conservation, and sustainable design, the relationship between AI and the environment is both challenging and promising. This interdisciplinary panel explores AI's ecological footprint, its role in environmental solutions, and how universities can pursue innovation while upholding sustainability commitments.

Panelists:
Dana Golden -- PhD student in Economics, Stony Brook University.
Dr. Sharon Pochron -- Associate Professor in Sustainability Studies Program, School of Marine and Atmospheric Sciences, Stony Brook University.
Dr. Jordanna Sprayberry -- Associate Professor, Ecology & Evolution, Director of Undergraduate Biology, Stony Brook University.
Dr. Lav Varshney -- Director of the Artificial Intelligence Innovation Institute (AI3) and inaugural Della Pietra Infinity Chair, Stony Brook University.

Register here.