You are cordially invited to attend the biweekly Brookhaven AI Mixer (BAM). BAM includes one short talk on AI research happening at BNL, followed by an open mixer over coffee and snacks for everyone to network and discuss all things AI. The first half hour will consist of presentations that will be available via ZOOM, and the second half hour will be for in person only networking.

Join us every other Tuesday at noon in CDSD's Training Room (building 725, 2nd floor) to learn about interesting AI methods and applications, engage with potential collaborators, prepare for pending FASST funding calls, and build a community of AI for Science at BNL.

#1 How to train your Scientific Chatbot by Alexandr Prozorov, Post-Doctoral Research Associate


Abstract: RHIC is closing its 25-year run with ~1 EB of data and decades of hard-won know-how that risk drifting into obscurity. The RHIC Data & Analysis Preservation Plan (DAPP) pilots an AI assistant that lets physicists talk to RHIC in natural language--searching internal notes, code, workflows, and docs, and pointing to runnable, containerized analyses. Built on Retrieval-Augmented Generation(RAG) with a Model Context Protocol orchestration layer, the system indexes heterogeneous, experiment-specific content and enforces role-aware access
for public vs. collaboration-restricted materials. Takeaway: domain-adapted AI can turn a legacy exabyte into reproducible answers, training assets, and new discovery paths.

Biography: Alexandr Prozorov is a postdoc from Czech Technical University in Prague working in STAR experiment. Fascinated by AI

#2 Quantum AI: Atoms, Cavities and Learning by Raman Kumar, Post-Doctoral Research Associate, Instrumentation Department

Abstract: The Instrumentation Department (IO) in the Discovery Technologies directorate at BNL is engaged in exploring various aspects of quantum systems research. One of the main goals of our group's effort is in developing neutral atom-cavity array platforms for remote entanglement generation and distributed quantum processing. This platform promises to herald truly scalable quantum computing systems and open new paradigms for networking and sensing. In this talk, I will explain our group's research and the role AI is playing in unlocking new insights with two examples. The first application of AI is in fabrication process prediction of micro-cavity structures. The second application revolves around role of AI in quantum error detection and correction in modern quantum computing systems.

Biography: Dr. Raman Kumar is a postdoctoral research associate in the IO department at BNL working with Dr./Prof. Sebastian Will (Columbia U.). Kumar obtained his Ph.D. degree in Electrical and Computer Engineering from the University of Illinois Urbana-Champaign. Prior to joining BNL in Nov 2024, Kumar worked as a postdoc at the City College in New York working on topological photonic quantum sensing using NV centers in diamond. Kumar and Will combined have an extremely wide moat and expertise in a variety of different areas which include Ultra cold atoms and molecules, quantum optics, quantum condensed matter, nanofabrication, semiconductor devices and advanced electromagnetics. Their areas of research interest include scalable quantum computing, communications and sensing, all enabled by AI.

Location: CDS, Bldg. 725, Training Room

Join ZoomGov Meeting https://bnl.zoomgov.com/j/1607892208?pwd=MSjxN5btSeToZsQMwEQzCCbBo5h58V.1

Meeting ID: 160 789 2208
Passcode: 753871

The Vedanta Forum is devoted to one of humanity's oldest and most profound pursuits -- thinking. Thinking about who we truly are: the one that remains constant through childhood and old age, through waking, dream, and deep sleep. Thinking about the source and cause of creation, and its relationship to what inheres in us.

Across history, such thinking, both meditative and scientific, has been aimed at these questions. The ancient Upanishads proclaimed, Tat Tvam Asi -- Thou Art That -- revealing the non-dual identity of the individual and the ultimate reality. Centuries later, modern scientists such as Schrödinger and Bohr echoed similar intuitions about the unity of existence.

Over time, many philosophical approaches, traditions, and interpretive schools have arisen from such inquiry, each offering unique perspectives. The Forum will:

  • Focus on universal approaches and traditions and examine their teachings,

  • Foster comparative studies, and

  • Explore the practical benefits to society from such thinking,

through scholarly studies, dialogue, and debate also promoting accessibility to all qualified seekers. Additionally, the Forum will explore how these reflections can enrich life, education, and even technology.

Location: NCS 120 (New Computer Science), Engineering Dr, Stony Brook, NY 11794.

The program is available at: https://www.vedantaforum.org/events/program

Defending Software Systems from Cyber Attack Campaigns Presented by R. Sekar The DNC hack of 2016, the Equifax breach of 2017, and the spate of ransomware campaigns in 2019 demonstrate the formidable challenges we face in securing our network and software systems against highly stealthy and sophisticated adversaries. In this talk, I will describe two avenues of research we have been pursuing to help tilt the table against such powerful adversaries. The first is software hardening techniques that make software vulnerabilities harder to exploit. To maximize their applicability and ease of use, our techniques are implemented into compilers, or they directly transform binary code. I will outline some of the exciting new developments we have had in this area over the years, including randomization, memory safety, information-flow tracking, control-flow integrity, and code-pointer integrity. We complement this first line of defense with techniques for analyzing and understanding attack campaigns that manage to slip past all deployed defenses. Our techniques can sift through logs consisting of hundreds of millions of events to zoom in on attack activity that may span just a few hundred events. I will describe our experience in mapping out several DARPA-sponsored red team attack campaigns.

Abstract: Computational pathology has revolutionized cancer diagnosis and research through the analysis of digitized whole slide images (WSIs). However, the giga-pixel size of these images presents profound technical challenges, creating two intertwined bottlenecks: computational inefficiency and label inefficiency. The immense data scale makes standard end-to-end (E2E) training of deep neural networks infeasible due to prohibitive GPU memory requirements, while the reliance on expert pathologists for annotations makes obtaining high-quality labeled data a tedious and expensive process. This proposal confronts these dual challenges by developing a series of novel model architectures, training paradigms, and self-supervised learning methods designed to create a more efficient and effective framework for WSI analysis.

To improve computational efficiency, this proposal first introduces a locally supervised learning paradigm that enables E2E training on entire WSIs by partitioning a network into gradient-isolated modules, circumventing the memory bottleneck of backpropagation. Second, it presents Prompt-MIL, a parameter-efficient fine-tuning framework that reduces the number of trainable parameters, memory consumption, and training time by fine-tuning only few prompts to guide large pre-trained models. Third, this work advances the efficient architecture on WSIs by developing novel State-Space Models (SSMs). It proposes 2DMamba, the first intrinsic Mamba architecture that preserves the crucial 2D spatial structure of images, overcoming the spatial discrepancy inherent in 1D models. Fourth, to address the inefficiency of multi-directional scans in Mamba models, including 2DMamba, it presents Locally Bi-directional Mamba (LBMamba), which introduces a novel, hardware-aware local backward scan that integrates bi-directional scan into a single forward pass, significantly improving throughput performance trade-off. Lastly, it proposes an extension to the LBMamba, warp-level Bi-directional Mamba (WLBMamba) that extends the thread-level bidirectional scan to warp-level bidirectional scan that further improves the throughput performance trade-off.

To improve label efficiency, this proposal proposes a Precise Location-based Matching strategy for self-supervised dense contrastive learning. By allowing a local patch in one augmented view to match multiple overlapping patches in another, creates a more accurate correspondence, leading to superior feature representations for dense prediction tasks like segmentation and detection.

In summary, this proposal presents a holistic investigation into the efficiency bottlenecks in computational pathology. Through these combined contributions in model architecture, training paradigms, and self-supervised learning, this work establishes a more scalable, efficient, and powerful computational framework for analyzing giga-pixel pathology images.

Speaker: Jingwei Zhang

Location: Old Computer Science Room 2114

Zoom: https://stonybrook.zoom.us/j/95187903649?pwd=tV0CNxLu1QKqw7hGmcE1h0rJ2C6n1b.1
Meeting ID: 951 8790 3649 | Passcode: 488916

Talk Title: Knowledge-enhanced LLMs and Human-AI Collaboration Frameworks for Creativity Support


Abstract:

Large language models (LLMs) constitute a paradigm shift in Natural Language Processing and Artificial Intelligence. To build AI systems that are human-centered, I propose we need knowledge-aware models and human-AI collaboration frameworks to help them solve tasks ultimately aligning these models better with human values. In this talk, I will discuss my research agenda for human-centered AI with a case study on creativity that focuses on how to augment LMs with external knowledge, build effective human-AI collaboration frameworks as well as theoretically grounded robust evaluation protocols for measuring capabilities of NLG systems. I will begin by describing knowledge-enhanced methods for creative text generation such as metaphors. Next, I will describe how content creators can collaborate and benefit from the creative capabilities of text-to-image-based AI models. Finally, I will focus on the design and development of theoretically grounded evaluation protocols to benchmark the creative capabilities of Large Language Models in both producing as well as assessing creative text. I will end this talk by highlighting the current limitations of existing models and future directions toward building better models that will enable efficient and trustworthy human-AI collaboration systems.


Bio:

Tuhin Chakrabarty is a final-year Ph.D. candidate in the Natural Language Processing group within the Computer Science department at Columbia University. His research is supported by the Columbia Center of Artificial Intelligence & Technology (CAIT) & an Amazon Science Ph.D. Fellowship. He was also a Computational Journalism fellow at NYTimes R&D and an intern at the Allen Institute of Artificial Intelligence, Salesforce Research, and Deepmind. His research interests are broadly in Natural Language Processing, Computer Vision, and Human-Computer Interaction with a special focus on Human-Centered Methods for Understanding, Generation, and Evaluation of Creativity. His work has been recognized at top natural language processing and human-computer interaction conferences and journals such as ACL, NAACL, EMNLP, TACL, and CHI. He has been involved in organizing several workshops and tutorials at NLP conferences such Figurative Language Processing workshop at EMNLP 2022, NAACL 2024, and the tutorial on Creative Text Generation at EMNLP 2023. His work on AI and creativity has been mentioned in mainstream news media such as The Hollywood Reporter and more recently The Washington Post.

Join Zoom Meeting https://stonybrook.zoom.us/j/97103601583?pwd=TnpGMXdpeEd1N0hZcXppS1BLNFJhZz09 (ID: 97103601583, passcode: 004031) Join by phone (US) +1 646-931-3860 (passcode: 004031) Joining instructions: https://www.google.com/url?q=https://applications.zoom.us/addon/invitation/detail?meetingUuid%3DILacj94mRvSXgTYt0Cqs1w%253D%253D%26signature%3D9f2f1e7e603bbcb9034724d084eea8846c19a38b7436180170dfc3f1d718b425%26v%3D1&sa=D&source=calendar&usg=AOvVaw3MsNgLSPMRl8L5i6BosYrB Meeting host: H.Andrew.Schwartz@stonybrook.edu

Join Zoom Meeting:
https://stonybrook.zoom.us/j/97103601583?pwd=TnpGMXdpeEd1N0hZcXppS1BLNFJhZz09
Johannes Hachmann, University of Buffalo Assistant Professor of Chemical Engineering presents Making Machine Learning Work in Chemistry

The use of modern machine learning, informatics and data mining approaches is a relatively new development in the chemical and materials domain. These techniques have been exceedingly successful in other application fields, and since there is no fundamental reason why they should not have a similarly transformative impact on chemical and materials research, there is now a concerted effort by the community to introduce data science in this new context. However, adapting techniques from other application domains for the study of chemical and materials systems requires a substantial rethinking and redevelopment of the existing methods.

In this presentation, we will discuss our work on designing advanced, physics-infused neural network architectures, the fusion of unsupervised clustering with supervised regression for local ensemble models, active and transfer learning techniques, bootstrapping approaches to minimize our training data footprint, methods to increase the applicability domain of data-derived models and automated hyperparameter optimization.

Biosketch: Johannes Hachmann is an Assistant Professor of Chemical Engineering at the University at Buffalo (UB), the Director of the Engineering Science in Data Science graduate program, a Core Member of the UB Computational and Data-Enabled Science and Engineering graduate program, and a Faculty Member of the New York State Center of Excellence in Materials Informatics. He earned a Dipl.-Chem. degree (2004) after undergraduate studies at the universities of Jena and Cambridge, M.Sc. (2007) and Ph.D. (2010) degrees in Chemistry from Cornell University, and he conducted postdoctoral research at Harvard University before joining the UB faculty in 2014. The research of the Hachmann Group fuses (first-principles) molecular and materials modeling with virtual high-throughput screening and modern data science (i.e., the use of database technology, machine learning and informatics) to advance a data-driven discovery and rational design paradigm in the chemical and materials disciplines. One of the centerpieces of the group's efforts is the creation of an open, general-purpose software ecosystem for the data-driven design of chemical systems and the exploration of chemical space. This work was recognized with a 2018 NSF CAREER Award.

To truly understand human language, we must look at words in the context of the human generating the language. Factors such as demographics, personality, modes of communication, and emotional states have shown to play a crucial role in NLP models pre-LLMs era. Steps of mathematically defining the inclusion of human context in language modeling and more will be discussed with Nikita Soni, a PhD student at Stony Brook University co-advised by H. Andrew Schwartz and Niranjan Balasubramanian. She is the lead organizer of the workshop on human-centered large language modeling.

Please register for the STEM Speaker Series Zoom event here

Please RSVP for the STEM Speaker Series in-person event here

As AI drives rapid change across professional fields, how do you bring these developments into your classroom? The CELT AI Panel Discussion will gather academic thought leaders to explore how generative AI is reshaping teaching, learning, and the knowledge students need for today's world. Our panelists will share practical strategies for integrating AI-related advancements into course content, highlight both opportunities and challenges, and discuss how educators can help students build critical thinking, ethical awareness, and hands-on experience with emerging AI technologies. Join us to examine how teaching can evolve alongside an AI-transformed society.

Register here.

Abstract: Self-supervised representation learning (SRL) has emerged as a pivotal advancement in machine learning, offering high-quality data representations without the need for labeled datasets. While SRL has demonstrated enhanced adversarial robustness compared to supervised learning, its resilience against other attack types, particularly backdoor attacks, remains an open question. Recent studies have revealed potential vulnerabilities in SRL, underscoring the necessity for a comprehensive security analysis. However, existing research often extrapolates attacks from supervised learning paradigms, neglecting the unique challenges and opportunities inherent to self-supervised mechanisms.

This thesis proposal aims to address three critical objectives in the realm of self-supervised learning: (1) exploring novel attack vectors, (2) implementing and evaluating practical attacks, and (3) developing robust countermeasures. We focus on two key SRL paradigms: Contrastive Learning and Diffusion Models. For Contrastive Learning, we synthesize existing security vulnerabilities and introduce innovative attack vectors, such as CTRL, to uncover distinctive risks. We conduct a comparative analysis of contrastive and supervised learning approaches in their defense against these threats, exploring potential safeguards and highlighting the limitations of current protective measures in self-supervised contexts. Regarding Diffusion Models, we demonstrate inherent vulnerabilities in their application to adversarial purification.

Our research aims to illuminate the unique challenges posed by emerging attack vectors in self-supervised learning, fostering technical advancements to address underlying security risks in real-world applications. By contributing to the development of more resilient and secure self-supervised representation learning systems, we seek to enhance their reliability and trustworthiness in practical scenarios. This comprehensive examination of SRL's security landscape will provide valuable insights for the broader machine-learning community and pave the way for more robust AI systems.

Join here.
The annual conference on Neural Information Processing Systems is a multi-track interdisciplinary annual meeting that includes invited talks, demonstrations, symposia, and oral and poster presentations of refereed papers. Along with the conference is a professional exposition focusing on machine learning in practice, a series of tutorials, and topical workshops that provide a less formal setting for the exchange of ideas.

For more information and registration, visit the official website.