Abstract: Implicit functions have long been a fundamental representation for both 2D and 3D objects in computer graphics, playing a significant role in the field's early development. With the rise of 3D deep learning and the rapid advancement of neural rendering techniques, implicit representations of 3D shapes have regained significant attention in recent years. In this talk, I will present several recent research projects focusing on implicit function-based 3D reconstruction and neural rendering. Furthermore, I will discuss potential future developments in this dynamic and rapidly evolving field.

Biography: Ying He is an Associate Professor at the College of Computing and Data Science, Nanyang Technological University, where he also serves as the Director of the Centre for Augmented and Virtual Reality. His research interests lie in geometric computation and analysis, with applications spanning computer graphics, 3D vision, computer-aided design, multimedia, and wireless sensor networks. Dr. He is an active member of the technical program committees for major conferences on geometric modeling and has served on the editorial boards of IEEE Transactions on Visualization and Computer Graphics, Computer Graphics Forum, and Computational Visual Media. He has also taken on key leadership roles as General/Program Co-Chair for several conferences, including Shape Modeling International (SMI) 2022, Solid and Physical Modeling (SPM) 2022 & 2023, Geometric Modeling and Processing (GMP) 2014 & 2021, and Computational Visual Media (CVM) 2020. For more information, please visit https://personal.ntu.edu.sg/yhe/

Location: NCS 115

Abstract: Jailbreak attacks circumvent LLMs' built-in safeguards by concealing harmful queries within adversarial prompts. While most existing defenses attempt to mitigate the effects of adversarial prompts, they often prove inadequate as adversarial prompts can take arbitrary, adaptive forms. This paper introduces RobustKV, a novel jailbreak defense that takes a fundamentally different approach by selectively removing critical tokens of harmful queries from key-value (KV) caches. Intuitively, for an adversarial prompt to be effective, its tokens must achieve sufficient `importance' (measured by attention scores), which consequently lowers the importance of tokens in the concealed harmful query. Therefore, by carefully evicting the KVs of low-ranked tokens, RobustKV minimizes the harmful query's presence in the KV cache, thus preventing the LLM from generating informative responses. Extensive evaluation using benchmark datasets and models demonstrates that RobustKV effectively counters state-of-the-art jailbreak attacks while maintaining the LLM's performance on benign queries. Notably, RobustKV creates an interesting effectiveness-evasiveness dilemma for the adversary, leading to its robustness against adaptive attacks.

Speaker: Tanqiu Jiang

Where: NCS 220 and Zoom (https://stonybrook.zoom.us/j/6406956411)
Join a faculty development program to support instructors across campus with navigating/integrating AI in their courses. We're inviting interested faculty to participate in the grant project called Fostering Writing-to-Learn Skills with Critical AI Literacy: A Faculty Development and Student Support Program (funded through the AI3 Institute).

Time commitment and completion requirements :

  • Attend four sessions and a final symposium on the following dates/times:

    • Friday, September 12 from 11am - 12:30pm over Zoom

    • Friday, September 26 from 11am - 12:30pm over Zoom

    • Friday, October 10 from 11am - 12:30pm over Zoom

    • Friday, October 24 from 11am - 12:30pm over Zoom

    • Friday, November 14 from 10am - 1pm in Wang 201 - please note that this is an in person session only

  • Engage with online materials in Brightspace prior to each of the sessions (mainly to update a syllabus, assignment, or teaching strategy that you can share and discuss at the workshop)

Contact: Shyam Sharma, Christine Fena, and Rose Tirotta-Esposito with questions.

https://docs.google.com/document/d/1b51tvfK0HSOkCW7cwYq2nyyeeHtvBZYC7_XHv7Av8wQ/edit?tab=t.0

OVERVIEW


This workshop, Expanding Horizons in AI with HPC, aims to explore the dynamic intersection of AI and HPC, focusing on how advanced computing can accelerate AI research and applications. As AI models become more complex and data-intensive, traditional computing systems struggle to meet the demand for scalability, efficiency, and speed. HPC offers a solution by providing the necessary infrastructure for training large-scale models, enhancing AI algorithms, and enabling breakthroughs in fields such as deep learning, natural language processing, and autonomous systems.

Through a combination of expert presentations and panel discussions, participants will gain insights into the latest developments in AI-HPC integration. Attendees will also engage in discussions on the future trends, challenges, and ethical considerations surrounding the use of HPC in AI.

The workshop is designed for AI researchers, data scientists, engineers, and HPC professionals seeking to enhance their understanding of how high-performance computing can drive innovation and expand the potential of AI in solving complex, real-world problems.

The workshop will be held at the Wang Center at Stony Brook University.

https://you.stonybrook.edu/hpcai/

PROGRAM

The program features sessions on HPC Architectures for AI, AI Applications in HPC, LLM's in HPC, and AI in HPC Workflows, and open student presentations. The tentative program and list of confirmed speakers is available at https://you.stonybrook.edu/hpcai/program/.

CALL FOR STUDENT PRESENTATIONS & PARTICIPATION

We are excited to offer students the opportunity to present their work in the area of high-performance scientific computing and artificial intelligence at the workshop. We are calling for students to submit their talk proposals (Name + Title) by April 15 to hpc_ai_workshop@stonybrook.edu. The committee will select the best submission to be presented at the workshop. Accepted speakers will be notified by April 22, 2025.

All students, regardless of whether they are presenting, may reach out to hpc_ai_workshop@stonybrook.edu for financial support to cover travel and lodging costs.

REGISTRATION

Registration is available at https://www.eventbrite.com/e/expanding-horizons-in-ai-with-hpc-tickets-1256469978529?aff=oddtdtcreator until May 2nd. The registration fee covers the workshop participation and the social event in the evening of May 9.

Regular registration: $200
Student registration: $100


IMPORTANT NOTE

The registration fee was meant to cover the room rent, catering, and dinner. Thanks to an RF seed grant, we are able to drop the registration fees for SBU students and staff/faculty. We still ask for an informal registration via email to hpc_ai_workshop@stonybrook.edu until April 27, so we can plan for catering and dinner.
Please get in touch with us if you have already registered as an SBU student/faculty/staff member for the workshop so we can handle any reimbursement.

The program is now online at https://you.stonybrook.edu/hpcai/program/.
Abstract: Many unresolved legal questions over LLMs and copyright center on memorization: whether specific training data have been encoded in the model's weights during training, and whether those memorized data can be extracted in the model's outputs. While many believe that LLMs do not memorize much of their training data, recent work shows that substantial amounts of copyrighted text can be extracted from open-weight models. However, it remains an open question if similar extraction is feasible for production LLMs, given the safety measures these systems implement. We investigate this question using a two-phase procedure: (1) an initial probe to test for extraction feasibility, which sometimes uses a Best-of-N (BoN) jailbreak, followed by (2) iterative continuation prompts to attempt to extract the book. We evaluate our procedure on four production LLMs -- Claude 3.7 Sonnet, GPT-4.1, Gemini 2.5 Pro, and Grok 3 -- and we measure extraction success with a score computed from a block-based approximation of longest common substring (nv-recall). With different per-LLM experimental configurations, we were able to extract varying amounts of text. For the Phase 1 probe, it was unnecessary to jailbreak Gemini 2.5 Pro and Grok 3 to extract text (e.g, nv-recall of 76.8% and 70.3%, respectively, for Harry Potter and the Sorcerer's Stone), while it was necessary for Claude 3.7 Sonnet and GPT-4.1. In some cases, jailbroken Claude 3.7 Sonnet outputs entire books near-verbatim (e.g., nv-recall=95.8%). GPT-4.1 requires significantly more BoN attempts (e.g., 20X), and eventually refuses to continue (e.g., nv-recall=4.0%). Taken together, our work highlights that, even with model- and system-level safeguards, extraction of (in-copyright) training data remains a risk for production LLMs.

Speaker: Xinyue

Location: CS2311
We live in a new scientific paradigm: the Big Data era, in which a lot of data is available for almost anything. In this new paradigm, the driving force is to use data directly to learn about chemical and physics systems employing artificial intelligence. This paradigm has proven helpful in simulating realistic physical, biological, and chemical models, yielding impressive results. Similarly, the insight gained in these situations can be used to improve our understanding of fundamental processes. In that regard, we want to answer the question: Can a machine learn chemistry? The answer to this question is still debatable, but we will show our ideas and methods to find the answer. We will also discuss our results on predicting atom-diatom reactions and other avenues and work in progress in our group.

Please register for the STEM Speaker Series: Can a Machine learn Chemistry here.
Abstract: DeepSeek-R1-Zero has shown that reinforcement learning (RL) at scale can directly enhance the reasoning capabilities of LLMs without supervised fine-tuning. In this work, we critically examine R1-Zero-like training by analyzing its two core components: base models and RL. We investigate a wide range of base models, including DeepSeek-V3-Base, to understand how pretraining characteristics influence RL performance. Our analysis reveals that DeepSeek-V3-Base already exhibit ''Aha moment'', while Qwen2.5 base models demonstrate strong reasoning capabilities even without prompt templates, suggesting potential pretraining biases. Additionally, we identify an optimization bias in Group Relative Policy Optimization (GRPO), which artificially increases response length (especially for incorrect outputs) during training. To address this, we introduce Dr. GRPO, an unbiased optimization method that improves token efficiency while maintaining reasoning performance. Leveraging these insights, we present a minimalist R1-Zero recipe that achieves 43.3% accuracy on AIME 2024 with a 7B base model, establishing a new state-of-the-art.

Speaker: Md. Saqib Hasan

Location: CS2311

You are cordially invited to attend the biweekly Brookhaven AI Mixer (BAM). BAM includes one short talk on AI research happening at BNL, followed by an open mixer over coffee and snacks for everyone to network and discuss all things AI. The first half hour will consist of presentations that will be available via ZOOM, and the second half hour will be for in person only networking.

Abstract: Designing custom proteins could revolutionize medicine and materials, but it remains an immense scientific challenge. Our work uses large-scale AI foundation models to generate novel proteins tailored to bind specific small molecules. Each AI-generated design is passed through a rigorous, multi-stage validation pipeline to ensure it is biophysically realistic. A key innovation is fine-tuning our model with data from molecular dynamics (MD) simulations, exposing it to the conformational dynamics and energetics of protein-ligand binding. This physics-aware training results in novel protein designs with enhanced stability and more effective binding capabilities.

Bio: Xin Dai is an Assistant Computational Scientist in the Artificial Intelligence Department of the CDS. His work centers on AI for Science with a strong focus on computational biology. He earned his PhD in Physics from Tsinghua University.

Join us every other Tuesday at noon in CDSD's Training Room (building 725, 2nd floor) to learn about interesting AI methods and applications, engage with potential collaborators, prepare for pending FASST funding calls, and build a community of AI for Science at BNL.

Location: CDS, Bldg. 725, Training Room

Join Zoom Meeting: https://bnl.zoomgov.com/j/1604383624?pwd=ffQ5cUPNxTI7nzClKQO6cnsNbhF9Vf.1

Meeting ID: 160 438 3624
Passcode: 558449