AI Innovation Institute

The Association for Computational Linguistics is the international scientific and professional society for people working on problems involving natural language and computation. Membership includes the ACL quarterly journals, Computational Linguistics and Transactions of the ACL, reduced registration at most ACL-sponsored conferences, discounts on ACL-sponsored publications, and participation in ACL Special Interest Groups.

An annual meeting is held each summer in locations where significant computational linguistics research is carried out.

For more information and registration, visit the official website.

Read more about ACL 2026

Join University Libraries for an engaging panel discussion where we delve in and learn about the impacts of artificial intelligence on the 2024 US elections! Panelists are Paige Lord, Tom Costello, and Musa al-Gharbi. The discussion will be moderated by Library Dean, Karim Boughida. Co-sponsored by the Office of Diversity, Inclusion, and Intercultural Initiatives.

Please RSVP for Democracy in the Digital Age: AI's Influence on 2024 Elections here.

Read more about Democracy in the Digital Age: AI's Influence on 2024 Elections

The Hudson River Estuary (HRE) and New York Bight (NYB) are closely connected, with HRE acting as crucial areas where many NYB marine species spawn and grow. Understanding how these biotic and abiotic environments interact, especially with rapid climate change, is key to better managing fisheries and conserving ecosystems. To better understand the HRE-NYB ecosystem, we develop a comprehensive ecosystem model that links physical and biological processes. Using data from long-term monitoring programs, we analyze ecological patterns and identify key factors regulating the ecosystem. We use this information to develop a model that mimics the food web from tiny plankton to large predators in the ecosystem. This model can help us better understand how changes in the environment, like rising temperatures, and human activities such as fishing affect marine lives and ecosystem over time. The insights from this model can support smarter fisheries management and efforts to conserve marine ecosystems in the HRE-NYB region.

IACS Student Seminar Speaker: Xiangyan Yang, Dept. of Applied Math & Statistics

Location: IACS Seminar Room or Zoom

Join Zoom Meeting: https://stonybrook.zoom.us/j/91650247483?pwd=fvAGEwadplJh7jFC5RWcdvZ5NWPJth.1
Meeting ID: 916 5024 7483
Passcode: 631055

Read more about Unveiling the Secrets of Local Estuary-Marine Ecosystem with Modeling

AI Seminar: Video Architecture Search - Michael Ryoo Abstract: Video understanding is a challenging problem. Because a video contains spatio-temporal data, its feature representation is required to abstract both appearance and motion information. This is not only essential for automated understanding of the semantic content of videos, such as Web-video classification or sport activity recognition, but is also crucial for robot perception and learning. Previously, convolutional neural networks (CNNs) for videos were normally built by manually extending known 2D architectures such as Inception and ResNet to 3D or by carefully designing two-stream CNN architectures that fuse together both appearance and motion information. However, designing an optimal video architecture to best take advantage of spatio-temporal information in videos still remains an open problem. In this talk, we discuss recent progress in neural architecture search for videos, obtaining more optimal network architectures for video understanding.

Read more about AI Seminar: Video Architecture Search by Michael Ryoo

Event information here: https://bmi.stonybrookmedicine.edu/sites/default/files/BMI_GrandRounds_Flyer-DrXu.pdf

Read more about BMI Grand Rounds: Automatic analysis of cryo-electron tomography using computer vision and machine learning

The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision. To enroll in this course, you must either: (1) be in the PhD program or (2) receive permission from the instructors. Each seminar will consist of multiple short talks (around 15 minutes) by multiple students. Students can register for 1 credit for CSE656. Registered students must attend and present a minimum of 2 talks. Everyone else is welcome to attend. Fill in https://forms.gle/q6UG9ygauLp2a8Po8 to subscribe to our mailing list for further announcement.

Read more about CSE 656 Seminar in Computer Vision

Predictable Autonomy for Cyber-Physical Systems by Stanley Bak, Safe Sky Analytics

ABSTRACT: Cyber-physical systems combine complex physics with complex software. Although these systems offer significant potential in fields such as smart grid design, autonomous robotics and medical systems, verification of CPS designs remains challenging. Model-based design permits simulations to be used to explore potential system behaviors, but individual simulations do not provide full coverage of what the system can do. In particular, simulations cannot guarantee the absence of unsafe behaviors, which is unsettling as many CPS are safety-critical systems.

The goal of set-based analysis methods is to explore a system's behaviors using sets of states, rather than individual states. The usual downside of this approach is that set-based analysis methods are limited in scalability, working only for very small models. This talk describes our recent process on improving the scalability of set-based reachability computation for LTI hybrid automaton models, some of which can apply to very large systems (up to one billion continuous state variables!). Lastly, we'll discuss the significant overlap of techniques used for our scalable reachability analysis methods with set-based input/output analysis of neural networks.

BIO: Stanley Bak is a computer scientist investigating the predictable design of autonomous cyber-physical systems. He strives to develop practical formal methods that are both scalable and useful, which demands developing new theory, programming efficient tools and building experimental systems. He received a Bachelor's degree in Computer Science from Rensselaer Polytechnic Institute (RPI) in 2007 (summa cum laude), and a Master's degree in Computer Science from the University of Illinois at Urbana-Champaign (UIUC) in 2009. He completed his PhD from the Department of Computer Science at UIUC in 2013. He received the Founders Award of Excellence for his undergraduate research at RPI in 2004, the Debra and Ira Cohen Graduate Fellowship from UIUC twice, in 2008 and 2009, and was awarded the Science, Mathematics and Research for Transformation (SMART) Scholarship from 2009 to 2013. From 2013 to 2018, Stanley was a Research Computer Scientist at the US Air Force Research Lab (AFRL), both in the Information Directorate in Rome, NY, and in the Aerospace Systems Directorate in Dayton, OH. He currently helps run Safe Sky Analytics, a research consulting company investigating verification and autonomous systems, and performs teaching as an Adjunct Professor at Georgetown University.

Read more about CSE 600 Make-Up Opportunity: Predictable Autonomy for Cyber-Physical Systems Talk by Stanley Bak, Safe Sky Analytics

Abstract: Large Language Models (LLMs) have revolutionized how people interact with knowledge, offering unprecedented opportunities to accelerate the pace of scientific discovery. In this talk, I will discuss my research on the synergy between LLMs and scientific knowledge--specifically how these models extract, induce, and verify knowledge to automate the research lifecycle. First, I will cover our work on improving knowledge extraction from vast scientific literature, focusing on enabling models to comprehend long documents in a cost-efficient and comprehensive manner. I will describe a novel paradigm for representing document-level structured information as question-answer pairs and how we address the challenges of long-context understanding by leveraging global context through retrieval-augmented modeling. Next, I present our pioneering work on using LLMs for new scientific hypothesis generation. We introduce a framework employing reinforcement learning with fine-grained reward modeling and adaptive controllers.
This approach balances novelty, feasibility, and effectiveness to generate inspiring and actionable research hypotheses. Finally, I will discuss work on the first LLM Scientist for machine learning research. I will demonstrate how LLMs can move beyond hypothesis generation to participate in the execution and validation of scientific hypotheses, ensuring that the discovered knowledge is not only innovative but also grounded and verified.

Bio: Xinya Du is a tenure-track assistant professor at UT Dallas Computer Science Department. He earned a Ph.D. degree from Cornell University and was a Postdoctoral Research Associate at the University of Illinois (UIUC). He has also worked at Microsoft Research, Google Research, and Allen Institute AI. His research is on large language models, deep learning, and their applications in science.His work has been published in leading NLP and ML conferences (ACL, ICLR, NeurIPS). His research has received multiple recognitions, including a Best Paper Award at AAAI AI for Research and a Best Poster Award at ICML AI for Science workshop. His work was included in the list of Most Influential ACL Papers and has been covered by major media like New Scientist. He was named a Spotlight Rising Star in Data Science by the University of Chicago and is the recipient of several prestigious awards, including the Amazon Research Award, Cisco Research Award, Open Philanthropy Award, and the NSF CAREER Award.

Location: NCS 120

Read more about Scientific Knowledge Discovery with Large Language Models

AI Institute Seminar Title: A Geometric Understanding of Deep Learning Abstract: This work introduces an optimal transportation (OT) view of generative adversarial networks (GANs). Natural datasets have intrinsic patterns, which can be summarized as the manifold distribution principle: the distribution of a class of data is close to a low-dimensional manifold. GANs mainly accomplish two tasks: manifold learning and probability distribution transformation. The latter can be carried out using the classical OT method. From the OT perspective, the generator computes the OT map, while the discriminator computes the Wasserstein distance between the generated data distribution and the real data distribution; both can be reduced to a convex geometric optimization process. Furthermore, OT theory discovers the intrinsic collaborative--instead of competitive--relation between the generator and the discriminator, and the fundamental reason for mode collapse. We also propose a novel generative model, which uses an autoencoder (AE) for manifold learning and OT map for probability distribution transformation. This AE-OT model improves the theoretical rigor and transparency, as well as the computational stability and efficiency; in particular, it eliminates the mode collapse. The experimental results validate our hypothesis, and demonstrate the advantages of our proposed model.

Read more about AI Institute Seminar: David Gu, 'A Geometric Understanding of Deep Learning'

What AI tools are available to help with the scholarly research process? Are they helpful? What do they do and is it worth the time and energy to try them out? Join librarian Christine Fena to explore and compare established and emerging AI research tools such as Elicit, Scite, Consensus, and Undermind. The workshop will not offer a lengthy tutorial on how to use any of these tools, but will provide a starting point to understanding what they are, what new ones are emerging, and how AI research assistants might bring changes to your search process. All are welcome!

Read more about Understanding and Comparing AI Research Tools