AI Innovation Institute

Abstract: Artificial intelligence (AI) is rapidly transforming scientific discovery, enabling breakthroughs in areas ranging from drug discovery to modeling complex physical systems. In the life sciences, AI has traditionally been applied to prediction tasks such as classifying molecules as toxic or non-toxic, estimating drug properties, or solving partial differential equations. These discriminative models have proven powerful, but they are inherently limited to mapping existing inputs to deterministic outputs. A new wave of methods is shifting the paradigm from discrimination to generation: creating new possibilities, such as generating novel molecules or designing new drugs. By reframing AI as both a predictive and generative engine, this shift offers new pathways for accelerating discovery and innovation in life sciences at an unprecedented scale. This talk will cover several aspects of AI for Science (AI4Sci), beginning with advances in discriminative models for molecular systems and solving PDEs, and then turning to generative approaches, including diffusion models for 3D molecular generation and large language models for drug editing. Together, these developments illustrate how moving from prediction to creation is redefining what AI can contribute to science.

Bio: Wenhan Gao is a fourth-year Ph.D. student in Applied Mathematics under the supervision of Professor Yi Liu. He was also a Staff Research Scientist Intern at VISA Research, where he worked on large language models (LLMs) and multi-agent systems for commerce. Wenhan's research focuses on AI for Science (AI4Sci), with a particular emphasis on generative AI. His work looks deep into the fundamental mechanisms of AI models when applied to scientific tasks, and he strives to incorporate established scientific priors, such as symmetry, into model design. He has published papers as a first or corresponding author in leading AI and computational venues, including ICLR, ICML, NeurIPS, TMLR, ACL, and the Journal of Computational Physics. In addition to his research, Wenhan has served as a reviewer and oral session chair for top AI conferences and as a lecturer for both undergraduate and graduate courses at Stony Brook University.

Location: IACS Seminar Room or Zoom

This seminar will take place in person and online*

Join Zoom Meeting: https://stonybrook.zoom.us/j/91670093552?pwd=2EcniXqPZLTpa4ZBKRs1zAjYqs1LS0.1

Meeting ID: 916 7009 3552
Passcode: 434045

Read more about From Prediction to Creation: Transforming Scientific Discovery with Artificial Intelligence

Abstract: Artificial Intelligence (AI) is no longer a futuristic concept -- it is here, but its development, benefits, and risks remain unevenly distributed across industries, nations, and social groups. In this talk, Jieshu presents her research on the societal dimensions of AI from two perspectives: the forces shaping AI's development (backward-looking) and its current and potential impact on society (forward-looking). She first examines disparities in AI, including women's underrepresentation in AI patents and the geographic concentration of AI innovation, highlighting inequalities in who creates AI and who benefits from it. She then explores AI's societal impact, focusing on workforce transformation and the need for GenAI literacy. She will also discuss AI patents, AI's role in climate change mitigation and adaptation, potential environmental biases in LLMs, and gender-specific patterns in AI portrayals in science fiction.

Bio: Jieshu Wang is a Postdoctoral Research Scholar at Arizona State University (ASU), focusing on the social dimensions of artificial intelligence (AI). With a background in engineering, economics, communication, and science and technology studies, she examines how AI both shapes and is shaped by broader societal forces. Her research employs interdisciplinary methods to explore the social, political, and economic factors influencing AI development, as well as its role in innovation, the economy, the future of work, climate change mitigation, and popular culture. Jieshu holds a Ph.D. in Human and Social Dimensions of Science and Technology from ASU. She is also a science book translator and has translated six books.

Location: Old Computer Science, room 1310

Read more about I'm Sorry, Dave... AI is Already Here, But Not Evenly Distributed.

Imagine machines that can see beyond human limitations--drones locating hidden survivors, cameras predicting structural failures, or medical devices detecting tumors beneath the skin. Traditional vision systems are constrained by the boundaries of human perception, missing vast information present in light interactions. This talk explores the development of advanced vision systems that capture underutilized dimensions of light, model intricate light-scene interactions, and extract hidden 3D information--around corners, beneath surfaces, and at high speeds. By jointly developing novel imaging hardware, efficient rendering models, and physics-based learning algorithms, we aim to transcend conventional vision capabilities--unlocking critical applications in autonomous navigation, structural monitoring, and non-invasive medical imaging.

Speaker Bio:

Akshat Dave is a Postdoctoral Associate at MIT Media Lab in the Camera Culture group working with Prof. Ramesh Raskar. He received his Ph.D. from Rice University ECE Department in 2023 where he was advised by Prof. Ashok Veeraraghavan. His research lies at the intersection of applied optics, computer graphics, and computer vision. His research focuses on developing vision systems that go beyond human perception. His work has been recognized by Rice University's Best Thesis Award, OSA Best Paper Prize, and fellowships by Texas Instruments and Qualcomm.

Read more about Superhuman Vision by integrating Cameras, Graphics, and AI

CSE 600 Seminar Series | Fall 2025

Speaker: Jiawei (Joe) Zhou

Read more about Large Language and Multimodal Models in the Wild: Efficiency, Trustworthiness, and Future Outlook

Read more about ICCV 2021 Conference

Dates:

Wednesday, March 3, 2021 - 6:00pm to 7:30pm

Location:

Zoom - contact events@cs.stonybrook.edu for Zoom info.

Event Description:

Women in Computer Science (WiCS), the Society of Women Engineers (SWE), and the Stony Brook Robotics Team (SBRT) are collaborating to host an event called Inspiring Women in STEM Academia: A Community Dialogue to address the lack of female representation in STEM academia.

All are invited to attend so they may gain a better understanding of the challenges faced by their female colleagues and hear perspectives on how they can offer support in the workplace. Given the shockingly disproportionate number of female professionals in STEM academia, we feel that this event would be extremely beneficial for male faculty to listen to and amplify their voices.

It will begin with a discussion panel consisting of Stony Brook professors and faculty who will provide valuable insight into the issue. From there, we will split into smaller discussion groups where student and faculty attendees will be able to voice their opinions, hear about the thoughts/experiences of others, and participate in an engaging discussion with panelists.

The event will be held on March 3rd from 6:00 - 7:30 PM on Zoom.

The following Stony Brook faculty will be panelists:

Dr. Aruna Balasubramanian - Computer Science Professor, WiCS Advisor, WPhD Advisor

Dr. Xinwei Mao - Civil Engineering Assistant Professor

Urszula Zalewski - Director of Experiential Learning, Career Center Advisor (Healthcare)

Dr. Heather Lynch - Ecology and Evolution Professor, Lynch Lab for Quantitative Ecology

Karen Kernan - URECA Director, Simons Summer Research Program Director

Dr. Eszter Boros - Chemistry Assistant Professor, Boros Lab

Dr. Maria Nagan - Chemistry Lecturer, Nagan Research Lab

Read more about Inspiring Women in STEM Academia: A Community Dialogue

TITLE: Towards a Theory of Encode/Decoder Architectures by Andrej Risteski of CMU

ABSTRACT: A common choice of architecture in representation learning (i.e., learning a good embedding of the data) is an encoder/decoder architecture, which tries to map a part of the input into a good latent representation (via an encoder), and predict the remaining part of the input (via a decoder). Two common examples are universal machine translation: where one tries to learn to translate between any pair of a set of languages via a common latent language, given paired up corpora for only a part of the pairs; and contextual encoders -- where one tries to predict a part of the image, given the rest of the image.

We will give a framework for analyzing the sample complexity of such architectures -- i.e., how many pairs of languages do we need to have paired up corpora for? How many image prediction tasks do we have to solve to get a good representation?

Read more about Talk: Towards a Theory of Encode/Decoder Architectures by Andrej Risteski

Abstract:

Photorealistic editing of human facial expressions and head articulations remains a long-standing topic in the computer graphics and computer vision community. Methods enabling such control have great potential in AR/VR applications where a 3D immersive experience is valuable, especially when this control extends to novel views of the scene in which the human subject appears. Traditionally, 3D Morphable Face Models (3DMMs) have been used to control the facial expressions and head pose of a human head. However, the PCA-based shape and expression spaces of 3DMMs lack the expressivity. They cannot model essential elements of the human head such as hair, skin details, and accessories such as glasses that are paramount for realistic reanimation. In this thesis, we present a set of methods that enables facial reanimation, starting from editing expressions in still face images to creating fully controllable neural 3D portraits with control over facial expressions, head pose, and viewing direction of the scene using only casually captured monocular videos from a smartphone to finally achieving studio-like quality from the said monocular captures.
First, we propose a method for editing facial expressions in near-frontal facial images through the unsupervised disentangling of expression-induced deformations and texture changes. Next, we extend facial expression editing to human subjects in 3D scenes. We represent the scene and the subject in it using a semantically guided neural field. This enables control over the subject's facial expressions and the viewing direction of the scene they're in. We then present a method that learns, in an unsupervised manner, to deform static 3D neural fields using facial expression and head-pose dependent deformations, enabling control over facial expressions and head pose of the subject along with the viewing direction of the 3D scene they're in. Next, we propose a method that makes the learning of the aforementioned deformation field robust to strong illumination effects, which adversely impact the registration of the deformation. We then propose an extension of this unsupervised deformation model to 3D Gaussian splatting by constraining it using a 3D morphable model, resulting in a rendering speed of 18 FPS--a 100x speed improvement over prior work. Finally, we propose a method that bridges the quality gap between 3D portraits created using in-the-wild monocular data and multi-view studio capture data. We accomplish this using a two-stage method. First, we train a StyleGAN to relight and inpaint in-the-wild face texture maps (with strong illumination effects and incompletely captured regions). Next, we both reconstruct and generate identity-specific facial details that may be poorly captured in the in-the-wild captures. Once trained, we can generate studio-like complete avatars from monocular phone captures.

Speaker: Shahrukh Athar

Zoom Link:
https://stonybrook.zoom.us/j/94228500743?pwd=RqOBgG6tbJkKaFBlWFwBkYFX0VRovV.1

Meeting ID: 94228500743
Passcode: 661599

Read more about Controllable Neural 3D Portraits: From images to 3D scenes

Language shared online through social media or messaging reflects people's thoughts and emotions. Processing this data with Natural Language Processing (NLP) and machine learning can reveal mental health and psychological traits. For example, analyzing Facebook posts enables me to predict depression before it is clinically diagnosed and highlight particular symptoms. At the population level, billions of geo-tagged Tweets can be used to monitor health risk patterns, including depression and anxiety trends across communities. Beyond assessment, I'm using Large Language Models (LLMs) to improve mental health care, including training therapists and assisting with Cognitive Behavioral Therapy. These applications of NLP and Al may lead to earlier and more effective interventions and improved access for underserved populations. Speaker: Johannes Eichstaedt, Ph.D. Assistant Professor, Psychology & Human-Centered Al, Stanford University

Read more about Measuring and Improving Mental Health through Digital Text and Large Language Models

Join Zoom Meeting
https://stonybrook.zoom.us/j/91945227869?pwd=emhoZDFWVTV0MVdPWW5uVk43MjQzUT09

Meeting ID: 919 4522 7869
Passcode: 452304
One tap mobile
+16468769923,,91945227869# US (New York)
+13126266799,,91945227869# US (Chicago)

Dial by your location
        +1 646 876 9923 US (New York)
        +1 312 626 6799 US (Chicago)
        +1 301 715 8592 US (Germantown)
        +1 669 900 6833 US (San Jose)
        +1 253 215 8782 US (Tacoma)
        +1 346 248 7799 US (Houston)
        +1 408 638 0968 US (San Jose)
Meeting ID: 919 4522 7869
Find your local number: https://stonybrook.zoom.us/u/aCvAYWkRg

Read more about Miller's Monkey Updated: Communicative Efficiency and the Statistics of Words in Natural Languages with Jordan Kodner