AI Innovation Institute

Abstract: Human gaze behavior is a fundamental cue for understanding social intent, human-machine interaction, and cognitive processes. This thesis addresses the challenges of gaze target estimation (GTE), also known as gaze following, by developing a holistic understanding of gaze in complex environments.

The first part of this work improves GTE performance by introducing Patch-level Distribution Prediction (PDP). Unlike traditional models that rely on strict pixel-wise regression, PDP models gaze as a distribution over patches, which better accounts for annotation variance and bridges the gap between target location and in/out-of-frame prediction. To address the laborious nature of data labeling, the second part presents GCDR, the first semi-supervised method for gaze following. By prompting large Visual Question Answering (VQA) models to generate initial Grad-CAM heatmaps and refining them with a diffusion model, this method achieves high performance with significantly fewer human annotations. The third part expands the applicability of GTE to multi-camera environments. By introducing the Multi-View Gaze Target (MVGT) dataset, along with two novel frameworks for integrating information between multiple views and predicting the gaze target across views, we explore a new direction that overcomes single-view limitations such as face occlusion and out-of-view targets.

Building on these foundations, the final part of this thesis proposes a new direction toward semantic social gaze understanding using next-generation multimodal Large Language Models (LLMs). Rather than focusing solely on geometric gaze target localization, we aim to enrich gaze prediction with semantic and relational interpretation in complex social scenes. To this end, we will leverage existing gaze following datasets to derive social gaze supervision, including mutual gaze and shared attention, and obtain aligned language descriptions of scene-level gaze behaviors. This proposed work will enable the model not only to locate gaze targets but also to predict structured social gaze relations among individuals, while also generating a concise natural-language summary describing the dominant gaze interactions. By integrating spatial gaze estimation, social relation reasoning, and language-based scene understanding within a unified multimodal model, this work takes an important step toward a holistic understanding of human gaze behavior in real-world environments.

Speaker: Qiaomu Miao

Read more about A Holistic Approach to Human Gaze Understanding: Probabilistic Modeling, Geometric Reasoning, and Multimodal Learning

The SUNY AI Symposium brings together AI experts from Western New York, across the state and around the country.

This two-day event showcases AI thought leaders, SUNY researchers, students and companies of all sizes who leverage AI to produce positive outcomes in scientific discovery, business innovation and economic impact. Come curious, explore the fascinating world of AI and leave with connections to those at the forefront of innovation.

Read more about SUNY AI Symposium

Are you concerned about AI issues with your asynchronous online courses? Is your fully online course vulnerable to AI plagiarism? Do you want to engage your online students using AI? Discover the future of education with our AI-powered solutions designed specifically for online asynchronous courses. This innovative approach uses artificial intelligence to transform the way courses are delivered, making learning more personalized, engaging, and effective.

Register here: https://stonybrook.zoom.us/meeting/register/RD94cHiHRwCj6xNkCZqNEg

Read more about CELT Workshop: AI Solutions for Asynchronous Online Courses

Join Klaus Mueller, professor of computer science and interim chair of the Department of Technology and Society, as he hosts Sucheta Lahiri.

Lahiri leads the AI Ethics and Risk Management function at Oxy, where she is responsible for ensuring that the company's AI solutions are developed and deployed in a manner that is ethical, efficient, trustworthy, safe, sustainable, and human-centered. She holds a doctorate from Syracuse University, along with two master's degrees in Applied Statistics and Information Science earned in India.

Zoom: https://stonybrook.zoom.us/j/7851507944?omn=98268154363#success

Read more about Learn About Responsible AI from a Leader in Oil and Gas Industry

Fall 2025, Mondays 2 to 3:20 pm, NCS 220. Zoom link to be announced soon.

The seminar will be jointly taught by Prof. Dimitris Samaras (samaras@cs.stonybrook.edu).

The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course is appropriate for people who have already taken a graduate Computer Vision course or who already have research experience in Computer Vision.

To enroll in this course, you must either: (1) be in the Ph.D. program or (2) receive permission from the instructors.

Each seminar will consist of multiple short talks (around 15 minutes) by multiple students. Students can register for 1 credit for CSE 656. Registered students must attend and present a minimum of 2 talks. Registered students must attend in person. Up to 3 absences will be excused. Everyone else is welcome to attend.

Read more about CSE 656 Seminar in Computer Vision | Fall 2025

Virtual Talk: Metadata Matters: Robust Document Classification via Adaptation Methods for Text-driven Public Health by Xiaolei Huang

Zoom link to follow.

Abstract: Document classifiers have been widely applied in solving health-related issues, such as suicide prevention, flu vaccination surveillance and disease diagnosis. However, document metadata including time, gender, age and location has an enormous impact on the robustness of document classifiers. Language varies across the metadata, bringing both challenges and opportunities to build reliable document classifiers. For example, online written language changes over time, and males and females express opinions differently. This talk describes how to use domain adaptation to integrate temporal and user demographic factors into document classifiers. By adapting knowledge of how language varies across the metadata, models can learn generalized representations of language through metadata-invariant embeddings. This approach will lead to metadata-adapted document classifiers and can also be extended to personalize classification models through user embeddings.

Bio: Xiaolei Huang is a 4th-year PhD candidate in Information Science at the University of Colorado, Boulder. He is currently a visiting scholar at the Johns Hopkins University. His research interests are in Natural Language Processing, Machine Learning and Public Health. Particularly, he focuses on domain adaptation, cross-lingual transfer learning, user modeling and fairness.

Read more about Virtual Talk: Metadata Matters: Robust Document Classification via Adaptation Methods for Text-driven Public Health by Xiaolei Huang

ICB&DD 19th Annual Symposium

Iwao Ojima, Director, ICB&DD
Ivet Bahar, Chair, Organizing Committee
Dima Kozakov, Co-Chair, Organizing Committee

There will be poster sessions on projects conducted in ICB&DD members' laboratories as well as other laboratories in the area. Awards will be given to the best three posters.

Please see the following links for registration and poster sessions:
https://www.stonybrook.edu/commcms/icbdd/
https://forms.gle/Wh4UzVx9U4HWStXb8

Read more about Drug Discovery & AI: Advances and New Directions

Dates:

Wednesday, March 3, 2021 - 6:00pm to 7:30pm

Location:

Zoom - contact events@cs.stonybrook.edu for Zoom info.

Event Description:

Women in Computer Science (WiCS), the Society of Women Engineers (SWE), and the Stony Brook Robotics Team (SBRT) are collaborating to host an event called Inspiring Women in STEM Academia: A Community Dialogue to address the lack of female representation in STEM academia.

All are invited to attend so they may gain a better understanding of the challenges faced by their female colleagues and hear perspectives on how they can offer support in the workplace. Given the shockingly disproportionate number of female professionals in STEM academia, we feel that it would be extremely beneficial for male faculty to attend, listen to their female colleagues and amplify their voices.

It will begin with a discussion panel consisting of Stony Brook professors and faculty who will provide valuable insight into the issue. From there, we will split into smaller discussion groups where student and faculty attendees will be able to voice their opinions, hear about the thoughts/experiences of others, and participate in an engaging discussion with panelists.

The event will be held on March 3rd from 6:00 - 7:30 PM on Zoom.

The following Stony Brook faculty will be panelists:

Dr. Aruna Balasubramanian - Computer Science Professor, WiCS Advisor, WPhD Advisor

Dr. Xinwei Mao - Civil Engineering Assistant Professor

Urszula Zalewski - Director of Experiential Learning, Career Center Advisor (Healthcare)

Dr. Heather Lynch - Ecology and Evolution Professor, Lynch Lab for Quantitative Ecology

Karen Kernan - URECA Director, Simons Summer Research Program Director

Dr. Eszter Boros - Chemistry Assistant Professor, Boros Lab

Dr. Maria Nagan - Chemistry Lecturer, Nagan Research Lab

Read more about Inspiring Women in STEM Academia: A Community Dialogue

Climate Uncertainty, Decision Making, and AI for Earth System Predictability
Dr. Nathan Urban, Brookhaven National Laboratory

Bio: Nathan Urban is the group leader of the Optimal Experimental Design & Uncertainty Quantification group in the Applied Mathematics Department at Brookhaven National Laboratory's Computing & Data Sciences directorate (CDS). He holds a Ph.D. in theoretical condensed matter physics from Penn State, and has previously held research positions at Los Alamos National Laboratory, Princeton, and Penn State. His research interests include Bayesian inference and spatiotemporal statistics, probabilistic prediction and forecasting, multi-model / model-form / model structural uncertainty quantification, reduced order modeling, scientific machine learning and hybrid physical-data driven modeling, in-situ/streaming data analysis at scale, information fusion, decision making under uncertainty and optimal experimental design, and integrated multiscale computational frameworks for decision support.

Location: IACS Seminar Room

Lunch will be provided

Read more about Climate uncertainty, decision making, and AI for Earth system predictability

Abstract: Astronomers slowly made sense of the cosmos by following the stars night after night. I suggest we examine human identity in a similar way. Let's observe the words individuals use to describe themselves day after day. In this presentation, I will introduce ipseology - a new approach to studying human selves. Ipseology is the systematic, empirical study of ipseity: selfhood, individuality and the elements of identity. The primary idea is that we can learn a lot about people from their self-authored self-descriptions - especially if we follow their revisions over time. I will discuss results from sampling millions of social media bios over more than a decade and present new approaches for observation in the Post-API age.

Bio: Dr. Jason Jeffrey Jones is a computational social scientist whose expertise includes online experiments, social networks, high-throughput text analysis and machine learning. He is interested in humans' perceptions of themselves and the developing role of artificial intelligence in society.

Dr. Jones is the director of CSSERG (pronounced sea surge): the Computational Social Science of Emerging Realities Group. CSSERG is a team of scholars committed to cross-disciplinary collaboration, united by common computational methodologies and always with eyes on the near future. CSSERG has studied the effectiveness of virtual reality in evoking empathy, the dynamics of gender stereotypes in language over decades and temporal trends in personally expressed identity.

This seminar will take place in person and online (Zoom link below):

Join Zoom Meeting
https://stonybrook.zoom.us/j/93686609778?pwd=KdHVyIbU3ymML6hTchXsm6JLYKLSru.1

Meeting ID: 936 8660 9778
Passcode: 638699

Read more about Ipseology: A New Science of the Self