Abstract: Human gaze behavior is a fundamental cue for understanding social intent, human-machine interaction, and cognitive processes. This thesis addresses the challenges of gaze target estimation (GTE), also known as gaze following, by developing a holistic understanding of gaze in complex environments.

The first part of this work improves GTE performance by introducing Patch-level Distribution Prediction (PDP). Unlike traditional models that rely on strict pixel-wise regression, PDP models gaze as a distribution over patches, which better accounts for annotation variance and bridges the gap between target location and in/out-of-frame prediction. To address the laborious nature of data labeling, the second part presents GCDR, the first semi-supervised method for gaze following. By prompting large Visual Question Answering (VQA) models to generate initial Grad-CAM heatmaps and refining them with a diffusion model, this method achieves high performance with significantly fewer human annotations. The third part expands the applicability of GTE to multi-camera environments. By introducing the Multi-View Gaze Target (MVGT) dataset, along with two novel frameworks for integrating information between multiple views and predicting the gaze target across views, we explore a new direction that overcomes single-view limitations such as face occlusion and out-of-view targets.

Building on these foundations, the final part of this thesis proposes a new direction toward semantic social gaze understanding using next-generation multimodal Large Language Models (LLMs). Rather than focusing solely on geometric gaze target localization, we aim to enrich gaze prediction with semantic and relational interpretation in complex social scenes. To this end, we will leverage existing gaze following datasets to derive social gaze supervision, including mutual gaze and shared attention, and obtain aligned language descriptions of scene-level gaze behaviors. This proposed work will enable the model to not only locate gaze targets but also predict structured social gaze relations among individuals, meanwhile generating a concise natural-language summary describing the dominant gaze interactions. By integrating spatial gaze estimation, social relation reasoning, and language-based scene understanding within a unified multimodal model, this work takes an important step toward a holistic understanding of human gaze behavior in real-world environments.

Speaker: Qiaomu Miao

Abstract:
Quantum Machine Learning (QML) holds significant promise for solving computational challenges across diverse domains. However, its practical deployment is constrained by the limitations of noisy intermediate-scale quantum (NISQ) devices, including noise, limited scalability, and trainability issues in variational quantum circuits (VQCs). We introduce the multi-chip ensemble VQC framework, which partitions high-dimensional computations across smaller quantum chips to enhance scalability, trainability, and noise resilience. We show that this approach mitigates barren plateaus, reduces quantum error bias and variance, and maintains robust generalization through controlled entanglement. Designed to align with current and emerging quantum hardware, the framework demonstrates strong potential for enabling scalable QML on near-term devices, as validated by experiments on standard benchmark datasets (MNIST, FashionMNIST, CIFAR-10).

IACS Student Seminar Speaker:
Junghoon Park, Seoul National University
BA in Economics, Seoul National University, Korea
PhD Candidate for Interdisciplinary Programme in Artificial Intelligence at Seoul National University
Visiting Researcher at Brookhaven National Laboratory


Current Research Interests
Quantum Machine Learning


Recent Papers
Park, J., Cha, J., Chen, S. Y.-C., Yoo, S., & Tseng, H.-H. (2025). Addressing the Current Challenges of Quantum Machine Learning through Multi-Chip Ensembles. In Review at ICML.
Park, J., Kim, K., & Cha, J. (2025). How to Assess AI Ethics: Suggestions for Ethical Rating Agencies. In Review at IJCAI.
Park, J., Cha, J., Chen, S. Y.-C., Yoo, S., & Tseng, H.-H. (2024, 15-20 Sept.). Over the Quantum Rainbow: Explaining Hybrid Quantum Reinforcement Learning. 2024 IEEE International Conference on Quantum Computing and Engineering (QCE).
Park, J., Lee, E., Cho, G., Hwang, H., Kim, B.-G., Kim, G., Joo, Y. Y., & Cha, J. (2024). Gene-Environment Pathways to Cognitive Intelligence and Psychotic-Like Experiences in Children. eLife, 12, RP88117. DOI:10.7554/eLife.88117

This seminar will be held in person (food provided!) in the IACS Seminar Room, and online (zoom link below!)
https://stonybrook.zoom.us/j/96548538719?pwd=jBmI43H68q2UkdcRRjVbTkgrC6F942.1
Meeting ID: 965 4853 8719
Passcode: 493290
University Libraries Present: Qualitative data can be challenging to analyze and interpret effectively. In this workshop, SBU Libraries' Data Literacies Lead, Ahmad Pratama will show you how to extract meaningful insights from textual data, including understanding sentiment trends. Learn to explore qualitative data with Python using word clouds, basic natural language processing (NLP) techniques, and lexicon-based sentiment analysis with VADER.
https://stonybrook.zoom.us/meeting/register/k0r6mPYCRayk2AOGmyd0qw#/registration
The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision. To enroll in this course, you must either: (1) be in the PhD program or (2) receive permission from the instructors. Each seminar will consist of multiple short talks (around 15 minutes) by multiple students. Students can register for 1 credit for CSE656. Registered students must attend and present a minimum of 2 talks. Everyone else is welcome to attend. Fill in https://forms.gle/q6UG9ygauLp2a8Po8 to subscribe to our mailing list for further announcement.

This workshop synthesizes the latest research on the impact of AI usage in education so that you could make informed decisions on whether and how to use AI to facilitate your learning. You might have seen conflicting reports on whether the use of AI is good for learning. In this workshop, we are going to tease out, drawing on the latest research, which types of AI usage are beneficial or harmful for different kinds of learning. At the end of the workshop, you should walk away with more clarity on when and how to use AI for your own learning. Join PRODIG+ fellow on critical AI, Zheng Fu, in this informative workshop.

Register for this Zoom workshop.

This is Stony Brook's quantum moment. Join us for a spotlight on the core achievements and research excellence of faculty across the Colleges of Arts and Sciences (CAS), and Engineering and Applied Sciences (CEAS) - and their collaborative advancements in quantum science and technology. Learn about the real world impact of their enduring work, their leadership in translating foundational science into entrepreneurial opportunities, and their impetus for making connections to next generation innovation.

Presented by: Catherine Chen, Ph.D., Research Development Associate

Welcome remarks: President Andrea Goldsmith

Panel moderators: Dean David Wrobel, CAS, and Dean Andrew Singer, CEAS

Presentations and panel featuring our faculty:

  • Jennifer Cano, CAS, Physics and Astronomy

  • P. Scott Carney, CEAS, Mechanical Engineering

  • Hyeongrak Chuck Choi, CEAS, Electrical and Computer Engineering

  • Eden Figueroa, CAS, Physics and Astronomy

  • Humanshu Gupta, CEAS, Computer Science

  • Angela Kelly, CAS, Physics and Astronomy

Location: Theatre at the Charles B. Wang Center, Stony Brook University

Reserve your tickets by March 26!

The Future of Learning: Rethinking Practice in a Changing World

Thursday, March 26, 2026 (Workshops)
Friday, March 27, 2026 (Symposium)

Open to Stony Brook University Faculty, Staff, and Graduate Students. Hosted by the Center for Excellence in Learning and Teaching, Office of the Provost.

Thursday, March 26, 2026
Workshop: AI Tools and Techniques
  • Open to all faculty & staff
  • Hands-on, exploratory
  • Registration only limited to the size of the room
  • Location: In-person, TBD
  • Time: 10 AM - 12 PM
  • Registration required

Friday, March 27, 2026
Keynote: Teaching and Thinking with AI
  • Faculty, TAs, postdocs, and academic staff
  • In-person on-campus conference venue
  • Location: SAC Balroom
  • Time: 9 AM - 3 PM
  • Registration required

Keynote Speaker: José Antonio Bowen

José Antonio Bowen has been leading innovation and change for over 40 years at Stanford, Georgetown and the University of Southampton (UK), as a dean at Miami University and SMU and as President of Goucher College. Bowen has worked as a musician with Stan Getz, Dave Brubeck, and many others and his symphony was nominated for the Pulitzer Prize in Music (1985).
Bowen holds four degrees from Stanford and has written over 100 scholarly articles and books, including the Cambridge Companion to Conducting (2003), Teaching Naked (2012 and the winner of the Ness Award for Best Book on Higher Education), Teaching Naked Techniques with C. Edward Watson (2017) and Teaching Change: How to Develop Independent Thinkers using Relationships, Resilience and Reflection (Johns Hopkins University Press, 2021).
Bowen has appeared in The New York Times, Forbes, The Wall Street Journal, and has three TED talks. Stanford honored him as a Distinguished Alumni Scholar (2010) and he has presented keynotes and workshops at more than 300 campuses and conferences 46 states and 17 countries around the world. In 2018, he was awarded the Ernest L. Boyer Award (for significant contributions to American higher education). He is a senior fellow for the American Association of Colleges and Universities.

Register here.
The coach who led Team USA to four Math Olympiad gold medals shares his blueprint for staying irreplaceable in an AI-driven world.

As artificial intelligence transforms our world, what skills will remain uniquely human? How can we prepare for careers in an automated future?

Join Carnegie Mellon mathematics professor Po-Shen Loh for insights on navigating the AI revolution by embracing our humanity.

Dr. Loh brings a distinctive perspective shaped by his dual expertise: serving as national coach of the USA Mathematical Olympiad team (which has won four gold medals under his leadership) and developing innovative solutions for real-world challenges from pandemic response to educational technology.

Through his nationwide speaking tour that reached 250 audiences across 100 cities, he has refined a practical framework for thriving alongside AI.

In this presentation, Dr. Loh will explore how creative problem-solving, judgment, and communication become more valuable as automation grows -- and how students and professionals can build those strengths now.

The session includes real-world examples, guidance for education and careers, and a Q&A.

Speaker: Po-Shen Loh is a social entrepreneur and inventor, working across the spectrum of mathematics, education, and healthcare.

A math professor at Carnegie Mellon University, he also served a decade-long term as the national coach of the USA International Mathematical Olympiad (IMO) team, taking the team to gold on numerous occasions.

He has pioneered numerous innovations and has been featured in or co-created YouTube videos with more than 25 million views.

Location: Wang Center Theater

The series is offered by Stony Brook University's Institute for Creative Problem Solving in collaboration with the National Museum of Mathematics (MoMath) and Brookhaven National Laboratory.

The event is free but space is limited. Please register to reserve your space.

Join the Conversation: Share Your Thoughts about Learning, Academics, and AI

The world of college is changing fast, and Artificial Intelligence (AI) is at the center of it. We are part of the Institute on AI, Pedagogy, and the Curriculum with AAC&U, and we need to hear from the people AI affects most: you!

This is an open discussion for all students to share their honest experiences, their top concerns, and their best ideas about AI in our academic environment. We'll be diving into these key questions:
  • How can AI actually make learning better or easier? What opportunities do you see for using AI tools to enhance your assignments, research, or skills?
  • What are your biggest worries about AI? Is it about cheating, being graded fairly, or preparing for the job market? How is AI impacting your workload or stress levels?
  • What specific tools, workshops, or policies would help you use AI responsibly and successfully? (Think training, software, or clear rules.)
Date: Monday, December 1st
Time: 12:30pm-1:45pm
Location: West Campus - Location TBD
or
Date: Wednesday, December 3rd
Time: 10:30am-11:45am
Location: East Campus - HSC 2-154B

Please register in advance so we can confirm the room.

Note: Videos will not be shared publicly and comments will only be shared in aggregate.

Your voice matters. Come tell us how AI is affecting your studies, your stress, and your success!
  • Dr. Rose Tirotta-Esposito (Assistant Provost; Director of CELT)
  • Dr. Elizabeth Hewitt (Associate Professor in the Department of Technology and Society (DTS) in the College of Engineering and Applied Sciences)
  • Chris Kretz (Associate Librarian and Head of Academic Engagement at SBU Libraries)
  • Prof. Rajiv Lajmi (Assistant Professor in the School of Health Professions and Chair of Applied Health Informatics)
  • Dr. Matthew Salzano (Assistant Professor in the Department of Communication in the School of Communication and Journalism)
Virtual Talk: Metadata Matters: Robust Document Classification via Adaptation Methods for Text-driven Public Health by Xiaolei Huang

Zoom link to follow.

Abstract: Document classifiers have been widely applied in solving health-related issues, such as suicide prevention, flu vaccination surveillance and disease diagnosis. However, document metadata including time, gender, age and location has an enormous impact on robustness of 
document classifiers. Language varies across the metadata bringing both challenges and opportunities to build reliable document classifiers. For example, online written language changes over time, and males and females express opinions differently. This talk describes how to use domain adaptation to integrate temporal and user demographic factors into document classifiers. By adapting knowledge of how language varies across the metadata, models can learn generalized representations of language through the metadata-invariant embeddings. 
This approach will lead to metadata-adapted document classifiers and can also extend to personalize classification models by user embedding. 

Bio: Xiaolei Huang is a 4th-year PhD candidate in Information Science at the University of Colorado, Boulder. He is currently a visiting scholar at the Johns Hopkins University. His research interests are in Natural Language Processing, Machine Learning and Public Health. Particularly, he focuses on domain adaptation, cross-lingual transfer learning, user modeling and fairness.