Join us as we celebrate this year's Brook & Beyond Challenge finalists.
The Office for Research and Innovation invites you to hear about the two-month journey in which the Brook & Beyond team supported eight cohorts in bringing their bold ideas from the lab to the marketplace. It's an energizing evening that highlights the collaboration, creativity, and entrepreneurial spirit driving discovery across the University.
Meet this year's award recipients, hear pitches from the emerging founders, and applaud their achievements.
Connect, celebrate, and be part of the momentum shaping the future of innovation at
Stony Brook University.
Refreshments will be served. Registration is required.
Register Here.
The Institute for AI-Driven Discovery and Innovation hosts Dr. Mary
Simoni for a talk on her music and its intersection with AI, as part
of the Music and AI Seminars series.

The event will be held on Thursday, December 10, 2020, at 3:00 PM.

Abstract: Mary Simoni, Dean of Humanities, Arts & Social Sciences at
Rensselaer Polytechnic Institute will discuss her research in the use
of computer algorithms and technology in the composition and
performance of music. The talk will feature compositions inspired by
Augmented Transition Networks (ATNs), employ motion tracking to
control synthesis parameters, and a work in progress that employs
machine learning using training data that juxtaposes classical music
with COVID-19. During this talk, participants will be introduced to
several technologies that support music information retrieval, machine
learning, and algorithmic composition such as jSymbolic, Weka, and
Common Music.

Zoom details below:
https://stonybrook.zoom.us/j/98236706900?pwd=bDFEZFZtaHBWU0cyL0wxK3UrdUpIdz09
Meeting ID: 982 3670 6900
Passcode: 133945  
Abstract: Human gaze behavior is a fundamental cue for understanding social intent, human-machine interaction, and cognitive processes. This thesis addresses the challenges of gaze target estimation (GTE), also known as gaze following, by developing a holistic understanding of gaze in complex environments.

The first part of this work improves GTE performance by introducing Patch-level Distribution Prediction (PDP). Unlike traditional models that rely on strict pixel-wise regression, PDP models gaze as a distribution over patches, which better accounts for annotation variance and bridges the gap between target location and in/out-of-frame prediction. To address the laborious nature of data labeling, the second part presents GCDR, the first semi-supervised method for gaze following. By prompting large Visual Question Answering (VQA) models to generate initial Grad-CAM heatmaps and refining them with a diffusion model, this method achieves high performance with significantly fewer human annotations. The third part expands the applicability of GTE to multi-camera environments. By introducing the Multi-View Gaze Target (MVGT) dataset, along with two novel frameworks for integrating information between multiple views and predicting the gaze target across views, we explore a new direction that overcomes single-view limitations such as face occlusion and out-of-view targets.

Building on these foundations, the final part of this thesis proposes a new direction toward semantic social gaze understanding using next-generation multimodal Large Language Models (LLMs). Rather than focusing solely on geometric gaze target localization, we aim to enrich gaze prediction with semantic and relational interpretation in complex social scenes. To this end, we will leverage existing gaze following datasets to derive social gaze supervision, including mutual gaze and shared attention, and obtain aligned language descriptions of scene-level gaze behaviors. This proposed work will enable the model to not only locate gaze targets but also predict structured social gaze relations among individuals, meanwhile generating a concise natural-language summary describing the dominant gaze interactions. By integrating spatial gaze estimation, social relation reasoning, and language-based scene understanding within a unified multimodal model, this work takes an important step toward a holistic understanding of human gaze behavior in real-world environments.

Speaker: Qiaomu Miao
Please join us on Zoom for our next event in the Fall 2025 Stony Brook School of Nursing Research Seminar Series presented by our Office of Research and Innovation.

Topic: Responsible Artificial Intelligence: Promoting Health Equity for All

Speaker: Michael P. Cary, Jr., PhD, RN, FAAN.

Dr. Cary is a tenured Associate Professor at the Duke University School of Nursing. Dually trained as a health services researcher and applied health data scientist, Dr. Cary utilizes AI to investigate health disparities in aging populations, thereby promoting health equity and improving healthcare delivery. He co-directs HUMAINE™, an initiative dedicated to equipping nurses and healthcare professionals with the knowledge and skills necessary for the responsible use of AI in clinical practice.

Register: https://web.cvent.com/event/057978a5-a770-4de5-aca5-ad00287e4902/summary

Are you tired of drowning in a sea of resumes and losing top talent in the hiring whirlwind? Transform your hiring process through a different lens and learn about AI in the Workplace and the Applicant Tracking System (ATS). Whether you're a recent graduate seeking your first job or an undergraduate student looking to delve into more career-oriented opportunities, this workshop by SBU Career Center is designed to equip you with the knowledge and strategies needed to succeed.

Register here: https://stonybrook.joinhandshake.com/stu/events/1568133?

CSE 656 Seminars in Computer Vision - Wednesdays 11:30am-12:50pm, Room NCS 120

The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision. To enroll in this course, you must either: (1) be in the PhD program or (2) receive permission from the instructors.

Each seminar will consist of multiple short talks (around 10 minutes) by multiple people. Students can register for 1 credit for CSE656. Registered students must attend and present a minimum of 2 or 3 talks. Everyone else is welcome to attend. Fill in https://forms.gle/pCVXovgfMfQwGqG38 to subscribe to our mailing list for further announcement.

The first meeting will be Wed Jan 29 at 11.30am, room 120 New CS. The meeting will deal with organizational matters and we will start right away with some presentations. Send David Paredes Merino <dparedesmeri@cs.stonybrook.edu> an email if you are interested but cannot attend the first meeting. Please forward to people outside the CS department that you think might be interested.

You are cordially invited to attend the biweekly Brookhaven AI Mixer (BAM). BAM includes one short talk on AI research happening at BNL, followed by an open mixer over coffee and snacks for everyone to network and discuss all things AI. The first half hour will consist of presentations that will be available via ZOOM, and the second half hour will be for in person only networking.

Join us every other Tuesday at noon in CDSD's Training Room (building 725, 2nd floor) to learn about interesting AI methods and applications, engage with potential collaborators, prepare for pending FASST funding calls, and build a community of AI for Science at BNL.

Abstract: The increasing complexity and volume of data from electron microscopy necessitates advanced computational tools for timely and accurate analysis. In this talk, I will present several machine learning (ML) models developed to interpret diverse datasets from transmission electron microscopy (TEM). First, I demonstrate segmentation models for labelling regions of interest from in situ TEM images, such as atomic column positions or reaction sites that allow atomic-level quantitative analysis of data. Second, I introduce a self-supervised CNN model for denoising of low-dose HRTEM images, enabling clearer visualization of atomic features without sacrificing temporal resolution. Finally, a transformer-based model trained to predict copper oxidation states directly from their electron energy loss spectroscopy spectra will be introduced. Together, these projects showcase the power of tailored ML solutions to extract quantitative insights from complex microscopy data.

Biography: Brian Lee is a research associate working for the Electron Microscopy group and Theory and Computation group at the Center for Functional Nanomaterials. Previously, he has received PhD in Mechanical Engineering from Duke University and worked as a postdoc at Purdue University. His research focuses on applying machine learning and simulation techniques for materials science.

Location: CDS, Bldg. 725, Training Room

Join ZoomGov Meeting: https://bnl.zoomgov.com/j/1604383624?pwd=ffQ5cUPNxTI7nzClKQO6cnsNbhF9Vf.1

Meeting ID: 160 438 3624
Passcode: 558449

Title: Cultural Biases, World Languages, and User Privacy in Large Language Models
Abstract: In this talk, I will highlight three key aspects of large language models: (1) cultural bias in LLMs and pre-training data, (2) decoding algorithm for low-resource languages, and (3) human-centered design for real-world applications.

The first part focuses on systematically assessing LLMs' favoritism towards Western culture. We take an entity-centric approach to measure the cultural biases among LLMs (e.g., GPT-4, Aya, and mT5) through natural prompts, story generation, sentiment analysis, and named entity tasks. One interesting finding is that a potential cause of cultural biases in LLMs is the extensive use and upsampling of Wikipedia data during the pre-training of almost all LLMs. The second part will introduce a constrained decoding algorithm that can facilitate the generation of high-quality synthetic training data for fine-grained prediction tasks (e.g., named entity recognition, event extraction). This approach outperforms GPT-4 on many non-English languages, particularly low-resource African languages. Lastly, I will showcase an LLM-powered privacy preservation tool designed to safeguard users against the disclosure of personal information. I will share findings from an HCI user study that involves real Reddit users utilizing our tool, which in turn informs our ongoing efforts to improve the design of AI models.
Bio:

Wei Xu is an Associate Professor in the College of Computing and Machine Learning Center at the Georgia Institute of Technology, where she is the director of the NLP X Lab. Her research interests are in natural language processing and machine learning, with a focus on Generative AI, robustness and fairness of large language models, multilingual LLMs, as well as AI for science, education, accessibility, and privacy research. She is a recipient of the NSF CAREER Award, Google Academic Research Award, CrowdFlower AI for Everyone Award, Best Paper Awards and Honorable Mentions at COLING'18, ACL'23, ACL'24. She also received research funds from DARPA and IARPA. She is currently an executive board member of NAACL. Join Zoom Meeting https://stonybrook.zoom.us/j/98855994362?pwd=F2qnpwL85fhCBHAEW9ZBpXihfwGHsj.1 (ID: 98855994362, passcode: 172797) Join by phone (US) +1 646-876-9923 (passcode: 172797) Joining instructions: https://www.google.com/url?q=https://applications.zoom.us/addon/invitation/detail?meetingUuid%3DuDJcUTvyQueZkCaUSAwFlg%253D%253D%26signature%3Da3d49e0f7f2e74e7130f7308c74bd85ba7b99587b98ba2e34238bb657ca51a09%26v%3D1&sa=D&source=calendar&usg=AOvVaw2jTn5cjfRG8vXU8KHHlU2Y Meeting host: H.Andrew.Schwartz@stonybrook.edu

Join Zoom Meeting:
https://stonybrook.zoom.us/j/98855994362?pwd=F2qnpwL85fhCBHAEW9ZBpXihfwGHsj.1

Abstract:
Quantum Machine Learning (QML) holds significant promise for solving computational challenges across diverse domains. However, its practical deployment is constrained by the limitations of noisy intermediate-scale quantum (NISQ) devices, including noise, limited scalability, and trainability issues in variational quantum circuits (VQCs). We introduce the multi-chip ensemble VQC framework, which partitions high-dimensional computations across smaller quantum chips to enhance scalability, trainability, and noise resilience. We show that this approach mitigates barren plateaus, reduces quantum error bias and variance, and maintains robust generalization through controlled entanglement. Designed to align with current and emerging quantum hardware, the framework demonstrates strong potential for enabling scalable QML on near-term devices, as validated by experiments on standard benchmark datasets (MNIST, FashionMNIST, CIFAR-10).

IACS Student Seminar Speaker:
Junghoon Park, Seoul National University
BA in Economics, Seoul National University, Korea
PhD Candidate for Interdisciplinary Programme in Artificial Intelligence at Seoul National University
Visiting Researcher at Brookhaven National Laboratory


Current Research Interests
Quantum Machine Learning


Recent Papers
Park, J., Cha, J., Chen, S. Y.-C., Yoo, S., & Tseng, H.-H. (2025). Addressing the Current Challenges of Quantum Machine Learning through Multi-Chip Ensembles. In Review at ICML.
Park, J., Kim, K., & Cha, J. (2025). How to Assess AI Ethics: Suggestions for Ethical Rating Agencies. In Review at IJCAI.
Park, J., Cha, J., Chen, S. Y.-C., Yoo, S., & Tseng, H.-H. (2024, 15-20 Sept.). Over the Quantum Rainbow: Explaining Hybrid Quantum Reinforcement Learning. 2024 IEEE International Conference on Quantum Computing and Engineering (QCE).
Park, J., Lee, E., Cho, G., Hwang, H., Kim, B.-G., Kim, G., Joo, Y. Y., & Cha, J. (2024). Gene-Environment Pathways to Cognitive Intelligence and Psychotic-Like Experiences in Children. eLife, 12, RP88117. DOI:10.7554/eLife.88117

This seminar will be held in person (food provided!) in the IACS Seminar Room, and online (zoom link below!)
https://stonybrook.zoom.us/j/96548538719?pwd=jBmI43H68q2UkdcRRjVbTkgrC6F942.1
Meeting ID: 965 4853 8719
Passcode: 493290