Abstract: Computer vision seeks to extract semantic and geometric information from images and videos, serving as the perceptual foundation for intelligent systems such as robots and autonomous vehicles. Over the past decade, deep learning has driven remarkable progress in the field, advancing capabilities from 2D recognition to 3D reconstruction. However, the current purely data-driven paradigm faces fundamental challenges, including data inefficiency, curse of high dimensionality, and limited understanding of visual entities beyond individual objects.

In this talk, I will present my recent research on modeling and learning rich visual structures to address these challenges. First, I will introduce a novel framework that integrates explicit visual dependency modeling with deep learning for 2D and 3D dense prediction. Next, I will demonstrate how unfolding the manifold structure of visual data enables unsupervised semantic segmentation. Finally, I will present a recent project that represents, parses, and learns the geometric compositionality of 3D objects to facilitate self-supervised part-whole reconstruction. Through these efforts, I aim to bridge the gap between data-driven deep learning and visual structure modeling, paving the way for more efficient, generalizable, and interpretable computer vision models.

Bio: Dr. Wei Tang is an Assistant Professor in the Department of Computer Science at the University of Illinois Chicago (UIC). He obtained his Ph.D. in Electrical Engineering from Northwestern University, where his dissertation was honored with a Best Dissertation Award. His research interests include computer vision, digital image processing, and machine learning. Dr. Tang has served as an associate editor for several international journals, including Pattern Recognition and Machine Vision and Applications, and as an area chair for leading conferences, including CVPR, ICCV, and WACV. His research has been funded by the National Science Foundation (NSF) and industry partners such as Motorola and Wormpex AI Research.


Location: NCS 115

Zoom: https://stonybrook.zoom.us/j/4624091659?omn=95178138684&jst=3
University Libraries Present: Qualitative data can be challenging to analyze and interpret effectively. In this workshop, SBU Libraries' Data Literacies Lead, Ahmad Pratama will show you how to extract meaningful insights from textual data, including understanding sentiment trends. Learn to explore qualitative data with Python using word clouds, basic natural language processing (NLP) techniques, and lexicon-based sentiment analysis with VADER.
https://stonybrook.zoom.us/meeting/register/k0r6mPYCRayk2AOGmyd0qw#/registration
Abstract: Recent work in NLP uses debates between multiple LLMs to arrive at a more accurate conclusion. Earlier chain-of-thought prompting also shows improvements in accuracy when the model is asked to provide step-by-step reasoning in its response. Many publications since have developed strategies to improve the reasoning of model output with the goal of generating a more accurate result. However, even when asked to provide problem solving steps, the content of the reasoning provided by models is not well studied for all tasks and sometimes contains errors or conflicting statements even when the final result is correct. In fact, when evaluated across reasoning tasks, evidence shows that LLMs are not learning how to reason but are instead mimicking relevant solutions from their training sets.
By studying and evaluating the argumentation that LLMs provide, we can determine factors that may benefit or hinder the model's ability to give a complete, cohesive, and thorough answer. While there are signs that LLMs pattern match, finding where, when, and why this fails is valuable, as there may be ways to help the model imitate solutions that are more relevant to the task it is attempting to solve. Determining when pattern matching is not enough could show an area of improvement for future generations of LLMs. This research may separately aid in work on human-(AI)agent and inter-agent interaction. Specifically, frameworks could be used to determine when and why other models or humans are convinced by LLM-generated responses and which argument methods cause other models to change their response. Our current research in systematic versus heuristic cues shows that large language models sometimes present systematic or heuristic reasoning patterns based on prompting. Future research aims to explore other methods of classifying argumentation.

Speaker: Kiera Gross

Joining link: https://meet.google.com/xae-ywpv-udo
What comes after today's large language models and deep neural networks? Join the Computing Community Consortium (CCC) for a virtual 30-min community chat led by David Jensen, CCC Council Member and lead author of the new CCC whitepaper, Envisioning Possible Futures for AI Research. Jensen will explore paradigm-shifting AI Research Futures like Neuro-Symbolic, Embodied, Multi-Agent, and Quantum AI, and then open the floor to the audience for an engaging Q&A discussion.

Register here.

The University's Main Commencement Ceremony will take place on Friday, May 23, 2025 at 11 am at Kenneth P. LaValle Stadium. Gates open at 10 am.

All guests need a valid ticket to enter LaValle Stadium - no exceptions. Children age 1 and older require a ticket. Seating is first-come, first-served.

Register here.



Abstract: In high-dimensional data spaces, vast empty regions often exist where no known data points are present. These empty spaces are not merely gaps but hold untapped potential for discovering novel configurations, optimizing parameters, and improving decision-making processes. However, traditional exploration techniques struggle to identify and leverage these regions due to the curse of dimensionality. To address this, we introduce the Empty Space Search Algorithm (ESA), a scalable, physics-inspired method that systematically identifies and explores these uncharted voids. ESA operates by modeling the data space as a dynamic system, using a repulsion-attraction mechanism to locate optimal empty space configurations (ESCs) without requiring exhaustive search. Building upon ESA, we present GapMiner, a visual analytics system that integrates human-in-the-loop AI to iteratively refine and validate ESCs. GapMiner combines parallel coordinate visualization, interactive optimization, and deep learning-based predictive modeling to enhance the efficiency of empty space exploration. This methodology has broad applications, including accelerating convergence in evolutionary algorithms through a more diverse initial population, optimizing adversarial learning strategies, and discovering novel parameter configurations in reinforcement learning. Our approach demonstrates that empty space is not just an absence of data but a frontier for new possibilities in high-dimensional problem-solving.
Bio: Xinyu Zhang received his B.E. in Computer Science from Shandong University, Taishan College, in 2019. He is currently a final-year Ph.D. candidate in the Department of Computer Science at Stony Brook University, advised by Prof. Klaus Mueller. His research focuses on multivariate data analysis, scientific visualization, and reinforcement learning. He has published multiple papers in top-tier journals and conferences, including IEEE TVCG and NeurIPS.
*this seminar will be held in person (food provided on a first come, first serve basis), and online (zoom link below)!
Topic: IACS Student Seminar Speaker: Xinyu Zhang
Time: Feb 26, 2025 12:00 PM Eastern Time (US and Canada)
Join Zoom Meeting
https://stonybrook.zoom.us/j/91848218975?pwd=lfITFa61GaXZ2Wsa1B1OnbLQMmXvOE.1

Meeting ID: 918 4821 8975
Passcode: 027337
Abstract: Large language models are prone to memorizing some of their training data. Memorized (and possibly sensitive) samples can then be extracted at generation time by adversarial or benign users. There is hope that model alignment---a standard training process that tunes a model to harmlessly follow user instructions---would mitigate the risk of extraction. However, we develop two novel attacks that undo a language model's alignment and recover thousands of training examples from popular proprietary aligned models such as OpenAI's ChatGPT. Our work highlights the limitations of existing safeguards to prevent training data leakage in production language models.

Speaker: Pegah Alipoormolabashi

Location: CS2311