CSE 600 Seminar Series | Fall 2025


Abstract: Large reasoning models have demonstrated capabilities to solve competition-level math problems, answer deep research questions, and address complex coding needs. Much of this progress has been enabled by scaling of data: pre-training data to learn vast knowledge, fine-tuning data to learn natural language reasoning, and RL environments to refine that reasoning. In this talk, I will describe the current LLM reasoning paradigm, its boundaries, and the future of LLM reasoning beyond scaling. First, I will describe the state of reasoning models and where I think scaling can lead to some additional (though perhaps limited) successes. I will then shift to discussing more fundamental issues with models that scale will not resolve in the next few years. I will touch on four current limitations: outdated knowledge, generator-validator gaps, limited creativity, and poor compositional generalization. In all cases, fundamental limitations of LLMs or of supervised learning in general make these problems challenging, inviting future study and novel solutions beyond scaling.

Bio: Greg Durrett is an associate professor in the Department of Computer Science and the Center for Data Science at New York University. His research is broadly in the areas of natural language processing and machine learning. Currently, his group's focus is on reasoning about knowledge in text, verifying correctness of generation methods, and studying how to make progress on problems that defy LLM scaling. He is a 2023 Sloan Research Fellow and a recipient of a 2022 NSF CAREER award. He has served in numerous roles for ACL conferences, recently as a member of the NAACL Board since 2024 and as Senior Area Chair for ACL 2025 and EMNLP 2025. He received his BS in Computer Science and Mathematics from MIT and his PhD in Computer Science from UC Berkeley, where he was advised by Dan Klein.
Over the past decade, Artificial Intelligence (AI) has made stunning advances, from mastering language to solving the structure of proteins. These breakthroughs arise from more than forty years of work in neural networks, where ideas from neuroscience have inspired solutions in AI. In this lecture, Anthony Zador, MD, PhD, will explore how reverse engineering the brain's computations has driven progress in both fields, and how this back-and-forth between neuroscience and AI is set to grow even stronger -- with brain-inspired designs driving new AI advances while AI tools transform our understanding of how the brain works.

Speaker:
Dr. Zador works at the intersection of neuroscience and artificial intelligence. He is the Alle Davis Harris Professor of Biology at Cold Spring Harbor Laboratory, where he served as Chair of Neuroscience. He was named one of Foreign Policy's 100 Leading Global Thinkers and is a recipient of the Brain Research Foundation Fellowship, the Gill Symposium Transformative Investigator Award, and the Allen Distinguished Investigator Award.

Watch online at stonybrook.edu/live
CSE 656 Seminars in Computer Vision - Wednesdays 11:30am-12:50pm, Room NCS 120

The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision. To enroll in this course, you must either: (1) be in the PhD program or (2) receive permission from the instructors.

Each seminar will consist of multiple short talks (around 10 minutes) by multiple people. Students can register for 1 credit for CSE656. Registered students must attend and present a minimum of 2 or 3 talks. Everyone else is welcome to attend. Fill in https://forms.gle/pCVXovgfMfQwGqG38 to subscribe to our mailing list for further announcement.

The first meeting will be Wed Jan 29 at 11.30am, room 120 New CS. The meeting will deal with organizational matters and we will start right away with some presentations. Send David Paredes Merino <dparedesmeri@cs.stonybrook.edu> an email if you are interested but cannot attend the first meeting. Please forward to people outside the CS department that you think might be interested.
Abstract: Theory-internal work on opacity in phonology has been focused on the challenges these interactions present for one theory (rules, constraints) versus another. But there has also been interest in studying the formal, invariant properties of opaque and other process interactions (Chandlee et al. 2018; Bakovic and Blumenfeld 2024), though these works crucially differ in their underlying assumptions. In this talk I will recontextualize Chandlee et al. (2018)'s result that opaque maps are ISL in light of Bakovic and Blumenfeld (2024)'s recent formal typology of process interactions, and this recontextualization will provide an answer to an open question about the k-value of an interaction map. I will then discuss the implications of this collective formal understanding of opacity for a recent model of lexicon and phonological grammar learning (i.e., Hua and Jardine 2021, Chandlee and Jardine to appear).


Speaker: Prof. Jane Chandlee, Associate Professor in the Department of Linguistics at Haverford College

Location: IACS Seminar room.
Abstract: Artificial Intelligence (AI) is no longer a futuristic concept -- it is here, but its development, benefits, and risks remain unevenly distributed across industries, nations, and social groups. In this talk, Jieshu presents her research on the societal dimensions of AI from two perspectives: the forces shaping AI's development (backward-looking) and its current and potential impact on society (forward-looking). She first examines disparities in AI, including women's underrepresentation in AI patents and the geographic concentration of AI innovation, highlighting inequalities in who creates AI and who benefits from it. She then explores AI's societal impact, focusing on workforce transformation and the need for GenAI literacy. She will also discuss AI patents, AI's role in climate change mitigation and adaptation, potential environmental biases in LLMs, and gender-specific patterns in AI portrayals in science fiction.

Bio: Jieshu Wang is a Postdoctoral Research Scholar at Arizona State University (ASU), focusing on the social dimensions of artificial intelligence (AI). With a background in engineering, economics, communication, and science and technology studies, she examines how AI both shapes and is shaped by broader societal forces. Her research employs interdisciplinary methods to explore the social, political, and economic factors influencing AI development, as well as its role in innovation, the economy, the future of work, climate change mitigation, and popular culture. Jieshu holds a Ph.D. in Human and Social Dimensions of Science and Technology from ASU. She is also a science book translator and has translated six books.

Location: Old Computer Science, room 1310
Making sense of Twitter @ Bloomberg presented by Daniel Preotiuc-Pietro

ABSTRACT: The Bloomberg Terminal has provided ways for investors and journalists to sift through and understand the immense volume of tweets and discover financially-relevant content ever since the SEC approved the use of Twitter for company disclosures back in 2013.

In the first part of the talk, I will showcase how tweets impact financial markets and how Bloomberg is using Natural Language Processing methods to identify financially relevant tweets that move the markets. Our processing pipeline feeds directly to clients, journalists in the newsroom and powers several news analytic products offered by the company including trending companies and consumer sentiment for publicly traded equities.

However, understanding user pragmatic intent in individual tweets would allow us to gain deeper insights and enable new applications. I will present several recent research studies focused on understanding intent including identifying complaints and the roles with which vulgarity is used in social media and how these can help improve applications such as sentiment analysis and hate speech detection.

BIO: Daniel Preotiuc-Pietro is a Senior Research Engineer and Team Lead at Bloomberg LP, where he works on analyzing and building models for real-world large scale social media and news mining and information extraction. His research interests are focused on understanding the social and temporal aspects of text, especially from social media, with applications in domains such as Social Psychology, Law, Political Science and Journalism. Several of his research studies were featured in popular press including the Washington Post, BBC, New Scientist, Scientific American or FiveThirtyEight. He is a co-organizer of the Natural Legal Language Processing workshop series. Prior to joining Bloomberg LP, Daniel was a postdoctoral researcher at the University of Pennsylvania with the interdisciplinary World Well Being Project and obtained his PhD in Natural Language Processing and Machine Learning at the University of Sheffield, UK.



Matthew Salzano (Stony Brook), AI and DEIA: Getting at the Roots

Link to the talk (no pre-registration required this time): https://stonybrook.zoom.us/j/96209347479?pwd=Cs8fEfFdbXrGTC5cQgyHRb8Msh5vp8.1Meeting ID: 962 0934 7479 Passcode: 272489

Abstract: Conversations about AI and DEIA (Diversity, Equity, Inclusion, and Access) often unwittingly assume that social problems can and should have technical fixes. Left unaddressed, scholars, advocates, and technologists inevitably miss important consequences in our proposed solutions, and focus on surface-level problems rather than addressing the root causes of inequity. Drawing from scholarship in communication, rhetoric, and critical digital studies, this talk explains how we are often trimming branches when we need to pull out roots -- and introduces new terms and questions that can help reorient our conversations about AI and DEIA.

Speaker Bio: Matthew Salzano, Ph.D., is a communication scholar researching new media technologies, user practices, and cultural trends that threaten to limit possibilities for diverse engagement in public argument, debate, and protest. His scholarship has appeared in journals like The Quarterly Journal of Speech, Critical Studies in Media Communication, and Women's Studies in Communication, and his research on DEIA, AI, and advocacy communications has been funded by the Waterhouse Family Institute at Villanova University. He is currently an Inclusion, Diversity, Equity, and Access fellow in Ethical AI at Stony Brook University's School of Communication and Journalism and Alan Alda Center for Communicating Science.

Abstract: Jailbreak attacks circumvent LLMs' built-in safeguards by concealing harmful queries within adversarial prompts. While most existing defenses attempt to mitigate the effects of adversarial prompts, they often prove inadequate as adversarial prompts can take arbitrary, adaptive forms. This paper introduces RobustKV, a novel jailbreak defense that takes a fundamentally different approach by selectively removing critical tokens of harmful queries from key-value (KV) caches. Intuitively, for an adversarial prompt to be effective, its tokens must achieve sufficient `importance' (measured by attention scores), which consequently lowers the importance of tokens in the concealed harmful query. Therefore, by carefully evicting the KVs of low-ranked tokens, RobustKV minimizes the harmful query's presence in the KV cache, thus preventing the LLM from generating informative responses. Extensive evaluation using benchmark datasets and models demonstrates that RobustKV effectively counters state-of-the-art jailbreak attacks while maintaining the LLM's performance on benign queries. Notably, RobustKV creates an interesting effectiveness-evasiveness dilemma for the adversary, leading to its robustness against adaptive attacks.

Speaker: Tanqiu Jiang

Where: NCS 220 and Zoom (https://stonybrook.zoom.us/j/6406956411)