Spring 2025, Mondays 3.30 to 4.50 pm, NCS 220 and Zoom link to be announced soon.

The seminar will be jointly taught by Prof. Chao Chen, chao.chen.1@stonybrook.edu and Prof. Dimitris Samaras samaras@cs.stonybrook.edu

The overall purpose of this seminar is to bring together people with interests
in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision.

To enroll in this course, you must either: (1) be in the Ph.D. program or (2) receive permission from the instructors.

Each seminar will consist of multiple short talks (around 15 minutes) by multiple students. Students can register for 1 credit for CSE656. Registered students must attend and present a minimum of 2 talks. Registered students must attend in person. Up to 3 absences will be excused. Everyone else is welcome to attend.

OVERVIEW


This workshop, Expanding Horizons in AI with HPC, aims to explore the dynamic intersection of AI and HPC, focusing on how advanced computing can accelerate AI research and applications. As AI models become more complex and data-intensive, traditional computing systems struggle to meet the demand for scalability, efficiency, and speed. HPC offers a solution by providing the necessary infrastructure for training large-scale models, enhancing AI algorithms, and enabling breakthroughs in fields such as deep learning, natural language processing, and autonomous systems.

Through a combination of expert presentations and panel discussions, participants will gain insights into the latest developments in AI-HPC integration. Attendees will also engage in discussions on the future trends, challenges, and ethical considerations surrounding the use of HPC in AI.

The workshop is designed for AI researchers, data scientists, engineers, and HPC professionals seeking to enhance their understanding of how high-performance computing can drive innovation and expand the potential of AI in solving complex, real-world problems.

The workshop will be held at the Wang Center at Stony Brook University.

https://you.stonybrook.edu/hpcai/

PROGRAM

The program features sessions on HPC Architectures for AI, AI Applications in HPC, LLM's in HPC, and AI in HPC Workflows, and open student presentations. The tentative program and list of confirmed speakers is available at https://you.stonybrook.edu/hpcai/program/.

CALL FOR STUDENT PRESENTATIONS & PARTICIPATION

We are excited to offer students the opportunity to present their work in the area of high-performance scientific computing and artificial intelligence at the workshop. We are calling for students to submit their talk proposals (Name + Title) by April 15 to hpc_ai_workshop@stonybrook.edu. The committee will select the best submission to be presented at the workshop. Accepted speakers will be notified by April 22, 2025.

All students, regardless of whether they are presenting, may reach out to hpc_ai_workshop@stonybrook.edu for financial support to cover travel and lodging costs.

REGISTRATION

Registration is available at https://www.eventbrite.com/e/expanding-horizons-in-ai-with-hpc-tickets-1256469978529?aff=oddtdtcreator until May 2nd. The registration fee covers the workshop participation and the social event in the evening of May 9.

Regular registration: $200
Student registration: $100


IMPORTANT NOTE

The registration fee was meant to cover the room rent, catering, and dinner. Thanks to an RF seed grant, we are able to drop the registration fees for SBU students and staff/faculty. We still ask for an informal registration via email to hpc_ai_workshop@stonybrook.edu until April 27, so we can plan for catering and dinner.
Please get in touch with us if you have already registered as an SBU student/faculty/staff member for the workshop so we can handle any reimbursement.

The program is now online at https://you.stonybrook.edu/hpcai/program/.
CSE 656 Seminars in Computer Vision - Wednesdays 11:30am-12:50pm, Room NCS 120

The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision. To enroll in this course, you must either: (1) be in the PhD program or (2) receive permission from the instructors.

Each seminar will consist of multiple short talks (around 10 minutes) by multiple people. Students can register for 1 credit for CSE656. Registered students must attend and present a minimum of 2 or 3 talks. Everyone else is welcome to attend. Fill in https://forms.gle/pCVXovgfMfQwGqG38 to subscribe to our mailing list for further announcement.

Abstract: Traditional questionnaires remain the primary method for assessing psychological outcomes and beliefs, capturing individuals' and populations' inner states. This dissertation presents an alternative computational method that overcomes key limitations in current mental health monitoring, particularly in spatiotemporal resolution, responses to major events, and automatic belief identification. By analyzing ∼1 billion Tweets from 2 million geo-located users, we created a big data pipeline for estimating depression and anxiety at the county-week level. These Language-Based Mental Health Assessments (LBMHA) demonstrated higher reliability and validity than traditional survey measures. Our approach effectively captured mental health trends and highlighted significant increases in mental illness following major events. Using the LBMHA pipeline, we conducted quasi-experiments, research designs that simulate randomized control trials, to generate explanations for mental health changes due to COVID-19 incidence/death. Utilizing these time-series analyses, we conducted discontinuity forecasting for community-specific anxiety shifts using statistical learning via ensemble and contextual models. To likewise investigate individual internal states, we created a novel task and annotated dataset for self belief language identification. Our fine-tuned language model for self-belief classification, despite its relatively small scale, outperformed GPT-4o. The self belief topics identified by our model successfully predicted depression, anxiety, and stress, offering insights into the relationship between self-conceptualization and mental health. The adoption of scalable language-based assessments with modern distributed computation presents a promising avenue for advancing community and individual mental health research.

Speaker: Siddharth Mangalik

https://stonybrook.zoom.us/j/91251321639?pwd=faggV5jZ7ByFDCFmnLXD3HiYxjQ1Eb.1&jst=2

Description:

Curious about what AI image generation tools are out there and how they work? Come down to the library Galleria space (outside the Central Reading Room) to see some demonstrations and learn more about them.

Librarians Chris Kretz and Ahmad Pratama, along with David Ecker of DoIT, will be hosting Explore AI demos from Monday - Wednesday this week on different topics. Whether you're new to AI or an experienced user, stop by and take a look!

Location: Library Galleria

Abstract: Datalog is a powerful language for expressing recursive computations through rules: Horn clauses in first order logic. Although effective at expressing queries over existential properties, Datalog and many of its popular implementations struggle with queries that involve more complex aggregates, requiring users to apply verbose, non-composable, and/or inefficient workarounds. Recent work on lattice-based datalogs addresses many of these concerns for aggregates that can be encoded as lattices (e.g., min or max), but more general aggregates like count remain problematic. In this talk, I will argue that this is not a fundamental limitation of Datalog, but rather from its model of truth: Both datalog semantics and evaluation rules make heavy use of the fact that insertion is both monotone and idempotent. Once a fact is known to be true, it can not be retracted, nor can further discoveries of the same fact alter its truth. Monotonicity is critical for forward progress under Datalog's ``open world'' model, as it allows us to safely assert the truth of a body. Meanwhile, idempotence makes it easier to reason about evaluation, as we need only guarantee that each head atom will be derived at-least-once. Unfortunately, more general aggregates like sum() are neither idempotent, nor monotone. I will introduce Hedgelog, a strict generalization of Datalog that uses general monoids as a basis for truth. I will show that this generalization remains compatible with Datalog's open world model, how it enables cleaner and more composable datalog programs, and how the underlying monoid relations open the door to interesting datastructure-level optimizations.

Bio: Oliver Kennedy is an associate professor at the University at Buffalo. He earned his PhD from Cornell University in 2011 and now leads the Online Data Interactions (ODIn) lab, which operates at the intersection of databases and programming languages. Oliver is the recipient of an NSF CAREER award, an IEEE Region 1 Technological Innovation Award, UB's Exceptional Scholar Award, and several UB SEAS teaching awards. Oliver is also one of the founding board members of Breadcrumb Analytics. Several of Oliver's papers have been invited to Best of compilations from SIGMOD and VLDB. The ODIn lab is currently exploring (i) how we can leverage database techniques like incremental view maintenance to make compilers faster, (ii) how to make it easier for data scientists to track how sources of uncertainty, ambiguity, and/or bias affect analyses, and (iii) how to streamline the interfaces --- both human and software --- between different tools for data science, like python, sql, and spreadsheets.

Location: NCS 120


Abstract:
Large language models (LLMs) have transformed the way humans write code, bringing unprecedented automation to software development. In this talk, I will first provide an overview of my research on enhancing LLMs' code intelligence, optimizing each step of the development pipeline towards more complex software engineering tasks. I will then delve into my key contributions, focusing on how to equip LLMs with a deeper, more comprehensive understanding of software programs. Finally, I will discuss the future of AI-driven software engineering, envisioning a new era of automation that is more reliable, intelligent, and cost-efficient.

Bio:
Yangruibo (Robin) Ding is a Ph.D. candidate in the Department of Computer Science at Columbia University. His research is at the intersection of Software Engineering and Machine Learning, focusing on developing large language models (LLMs) for code. He trains LLMs to generate, analyze, and refine software programs and constructs benchmarks to systematically evaluate LLMs in solving software engineering tasks. He also studies how to improve LLMs' reasoning capability to tackle complex programming tasks, such as debugging and patching. His interdisciplinary research has been published in top-tier conferences of software engineering, programming languages, natural language processing, and machine learning. He won an ACM SIGSOFT Distinguished Paper Award, an IEEE TSE Best Paper Runner-up, and received an IBM Ph.D. Fellowship.
Location:
NCS 120

Abstract: The landscape of machine learning evolves rapidly and the complexity of the networks and their architectures defies easy comprehension. AI is touted as the next scientific revolution by allowing the processing and pattern-finding in increasingly massive data sets. One potential end results could be AI enhanced measurement technologies, but what does that mean? This talk will give examples of how classical tools indicate the technical obstacles to this vision in terms of understanding training processes, model comparisons, and feature embeddings. While the results in this talk are largely empirical, they point to interesting directions for (infomation?) theoretical investigation.

Bio: Anand D. Sarwate is an Associate Professor in the Electrical and Computer Engineering Department at Rutgers, The State University of New Jersey. He received B.S. degrees in math and electrical engineering from MIT and a Ph.D. in electrical engineering from UC Berkeley. Prior to joining Rutgers he was a Research Assistant Professor at TTI-Chicago and a postdoc at the ITA Center at UC San Diego. His research interests include information theory, machine learning, signal processing, optimization, and privacy and security.
Location: Light Engineering 250
Abstract: This dissertation addresses the methodological disconnect between Natural Language Processing (NLP) and human-centric analysis by shifting the unit of analysis from document to human behavior in two broad respects: (i) time-ordering: modeling documents as sequential person-indexed behavioral observations, and (ii) person-level semantics: evaluation and explainability of models by their latent structure of psychological constructs rather than just its predictive accuracy against narrow proxy measures. First, we consider the most basic implication of language as a person's behaviors when measuring their psychological constructs: relationship between language sample size and model's predictive performance. We empirically show that the state-of-the-art transformers are often over-parameterized for typical NLP dataset sizes and can be reduced in dimensionality without performance loss. Establishing the author as the unit of analysis naturally allows us to treat their behavior as a time-ordered sequence. Second, we introduce a longitudinal evaluation framework that establishes ecologically valid evaluation settings, namely, cross-sectional and prospective generalization, and separates error measurement of the model into within-person dynamics and between-person differences. We demonstrate that traditional NLP evaluations based on random document splits can yield reversed conclusions under ecologically valid generalization settings. To address this, we develop models that capture the trajectory of mental states (e.g., mood shifts) rather than static traits. Third, moving into person-level semantics, we evaluate the latent structure of large language models using a novel machine behavior analytic framework. We find that while GPT-4 achieves high predictive correlation with self-reports, its latent symptoms structure diverges from clinical understanding. Finally, we propose a method for modeling multidimensional behaviors, embedding concurrent behavioral signals alongside language to predict future states. Taken together, this work suggests that operationalizing language as behavior advances NLP methods into a rigorous instrument for valid psychological inquiry.

Speaker: Adithya Ganesan

Location: Join Zoom Meeting (ID: 99021939129, Passcode: 569493)