Abstract: In this talk, we will discuss what a CS PhD entails and the traits and habits that are important for success in PhD programs and future careers. While the talk is targeted to first-year PhD students, PhD students at all levels should derive from it.

Bio: Samir Das is a professor in the Department of Computer Science at Stony Brook
University. He is currently serving as the department chair. He is well recognized in the
community for his research in wireless networks and systems.

Location: NCS120
Abstract: Many scientific and engineering challenges, such as the design of materials or molecules or the control of experimental systems, rely on the existence of fast predictive models that can evaluate potential designs or control policies. Traditionally this has been accomplished through numerical simulation; more recently data-driven machine learning methods have been applied. However, both approaches leave gaps: physical modeling can be accurate and extrapolates well to previously-unstudied conditions, but it is often computationally expensive and relies on physics approximations that may not be valid. Machine learning can generalize from massive amounts of real-world or simulation data, but suffers from physical grounding and extrapolation into new regimes, as well as in settings where large data sets do not exist.
In this talk I explore an intermediate regime, which is hybrid reduced order models: fast simplified physics approximations where some of the unknown or approximated equations are replaced with data-driven machine learning components. Examples include coarse-grained models where the full macroscopic equations cannot be derived from first-principles microscopic equations, multiscale models with unknown closure terms or sub-grid parameterization schemes, and low-order or latent dynamical systems that learn governing equations on a low-dimensional reduced state space. I discuss how such reduced systems can be identified from very limited data, much less than is often needed in traditional machine learning but at much lower time-to-solution than traditional numerical modeling. This facilitates not only system design and control but also uncertainty quantification approaches that search the space of possible equations for predictive models that can explain the data. I will focus on an example from materials science concerning the design of self-assembling block copolymer nanomaterials.

Speaker: Dr. Nathan Urban, Applied Mathematics Department, Brookhaven National Laboratory

Location: Laufer 101

Zoom: https://stonybrook.zoom.us/j/96090260834?pwd=mw8QTHbMOw9oeU9hazZeoq8bN4VIfH.1
Meeting ID: 960 9026 0834 Passcode: 374969

This workshop is intended for researchers, practitioners, students, and industry professionals in AI, robotics, machine learning, human-robot interaction, and related fields.

Workshop Overview:

Instead of learning from data alone, an embodied AI system learns through its movements, sensors, and interactions with the environment. This form of active, experience-based learning, informed by ongoing self-evaluation of its own abilities, enables embodied AI systems to adapt on the fly, understand context rather than just commands, and collaborate with humans in more natural and trustworthy ways.

Workshop Goals:

  1. Foster interdisciplinary dialogue across AI, robotics, and cognitive science.
  2. Identify key challenges and future research directions in embodied intelligence.
  3. Examine the role of embodiment in advancing toward AGI.

This workshop is Invitation-only. Please email Dr. IV Ramakrishnan (ram@cs.stonybrook.edu) to attend.

Read the announcement: https://mcusercontent.com/237207911c0fd4c1f78dd8524/files/070dec2e-a2f5-143e-0fe2-c4ebecdb5193/Embodied_AI_Workshop_Invitation_.pdf

CG Group member (and SBU faculty) Chao Chen will speak on Fri, March 12, about the use of topological data analysis in machine learning for image analysis.
Chao has shared some of his research with the CG Group previously, and this will be a great opportunity to learn more about this exciting research area related to computational geometry/topology!

Time: Friday, March 12, 2pm-3pm
Place: Zoom
https://stonybrook.zoom.us/my/profweizhu?pwd=RjVIVXg3YUhudzZZQ3pheHUydTJBUT09



Title: Learning with Topological Information - Image Analysis and Label Noise
Speaker: Prof. Chao Chen (SBU)

Abstract: Modern machine learning faces new challenges. We are
analyzing highly complex data with unknown noise. Topology provides
novel structural information to model such data and noise. In this
talk, we discuss two directions in which we are using topological
information in the learning context. In image analysis, we propose a
topological loss to segment and to generate images with not only
per-pixel accuracy, but also topological accuracy. This is necessary
in analysis of images of fine-scale biomedical structures such as
neurons, vessels, etc.  Extracting these structures with correct
topology is essential for the success of downstream
analysis. Meanwhile, we discuss how to use topological information to
train classifiers robust to label noise. This is important in practice
especially when we are using deep neural networks which tend to
overfit noise. These results have been published in NeurIPS, ECCV,
ICML and ICLR.

The Program in Writing and Rhetoric
Invites you to
A Rhetorical/Deliberative Framework for AI Language Model Alignment
featuring
Prof Zoltan Majdik Professor
North Dakota State University
In this talk, Prof. Majdik proposes a framework for aligning LLMs with values grounded in the norms of rhetorical culture and deliberative democracy. Alongside long-standing AI alignment value targets like safety and transparency, this AI alignment framework assesses to what extent a language model exhibits human and humane values that foster communicative engagement, and it codifies approaches to tuning existing models to better align with such values.

Location: Humanities 1008
Abstract:
People shift their visual attention to gather and prioritize information from their surroundings, helping them navigate complex environments. Understanding these attentional shifts involves decoding the features that guide where attention is directed (spatial areas of focus) and when attention shifts (timing). Decoding these processes can aid applications from interface design to medical diagnosis. However, prior models have not fully explored the underlying factors addressing these aspects. In this dissertation, we study the factors that guide visual attention across diverse image types, spanning natural images, graphic design documents, and whole slide images (WSIs) of cancer tissues, while also predicting visual attention based on these factors.
First, we propose a method to quantify object recognition uncertainty as a factor influencing spatio-temporal attention (where and when) in natural images. We found that it plays a larger role than bottom-up saliency in guiding visual attention. Second, we analyze graphic design documents such as webpages, comics, posters, mobile UIs, etc., which differ from natural images in that they are designed to convey specific messages or elicit desired viewer response. We propose a unified and interpretable deep learning model that predicts both static and dynamic visual attention behavior (addressing where and when) by integrating document layout and content saliency as factors, enhancing attention prediction performance. Finally, in the domain of digital pathology, we investigate pathologists' attention during their examination of giga-pixel WSIs of prostate cancer with an objective to aid in the development of computer-assisted pathology training and clinical decision support systems. Using a digital microscope interface, we collected the largest known dataset of pathologist attention, which allows us to study the factors that guide their spatial and temporal attention patterns (where and when) and develop predictive models. Our study explores key factors guiding their attention, including magnification, slide staining, the nature of the diagnostic task, and their expertise. Motivated by this analysis, we propose deep learning models to solve two tasks: 1) predicting pathologist attention via spatial (heatmaps) and spatio-temporal (scanpaths) models, and 2) inferring pathologist expertise level, both essential technical components towards developing an AI-assisted pathology training pipeline.

Speaker:
Souradeep Chakraborty

Location: New Computer Science Bldg., Room 220

Zoom Link: https://stonybrook.zoom.us/j/9755288447?pwd=TW95T2xqOUZjRnlqcnVFcUQvN0JMdz09
Meeting ID: 975 528 8447
Passcode: 338037

Join the Department of Computer Science as we welcome Lyle Ungar, University of Pennsylvania, who will be delivering a lecture on 'Measuring Cultural Variation using Natural Language Processing.' When: 11/08/24 @ 2:30 PM Where: New Computer Science Building, Room 120. Reception to follow. Abstract: Cultures vary widely in how they view the world, for example being more individualist or collectivist. Such cultural differences are, of course, reflected in the words that people use. We first show a variety of ways in which multilingual language models are not multicultural; they speak Hindi or Mandarin, but still think like Americans. In contrast, we then present a scalable method that uses embedding-derived lexica to successfully measure regional variation in culture. Bio: Lyle Ungar is a Professor of Computer and Information Science at the University of Pennsylvania, where he also holds secondary appointments in Psychology, Bioengineering, Genomics and Computational Biology, and Operations, Information and Decisions. His group uses natural language processing and explainable AI for psychological research, including analyzing social media and cell phone sensor data to better understand the drivers of physical and mental well-being. They are currently building socio-emotionally sensitive GPT-based tutors and coaches.


Time:
Sep 7, Tue, 11:00am EDT

Place:
NCS 220 or on Zoom (info below)

Title: Data-Driven Document Unwarping


Abstract:
Capturing document images is a common way to digitize and record physical documents due to the ubiquitousness of mobile cameras. To make text recognition easier, it is often desirable to digitally flatten a document image when the physical document sheet is folded or curved. However, unwarping a document from a single image in natural scenes is very challenging due to the complexity of document sheet deformation, document texture, and environmental conditions. Previous model-driven approaches struggle with inefficiency and limited generalizability. In this thesis, I investigate several data-driven approaches to tackle the document unwarping problem.

Data acquisition is the central challenge in data-driven methods. I first design an efficient data synthesis pipeline based on 2D image warping and train DocUNet, the pioneering data-driven document unwarping model, on the synthetic data. A benchmark dataset is also created to facilitate comprehensive evaluation and comparison. To improve the unwarping performance by training on more realistic data, I introduce the Doc3D dataset and DewarpNet. Supervised by 3D shape ground truth in Doc3D, DewarpNet is significantly better than DocUNet. DocUNet and DewarpNet depend on the synthetic data for the ground truth deformation annotation. To exploit the real-world images, I propose PaperEdge, a weakly supervised model trained with in-the-wild document images with easy-to-obtain boundary information. PaperEdge surpasses DewarpNet by utilizing both the synthetic data and weakly annotated real data in the Document In the Wild (DIW) dataset. Finally, I propose directly predicting the $uv$ parameterized 3D mesh of the document with 3D constraints and using the accessible 3D presentations like depth maps as training targets. Predicting the 3D mesh of the document solves the unwarping task and also benefits VR/AR applications.

Join Zoom Meeting
https://stonybrook.zoom.us/j/96440592912?pwd=ZU5waTdyUzRFNW5SRHM5ME84TWdFQT09

Meeting ID: 964 4059 2912
Passcode: 793149
One tap mobile
+16468769923,,96440592912# US (New York)
+13017158592,,96440592912# US (Washington DC)

Dial by your location
        +1 646 876 9923 US (New York)
        +1 301 715 8592 US (Washington DC)
        +1 312 626 6799 US (Chicago)
        +1 253 215 8782 US (Tacoma)
        +1 346 248 7799 US (Houston)
        +1 408 638 0968 US (San Jose)
        +1 669 900 6833 US (San Jose)
Meeting ID: 964 4059 2912
Find your local number: https://stonybrook.zoom.us/u/adxTt9ZbuJ