Each seminar will consist of multiple short talks (around 10 minutes) by multiple people. Students can register for 1 credit for CSE 656. Registered students must attend and present a minimum of 2 or 3 talks. Everyone else is welcome to attend. Fill in https://forms.gle/pCVXovgfMfQwGqG38 to subscribe to our mailing list for further announcement.
The program will fund projects for up to a one-year period, depending on the availability of funds. AI^3 anticipates making at least six awards on this call. A one-year, no-cost extension can be requested in the final 6 months of a project with approval subject to progress towards project goals and active participation in research themes.
Competitive applications will actively incorporate modern AI technologies into the work; integrate students; document significant potential for future funding or other growth-oriented outcomes; and highlight innovations.
The 2024 application deadline will be October 15, at 11:59 PM EST. Recipients will be notified by December 20, and projects are anticipated to commence at the start of the Spring 2025 semester.
Abstract:
We often talk about AI as if it begins with a dataset and ends with an application. But behind every model lie two sets of actors who are rarely acknowledged in technical documentation: the workers who train AI systems and the researchers who try to make sense of them. This talk brings both groups into view.
Dr. Ben Zhang will offer an on-the-ground examination of the prevailing values and invisible labor that underpin commercial AI production and data production. Drawing on ethnographic research inside AI data annotation centers in China, he introduces the concept of precision labor to unpack the labor dimension of constructing, managing, and performing technical accuracy. This concept highlights the hidden and excessive labor required to reconcile the ambiguity and uncertainty involved in AI training. A precision labor lens challenges the legitimacy and sustainability of the relentless pursuit of technical accuracy, raising new questions about its consequences and implications.
On the other end of the pipeline, as LLMs become embedded in society, social scientists like Dr. Jieshu Wang is scrutinizing their potential biases while employing them as research tools. She will present her recent work auditing LLM responses across different contexts, revealing that LLMs exhibit varying levels of environmental awareness and disproportionately reward institutional prestige in peer-review simulations. She also demonstrates how LLMs can serve as useful tools in social-science pipelines, e.g., extracting location information, inferring demographics, parsing citations, mapping social networks, and analyzing occupational data.
By placing these two worlds side by side - the labor of training AI and the scholarly efforts to study it - we show why responsible AI should go beyond the deployment phase - emphasizing fairness audits, and model explainability. It requires reimaging the values, labor regimes, and social science practices that shape AI systems from annotation to analysis.
Bios:
Dr. Jieshu Wang is an interdisciplinary researcher studying the human and social dimensions of artificial intelligence (AI) and how people can thrive in an AI-integrated future. She combines computational methods with qualitative insights to trace technology trends and understand their broader societal impact. She earned her Ph.D. in Human and Social Dimensions of Science and Technology from Arizona State University, after earlier degrees in Civil Engineering, Economics, and Science and Technology Studies. She has also worked as a patent examiner, an editor at a popular science magazine, and co-founded Synced (机器之心), an AI-focused media company in China. Her research looks both backward and forward. Backward-looking, she examines how AI are created, who creates them, and who is missing from the process. Forward-looking, she studies how AI is transforming the way we live, connect, invent, work, and adapt, as well as how AI might help address challenges such as climate change and workforce transitions.
Dr. Ben Zhang is an Assistant Professor in the Department of Technology. His research explores the production and sociotechnical impacts of AI systems in critical areas such as work, health, and sustainability. Drawing from his background in Human-Computer Interaction (HCI), Human-Centered AI, and Science and Technology Studies (STS), he employs a life-cycle-centered approach to holistically examine the promises and harms of these systems and to inform the design of responsible AI infrastructures across their development, deployment, and governance. Ben received his Ph.D. in Information Science from the University of Michigan. Ben's work has been supported by competitive awards and fellowships, including the University of Michigan Rackham Predoctoral Fellowship and the Weizenbaum Fellowship. His research has appeared in premier computing venues, including ACM CHI, ACM CSCW, and AAAI ICWSM.
Location: NCS 120
Please register for the STEM Speaker Series: Can a Machine learn Chemistry here.
Each seminar will consist of multiple short talks (around 10 minutes) by multiple people. Students can register for 1 credit for CSE 656. Registered students must attend and present a minimum of 2 or 3 talks. Everyone else is welcome to attend. Fill in https://forms.gle/pCVXovgfMfQwGqG38 to subscribe to our mailing list for further announcement.
Abstract: As intelligent systems become more integrated into human environments, fostering trustworthy human-AI collaboration presents a pressing challenge. In this talk, I examine the interplay between an agent's performance and social dynamics in shaping trust in human-AI interactions. My approach combines testbed development, behavioral prototyping, and user study design to create controlled experimental setups that capture real-world interaction complexities, such as ambiguity, multi-agent dynamics, and conflicting goals.
I illustrate this with a recent VR study on multi-user interaction with an autonomous vehicle (AV). Moving beyond dyadic interactions, the study probes human perspectives from the roles of a pedestrian, driver, and AV passenger, all interacting with the AV simultaneously at an ambiguous all-way stop sign intersection. We compare interactions with efficient and prosocial AV behavior strategies, revealing diverging trust perceptions and preferences across user roles. These insights inform a broader research trajectory focused on balancing performance with social considerations in designing trustworthy human-AI collaborations.
Bio: JiHyun Jeong is a postdoctoral researcher at Cornell University working on human-computer interaction and human-robot interaction. Her research develops prototypes and methods to explore performance and social factors that influence collaboration and trust between humans and artificial agents. She holds a Ph.D. and MPS in Information Science from Cornell University, and a BSc in Computer Science and Engineering from Korea University. She is a recipient of an honorable mention for best paper at DIS.
Zoom: https://stonybrook.zoom.
Meeting ID: 987 3823 4619
Passcode: 474618
4-5pm, Dec 17 2020
https://stonybrook.zoom.us/j/9
The molecular mechanisms and functions in complex biological systems
currently remain elusive. Recent high-throughput techniques, such as
next-generation sequencing, have generated a wide variety of
multiomics datasets that enable the identification of biological
functions and mechanisms via multiple facets. However, integrating
these large-scale multiomics data and discovering functional insights
are, nevertheless, challenging tasks. To address these challenges,
machine learning has been broadly applied to analyze multiomics. In
particular, multiview learning is more effective than previous
integrative methods for learning data's heterogeneity and revealing
cross-talk patterns. Although it has been applied to various contexts,
such as computer vision and speech recognition, multiview learning has
not yet been widely applied to biological data--specifically,
multiomics data. Therefore, we have developed a framework called
multiview empirical risk minimization (MV-ERM) for unifying multiview
learning methods (Nguyen, et al., PLoS Computational Biology, 2020).
MV-ERM enables potential applications to understand multiomics
including genomics, transcriptomics, and epigenomics, in an aim to
discover the functional and mechanistic interpretations across omics.
Based on MV-ERM, we have developed the following methods:
ManiNetCluster, Varmole and ECMarker.
(1) ManiNetCluster (Nguyen, et al., BMC Genomics, 2019) is a manifold
learning method which simultaneously aligns and clusters gene networks
(e.g., co-expression) to systematically reveal the links of genomic
function between different phenotypes. Specifically, ManiNetCluster
employs manifold alignment to uncover and match local and non-linear
structures among networks, and identifies cross-network functional
links. We demonstrated that ManiNetCluster better aligns the
orthologous genes from their developmental expression profiles across
model organisms than state-of-the-art methods. This indicates the
potential non-linear interactions of evolutionarily conserved genes
across species in development. Furthermore, we applied ManiNetCluster
to time series transcriptome data measured in the green alga
Chlamydomonas reinhardtii to discover the genomic functions linking
various metabolic processes between the light and dark periods of a
diurnally cycling culture;
(2) Varmole (Nguyen, et al., Bioinformatics, 2020) is an interpretable
deep learning method that simultaneously reveals genomic functions and
mechanisms while predicting phenotype from genotype. In particular,
Varmole embeds multi-omic networks into a deep neural network
architecture and prioritizes variants, genes and regulatory linkages
via biological drop-connect without needing prior feature selections.
With an application to schizophonia, we demonstrate that Varmole
provides an effective alternative for recent statistical methods that
associate functional omic data (e.g. gene expression) with genotype
and phenotype and that link variants to individual genes in population
studies such as genome-wide association study;
(3) ECMarker (Jin*, Nguyen*, et al., Bioinformatics, 2020) is an
interpretable and scalable machine learning model that predicts gene
expression biomarkers for disease phenotypes and simultaneously
reveals underlying regulatory mechanisms. Particularly, ECMarker is
built on the integration of semi- and discriminative- restricted
Boltzmann machines, a neural network model for classification allowing
lateral connections at the input gene layer. With application to the
gene expression data of non-small cell lung cancer (NSCLC) patients,
we found that ECMarker not only achieved a relatively high accuracy
for predicting cancer stages but also identified the biomarker genes
and gene networks implying the regulatory mechanisms in lung cancer
development.
Finally, we propose a novel multiview learning method, Malignomics, to
predict phenotypes from heterogeneous multi-omic features. Malignomics
will first align multi-omic features by deep manifold alignment onto a
common latent space, better predicting nonlinear relationships across
omics. This deep alignment aims to preserve both global consistency
and local smoothness across omics and reveal higher-order nonlinear
interactions (i.e., manifolds) among cross-omic features. Second, it
uses these manifold structures to regularize the classifiers for
predicting phenotypes. This manifold-regularization allows
highlighting cross-omic feature manifolds and prioritizing the
features and interactions for the phenotypes. The prioritized
multi-omic features will further reveal underlying phenotypic
functions and mechanisms and thus enhance the biological
interpretation of Malignomics. We will apply Malignomics to
multi-omics data in neuropsychiatric disorders, and prioritize gene
regulatory networks linking risk variants, regulatory elements, and
genes for the disorders. We will also compare Malignomics with the
state-of-the-arts, and investigate how the manifold regulation will
potentially improve understanding of multi-omics functions and
predicting diseases.