AI is everywhere -- and so are the privacy concerns that come with it. At its core, the most common forms of AI we use today are online digital services -- and thus inherit the usual privacy risks of any internet-based tool. However, AI also introduces a set of unique and evolving risks. We'll take a closer look at one of the newest developments in this area: indirect prompt injection -- a technique that can trick AI tools into revealing or extracting private information. You'll learn how this emerging form of AI manipulation works, why it matters, and how to protect yourself -- as well as how similar techniques are being used in academic contexts to manipulate systems and even mislead researchers.
Each seminar will consist of multiple short talks (around 10 minutes) by multiple people. Students can register for 1 credit for CSE 656. Registered students must attend and present a minimum of 2 or 3 talks. Everyone else is welcome to attend. Fill in https://forms.gle/pCVXovgfMfQwGqG38 to subscribe to our mailing list for further announcement.
Speaker: Tanqiu Jiang
Where: NCS 220 and Zoom (https://stonybrook.zoom.us/j/
ABSTRACT
When we develop learning algorithms, what computational problems are we solving? In this talk, I discuss different answers that have been proposed for this question, and discuss some of the consequences for machine learning and artificial intelligence. The main lessons I offer are that (1) feasible solutions to learning problems require careful consideration of a target class C of functions, (2) that such a class C cannot include all functions, or even all computable functions, and so many logically possible functions must be outside of C and (3) class C must have significant structure which the solutions take advantage of. These main ideas are motivated and illustrated from modeling language acquisition and the related problem of grammatical inference from example sequences belonging to formal languages.
Topic: Responsible Artificial Intelligence: Promoting Health Equity for All
Speaker: Michael P. Cary, Jr., PhD, RN, FAAN.
Dr. Cary is a tenured Associate Professor at the Duke University School of Nursing. Dually trained as a health services researcher and applied health data scientist, Dr. Cary utilizes AI to investigate health disparities in aging populations, thereby promoting health equity and improving healthcare delivery. He co-directs HUMAINE™, an initiative dedicated to equipping nurses and healthcare professionals with the knowledge and skills necessary for the responsible use of AI in clinical practice.
Register: https://web.cvent.com/event/057978a5-a770-4de5-aca5-ad00287e4902/summary
When: Thu: 10/28/2021, 10 am
Where: NCS Room 220, or
Zoom: https://stonybrook.zoom.
Deep Surface MeshesPascal FuaEPFLGeometric Deep Learning has recently made striking progress with the advent of Deep Implicit Fields (SDFs). They allow for detailed modeling of watertight surfaces of arbitrary topology while not relying on a 3D Euclidean grid, resulting in a learnable 3D surface parameterization that is not limited in resolution. Unfortunately, they have not yet reached their full potential for applications that require an explicit surface representation in terms of vertices and facets because converting the SDF to such a 3D mesh representation requires a marching-cube algorithm, whose output cannot be easily differentiated with respect to the SDF parameters. In this talk, I will discuss our approach to overcoming this limitation and implementing convolutional neural nets that output complex 3D surface meshes while remaining fully-differentiable and end-to-end trainable. I will also present applications to single view reconstruction, physically-driven Shape optimization, and bio-medical image segmentation.
Bio:
Pascal Fua received an engineering degree from Ecole Polytechnique, Paris, in 1984 and a Ph.D. in Computer Science from the University of Orsay in 1989. He joined EPFL (Swiss Federal Institute of Technology) in 1996 where he is a Professor in the School of Computer and Communication Science and head of the Computer Vision Lab. Before that, he worked at SRI International and at INRIA Sophia-Antipolis as a Computer Scientist. His research interests include shape modeling and motion recovery from images, analysis of microscopy images, and Augmented Reality. He has (co)authored over 300 publications in refereed journals and conferences. He has received several ERC grants. He is an IEEE Fellow and has been an Associate Editor of IEEE journal Transactions for Pattern Analysis and Machine Intelligence. He often serves as program committee member, area chair, and program chair of major vision conferences and has cofounded three spinoff companies.
at International Love Data Week
sponsored by The Office of the Provost and
Educational and Institutional Effectiveness (EIE)
Special Talk and Panel Discussion
How I Learned to Stop Worrying and Love AI (For Now)
with Paul Fain from The Job and Work Shift
A reporter's take on what we know--and what we don't know--about AI's emerging impacts on the labor market. The discussion will include the latest research from economists and the AI labs themselves about how workers are using AI, and current thinking among experts on how the tech's rapid deployment will play out across job roles, industries, and regions.
Panel discussion to follow with:
- Lav Varshney, Della Pietra Infinity Professor and inaugural director of the AI Innovation Institute
- Nicholas Johnson, Director of AI, SBU Libraries
- Marianna Savoca, Associate Vice President for Career Readiness and Experiential Education
Limited Seats!
Registration is required.
Time: Friday 4/29, 2:40 PM
Location: NCS 120
Abstract:
Natural language processing (NLP) technology has supercharged many real-world applications ranging from intelligent personal assistants (like Alexa, Siri, and Google Assistant) to commercial search engines such as Google and Bing. But current NLP applications use extremely large neural models, making them (i) expensive to deploy on servers, requiring large amounts of compute resources and power, and (ii) impossible to run on mobile devices, making on-device, privacy-preserving applications impractical.
In the first part of the talk, I will describe systems optimizations we have developed that significantly reduce the compute and memory requirement of NLP models. The optimizations we developed can be applied broadly and results in over 10x reduction in latency when deployed on mobile devices. In the second part of the talk, I will describe our recent work on predicting energy consumption of NLP models. Existing energy prediction approaches are not accurate, making it difficult for developers and practitioners to reason about their models in terms of power. We use a multi-level regression approach that produces highly accurate and interpretable energy predictions.
Bio:
Aruna Balasubramanian is an Associate Professor at Stony Brook University. She received her Ph.D from the University of Massachusetts Amherst, where her dissertation won the UMass outstanding dissertation award and was the SIGCOMM dissertation award runner up. She works in the area of networked systems. Her current work consists of two threads: (1) significantly improving Quality of Experience of Internet applications, and (2) improving the usability, accessibility, and privacy of mobile systems. She is the recipient of the SIGMobile Rockstar award, a Ubicomp best paper award, a Computing Innovation Fellowship, a VMWare Early Career award, several Google research awards, an
Each seminar will consist of multiple short talks (around 10 minutes) by multiple people. Students can register for 1 credit for CSE 656. Registered students must attend and present a minimum of 2 or 3 talks. Everyone else is welcome to attend. Fill in https://forms.gle/pCVXovgfMfQwGqG38 to subscribe to our mailing list for further announcement.
Time: Thursday, Feb 4, 11:30am - 1:00pm
Zoom:
https://stonybrook.zoom.us/j/
Meeting ID: 856 364 6526
Passcode: 203791
Abstract:
Over the past decade, larger datasets, hardware accelerations, and network architecture improvements have contributed to phenomenal achievements in many tasks of computer
vision. However, in the absence of large datasets, computer vision models struggle to learn
general representations which results in poor performance. Few-shot learning tries to address
this problem by proposing models which learn from a few examples.
I first give an overall review of few-shot learning methods. I particularly focus on generative Few Shot Learning(FSL) methods, which augment the scarce categories in a dataset by generating samples for those rare categories. As the actual class distribution can be complex and lie very close to each other, the sample generated for one class can be noisy or lie close to another class. However, none of the current FS generative methods perform any form of quality control of the generated samples.
In this work, I propose to identify and remove the generated samples that are less likely to be in the distribution of the few-shot class. Here I particularly deal with few-shot scenarios where the
prior information of the relationship between the classes based on visual similarity is available. The main idea is to exploit these priors to better identify the unreliable generated samples.
Particularly, I have proposed two methods based on class relationship to detect noisy generated samples. In the first method, we assume that the embedding space of each class follows a Gaussian distribution. From this assumption, I propose Gaussian Neighborhood (GN), a method to estimate how likely a generated sample is drawn from the estimated distribution of a few-shot class. We evaluate this method on the Hematopoiesis dataset. By simply eliminating samples based on thresholding our proposed GN scores, the few-shot classification performance is improved by 5% and 2% in five shot and one shot respectively, compared to the model trained on all generated images.
The GN scores represent the similarity distances from the generated samples to their classes, based on the assumption that each class is a Gaussian distribution. However, this assumption might be strict in many scenarios since the real distributions of data can be arbitrarily complex. Thus in my second proposed method, I aim to learn such similarity distances directly from data via metric learning. I propose to train a deep-network to regress the similarity distance between a pair of samples. This network is trained using both the class-level visual similarity information and the class labels. This method improves the 1-shot and 5-shot classification performances by 0.5% and 1% respectively, compared to GN.