Unlock the power of AI in your job search! Join the Head of Indeed Job Search Academy and AI experts as they explore how to leverage cutting-edge AI tools to optimize your job search activities, enhance your resume, prepare for interviews, and conduct thorough
career research, as well as answer all your AI-related questions.
This virtual watch party session will equip you with the knowledge to stand out in today's competitive market.

https://forms.gle/TtWu3iDh9bmU3niD6
The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision. To enroll in this course, you must either: (1) be in the PhD program or (2) receive permission from the instructors.

Each seminar will consist of multiple short talks (around 10 minutes) by multiple people. Students can register for 1 credit for CSE 656. Registered students must attend and present a minimum of 2 or 3 talks. Everyone else is welcome to attend. Fill in https://forms.gle/pCVXovgfMfQwGqG38 to subscribe to our mailing list for further announcement.
ICB&DD 19th Annual Symposium

Iwao Ojima, Director, ICB&DD
Ivet Bahar Chair, Organizing Committee
Dima KozakovCo-Chair, OrganizingCommittee

There will be poster sessions on projects conducted in the ICB&DD member's laboratories aswell as other laboratories in the area. Awards will be given to the best three posters.

Please see the link for the registration and poster sessions in:
https://www.stonybrook.edu/commcms/icbdd/https://forms.gle/Wh4UzVx9U4HWStXb8
Abstract:

Recent advances in deep learning have significantly enhanced the capabilities of Natural Language Processing (NLP) and Vision-Language Models (VLMs). However, these advancements come with increased vulnerabilities, notably through backdoor attacks that pose severe security threats. This thesis addresses two critical dimensions of Trustworthy AI and Efficient Multimodal Representation Learning: (1) security through analyzing, detecting, and designing backdoor attacks in NLP and VLMs, and (2) efficiency through advanced multimodal representation methods tailored for clinical and medical imaging applications.

In the first dimension, we explore the internal mechanisms exploited by backdoor attacks, identifying the distinctive phenomenon of attention focus drifting in compromised transformer models, where trigger tokens consistently hijack attention. Leveraging these insights, we propose robust detection frameworks, including the attention-based Trojan detector (AttenTD) and a task-agnostic logit-based detection method (TABDet), achieving effective identification of backdoored NLP models across diverse tasks. We further introduce novel backdoor attack methodologies: the Trojan Attention Loss (TAL), enhancing attack efficiency and stealth through direct attention manipulation, and BadCLM, demonstrating critical vulnerabilities in clinical decision-support systems by effectively compromising clinical language models.

Extending our security exploration to multimodal settings, we investigate backdoor attacks on Vision-Language Models (VLMs), particularly in complex image-to-text generation tasks, proposing innovative techniques (TrojVLM, VLOOD) capable of embedding backdoors without direct access to original training data, thus showcasing practical risks in real-world scenarios.

In the second dimension, we address efficiency and interpretability challenges in clinical and pathology applications. We introduce TCP-LLaVA, the first multimodal large language model (MLLM) designed explicitly for Whole Slide Image (WSI) Visual Question Answering (VQA). Utilizing a novel token compression mechanism inspired by transformer-based models, TCP-LLaVA substantially reduces computational resource consumption while maintaining superior VQA performance across multiple tumor subtypes. Additionally, we present a multimodal transformer model integrating structured Electronic Health Records (EHR) with clinical notes, demonstrating enhanced predictive accuracy and interpretability for in-hospital mortality prediction through integrated gradient-based interpretability methods.

Together, these contributions present a comprehensive approach to ensuring AI models are not only secure against malicious manipulation but also efficient and interpretable for critical clinical applications, underscoring the essential need for trustworthy and effective AI systems.

Speaker: Weimin Lyu

Zoom: https://stonybrook.zoom.us/j/2392326575?pwd=SVQ2VkFXTnZZYmJUMXgvTXBuZWM3UT09

Meeting ID: 239 232 6575
Passcode: 436192
Title: Class visual similarity based noisy sample removal in generative Few Shot Learning
Time: Thursday, Feb 4, 11:30am - 1:00pm
Zoom:
https://stonybrook.zoom.us/j/8563646526?pwd=anJna1gzUStXNlNVSUIzdDRUSC9CUT09

Meeting ID: 856 364 6526
Passcode: 203791



Abstract:  

Over the past decade, larger datasets, hardware accelerations, and network architecture improvements have contributed to phenomenal achievements in many tasks of computer

vision. However, in the absence of large datasets, computer vision models struggle to learn

general representations which results in poor performance. Few-shot learning tries to address 

this problem by proposing models which learn from a few examples.


I first give an overall review of few-shot learning methods. I particularly focus on generative Few Shot Learning(FSL) methods, which augment the scarce categories in a dataset by generating samples for those rare categories. As the actual class distribution can be complex and lie very close to each other, the sample generated for one class can be noisy or lie close to another class.  However, none of the current FS generative methods perform any form of quality control of the generated samples.


In this work, I propose to identify and remove the generated samples that are less likely to be in the distribution of the few-shot class. Here I particularly deal with few-shot scenarios where the

prior information of the relationship between the classes based on visual  similarity is available. The main idea is to exploit these priors to better identify the unreliable generated samples.


Particularly, I have proposed two methods based on class relationship to detect noisy generated samples. In the first method, we assume that the embedding space of each class follows a Gaussian distribution.  From this assumption, I propose Gaussian Neighborhood (GN), a method to estimate how likely a generated sample is drawn from the estimated distribution of a few-shot class.  We evaluate this method on the Hematopoiesis dataset.  By simply eliminating samples based on thresholding our proposed GN scores, the few-shot  classification  performance  is  improved by 5% and 2% in five shot and one shot respectively, compared to the model trained on all generated images. 


The GN scores represent the similarity distances from the generated samples to their classes, based on the assumption that each class is a Gaussian distribution.  However, this assumption might be strict in many scenarios since the real distributions of data can be arbitrarily complex.  Thus in my second proposed method, I aim to learn such similarity distances directly from data via metric learning. I propose to train a deep-network to regress the similarity distance between a pair of samples.  This network is trained using both the class-level  visual  similarity  information  and  the  class  labels.   This method improves the 1-shot and 5-shot classification performances by 0.5% and 1% respectively, compared to GN.

Learn how these two AI tools will help you this year. AI has been all over, but figuring out the tools that we may use is critical. Background remover of images and a replacement for Google Search may disrupt the industry this year. Learn and refresh your knowledge about these tools.
University Libraries Presents: The Library AI Club is a welcoming space for students, faculty, and staff to explore AI in a supportive, low-pressure environment. Meeting every two weeks, the club features discussions, collaborative projects, guest speakers, and hands-on experiments. Join us to learn, share ideas, and engage with AI responsibly and creatively. We'd love to see you at an upcoming meeting! Location: Melville Library, Scholarly Communication Seminar Room