Abstract: This talk shows how machine learning can address challenges in Astrophysics. We specifically focus on black hole simulations and supernova observations. First, we present a super-resolution technique for black hole simulations that avoids the need for high-resolution labels by leveraging the Hamiltonian and momentum constraints from general relativity. This method reduces constraint violations by one to two orders of magnitude. Next, we introduce Maven, a multimodal foundation model for supernova science. Using contrastive learning to align photometric and spectroscopic data, Maven achieves state-of-the-art results in classification and redshift estimation by pre-training on synthetic data and fine-tuning on real observations.

Bio: Thomas Helfer is a computational physicist specializing in deep learning and physics. Currently based at the Institute for Advanced Computational Science at Stony Brook University, Thomas was previously a postdoctoral fellow at Johns Hopkins and did his PhD with Eugene Lim at King's College in London. In his work, he looks to bridge topics; in his PhD, he bridged theoretical particle physics and gravitational waves. Now, in his postdoctoral work, he aims to find novel applications of deep learning in astrophysics.

*please note: this seminar will be held in a hybrid format*


Location: IACS Seminar Room OR Join Zoom Meeting
https://stonybrook.zoom.us/j/98617630652?pwd=tb4hplPgb3bTTifPCJTCcsn3P9vX8y.1

Meeting ID: 986 1763 0652
Passcode: 882994
West Campus - SAC- Student Activities Center - Ballrooms A & B 100 Nicolls Road Stony Brook NY 11794 Job Fair.jpg The Career Center invites Alumni Employers and Job Seekers to the IT/Computer Science Job and Internship Fair this spring. Job Seekers: A job fair is an opportunity for you to present yourself professionally in person to a potential employer, while showcasing your communication skills. Get more information Alumni Employers: Held in both the fall and spring semesters, this event is ideal for employers looking to fill internship, co-op, part-time and full-time opportunities in the field of information technology (i.e. Software Engineering, Network Administration, Web Development, etc.). Register here to recruit top SBU talent.
All are welcome to attend BMI grand rounds talk by Dr. Le Lu on 04/14. 

Le Lu, Ph.D 
Executive Director, PAII Inc 
Johns Hopkins University
IEEE Fellow, MICCAI Board Member


Time: Wednesday, April 14, 2021 3:00 pm - 4:00 pm 

Zoom Meeting 
https://stonybrook.zoom.us/j/95617197636?pwd=KytzZ2pVRG9SZGpKZUtpNXJISjNjZz09 
Meeting ID: 956 1719 7636 Passcode: 924293

Title: 
In Search of Effective and Reproducible Clinical Imaging Biomarkers for Population Health and Oncology Applications of Screening, Diagnosis and Prognosis

Bio: 
Le Lu received a PhD in 2007 from Johns Hopkins University. During his first six years at Siemens, he made significant contributions to the company's CT colonography and Lung CAD product lines. From 2013 to 2017, Dr. Lu served as a staff scientist in the Radiology and Imaging Sciences department of the National Institutes of Health Clinical Center. He then went on to found Nvidia's medical image analysis group and he held the position of senior research manager until June 2018. Since then, he has been the Executive Director at PAII Inc., Bethesda Research lab, Maryland, USA which has become one of the leading industrial research labs in medical imaging. He was the main technical leader for two of the most-impactful public radiology image dataset releases (NIH ChestXray14, NIH DeepLesion 2018). He won NIH Clinical Center Director Award in 2017, NIH Mentor of the year award in 2015, and won numerous best paper awards in MICCAI and RSNA from 2016 to 2020 (over 10000 citations). In 2021, He was elected into IEEE Fellow class cited for his contribution to machine learning for cancer detection and diagnosis, and MICCAI society board member (MICCAI-Industry Workgroup Chair). He is currently an Associate Editor for IEEE Trans. Pattern Analysis and Machine Intelligence and IEEE Signal Processing Letters. He has served as an Area Chair for recent MICCAI, AAAI, CVPR, WACV, ICIP and ICHI conferences for 14 times.

Abstract: 
This talk will first give an overall on the work of employing deep learning to permit novel clinical workflows in two population health tasks, namely using conventional ultrasound for liver steatosis screening and quantitative reporting; osteoporosis screening via conventional X-ray imaging and AI readers. These two tasks were generally considered as infeasible tasks for human readers, but as proved by our scientific and clinical studies and peer-reviewed publications, they are suitable for AI readers. AI can be a supplementary and useful tool to assist physicians for cheaper and more convenient/precision patient management. Next, the main part of this talk describes a roadmap on three key problems in pancreatic cancer imaging solution: early screening, precision differential diagnosis, and deep prognosis on patient survival prediction. (1) Based on a new self- learning framework, we train the pancreatic ductal adenocarcinoma (PDAC) segmentation model using a larger quantity of patients (≈1,000, four institutions), with a mix of annotated/unannotated venous or multi-phase CT images. Pseudo annotations are generated by combining two teacher models with different PDAC segmentation specialties on unannotated images, and can be further refined by a teaching assistant model that identifies associated vessels around the pancreas. Our approach makes it technically feasible for robust large-scale PDAC screening from multi-institutional multi-phase partially-annotated CT scans. (2) We propose a holistic segmentation-mesh classification network (SMCN) to provide patient-level diagnosis, by fully utilizing the geometry and location information. SMCN learns the pancreas and mass segmentation task and builds an anatomical correspondence-aware organ mesh model by progressively deforming a pancreas prototype on the raw segmentation mask. Our results are comparable to a multimodality clinical test that combines clinical, imaging, and molecular testing for clinical management of patients with cysts. (3) Accurate preoperative prognosis of resectable PDACs for personalized treatment is highly desired in clinical practice. We present a novel deep neural network for the survival prediction of resectable PDAC patients, 3D Contrast-Enhanced Convolutional Long Short-Term Memory network (CE- ConvLSTM), to derive the tumor attenuation signatures from CE-CT imaging studies. Our framework can significantly improve the prediction performances upon existing state-of-the-art survival analysis methods. This deep tumor signature has evidently added values (as a predictive biomarker) to be combined with the existing clinical staging system.

More information can be found at:
https://bmi.stonybrookmedicine.edu/sites/default/files/Lu_le_04_14.pdf
IACS Research Theme: Human Centered Computing Seminar

Abstract: The AI art platform Artbreeder hosts daily remix parties where users build on each other's work, creating transparent evolutionary chains of images from a single seed. This study analyzes 130,882 images from 368 remix parties to identify the drivers of novelty, complexity, and competitive success. The results reveal an interesting tension: while more novel parent images produce more novel and complex children and attract more likes, users paradoxically prefer to remix images that are less novel and complex. At the group level, larger remix parties produce more novelty at the cost of lower complexity. Additionally, images tend to converge towards common thematic attractors (e.g., steampunk scenes, alien architecture, furries) over the course of remix parties. These results provide quantitative insights into collective creativity--the production of novelty by groups of people--a typically opaque aspect of human cultural evolution.

Speaker: Dr. Mason Youngblood

Location: Institute for Advanced Computational Science, Seminar Room
Abstract: Graphs are a universal language of science. Molecules, materials, quantum systems, and knowledge bases can all be naturally represented as graphs. This talk explores how graph-based artificial intelligence is emerging as a powerful engine for scientific discovery. Using molecular design as a guiding example, we examine how modern graph AI enables machines not only to analyze complex scientific structures but also to generate new ones. We will discuss graph neural networks for learning predictive models of molecular properties, graph generative models for constructing novel chemical structures, and emerging multimodal graph-language models that support inverse design and synthesis planning. Together, these advances make graph AI more scalable, interpretable, and data-efficient--key capabilities for real-world scientific discovery. As artificial intelligence enters the era of foundation models, the next frontier lies in multimodal reasoning. Scientific knowledge is not purely textual; it is expressed through structures, code, and experimental data. By integrating graph representations with large language models, we move toward AI systems that can reason across multiple modalities and engage with scientific knowledge in its native forms. Looking ahead, we envision AI systems that behave less like tools and more like collaborators in the scientific process--generating hypotheses, designing candidate structures, planning experiments, interpreting results, and iteratively refining ideas through cycles of success and failure. In this vision, multimodal and agentic AI will enable scientists to explore vast and previously inaccessible design spaces, accelerating breakthroughs across domains ranging from drug discovery and materials innovation to software systems and quantum technologies.

Bio: Jie Chen is an interdisciplinary researcher working at the intersection of computing and mathematics, with a current focus on foundation models and AI agents for scientific discovery. His research integrates machine learning, statistics, scientific computing, and numerical linear algebra, with contributions spanning graph neural networks, multimodal graph LLMs, graph structure learning, scalable Gaussian processes, graph coarsening, and matrix functions. He is widely recognized for transformative contributions to graph-based deep learning and large-scale statistical modeling, and for bridging theory with real-world scientific and engineering applications. Dr. Chen has led externally funded, multi-institutional research programs supported by Shell, Evonik, and the U.S. Department of Energy, with applications in materials discovery, financial forensics, and power system resilience. He previously served as a Senior Research Scientist and Manager at IBM Research and the MIT-IBM Watson AI Lab, and as a Postdoctoral Fellow at Argonne National Laboratory. He has published extensively in top-tier AI, statistics, and applied mathematics venues, and his work has been recognized by multiple IBM Outstanding Technical Achievement Awards and the SIAM Student Paper Prize. He earned his Ph.D. in Computer Science from the University of Minnesota and his B.S. in Mathematics with honors from Zhejiang University.

Location: NCS 120

Zoom Link: https://stonybrook.zoom.us/j/98533029054?pwd=5FXO6lWGTJssCADEYkYbA7sjaacPRX.1

Meeting ID: 985 3302 9054

Passcode: 436997

Abstract:

Semantic segmentation, the task of assigning a semantic label to each pixel in an image, is a fundamental problem in the field of Computer Vision. with crucial applications in domains like autonomous driving, drone imagery and medical image analysis. Despite advancements in deep learning architectures, state-of-the-art models still heavily depend on large-scale pixel-level annotations, which are costly and time-consuming to acquire. To address this issue, Semi-Supervised Segmentation (SSS) has emerged as a promising solution, leveraging a small set of labeled images alongside a larger corpus of unlabeled data to reduce the annotation burden. In this proposal, I aim to investigate the challenges of SSS and propose approaches to address them. Existing SSS methods rely on a teacher-student framework to generate pseudo-labels for unlabeled images, which are then used for model training. However, this approach presents two major challenges. Pixel-level consistency fails to effectively capture contextual information, and pseudo-labels are noisy, especially in the early stages of training. To address the challenge of noisy pseudo-labels, existing methods rely on confidence-based thresholding to identify reliable pseudo-labels. However, during early training phases, when the model is poorly calibrated, this approach can select high-confidence but noisy pseudo-labels. To address this, we propose a novel approach that reduces reliance on model confidence to select reliable pseudo-labels. Our method employs an ensemble of a segmentation model and an object detection model to select more reliable pseudo-labels, which are then used to weight pseudo-labels using rank statistics, reducing the influence of noisy labels in training. Next, to address both the challenge of capturing contextual information and noisy pseudo-labels I introduce a novel Multi-scale Patch-based Multi-label Classifier (MPMC), which incorporates patch-level contextual information and reduces the impact of noisy pixel pseudo-labels by using the predictions of the patch-level Multi-label classifier to detect noisy labels, enhancing overall segmentation performance. While my work so far has focused on effectively utilizing unlabeled data to improve segmentation performance, as part of our future work, I will explore the use of textual information, such as category descriptions, for segmentation tasks. In limited labeled data scenarios it is more challenging to align visual features with textual features from large language models (LLMs).