Postmortem Program Analysis from a Conventional Program Analysis Method to an AI-assisted Approach

Abstract: Despite the best efforts of developers, software inevitably contains flaws that may be leveraged as security vulnerabilities. Modern operating systems integrate various security mechanisms to prevent software faults from being exploited. To bypass these defenses and hijack program execution, an attacker needs to constantly mutate an exploit and make many attempts. While in their attempts, the exploit triggers a security vulnerability and makes the running process abnormally terminate.

After a program has crashed and abnormally terminated, it typically leaves behind a snapshot of its crashing state in the form of a core dump. While a core dump carries a large amount of information, which has long been used for software debugging, it barely serves as informative debugging aids in locating software faults, particularly memory corruption vulnerabilities. As such, previous research mainly seeks fully reproducible execution tracing to identify software vulnerabilities in crashes. However, such techniques are usually impractical for complex programs. Even for simple programs, the overhead of fully reproducible tracing may only be acceptable at the time of in-house testing.

In this talk, I will discuss how we tackle this issue by bridging program analysis with artificial intelligence (AI). More specifically, I will first talk about the history of postmortem program analysis, characterizing and disclosing their limitations. Second, I will introduce how we design a new reverse-execution approach for postmortem program analysis. Third, I will discuss how we integrate AI into our reverse-execution method to escalate its analysis efficiency and accuracy. Last but not least, as part of this talk, I will demonstrate the effectiveness of this AI-assisted postmortem program analysis framework by using massive amounts of real-world programs.

Bio: Dr. Xinyu Xing is an Assistant Professor at Pennsylvania State University. His research interests include exploring, designing and developing new program analysis and AI techniques to automate vulnerability discovery, failure reproduction, vulnerability diagnosis (and triage), exploit and security patch generation. His past research has been featured by many mainstream media and received the best paper awards from ACM CCS and ACSAC. Going beyond academic research, he also actively participates and hosts many world-class cybersecurity competitions (such as HITB and XCTF). As the founder of JD-OMEGA, his team has been selected for DEFCON/GeekPwn AI challenge grand final at Las Vegas. Currently, his research is mainly supported by NSF, ONR, NSA and industry partners.

Abstract: Much like other AI for Science domains, polymer design poses significant challenges. It requires grounding in empirical data and physical laws, precise handling of domain-specific structured representations, and compositional reasoning over multiple interacting constraints--all while working with limited data.

To address these limitations, we introduce PolyBench, a large-scale benchmark comprising over 125K polymer design and analysis tasks grounded in verified experimental and synthetic data. PolyBench includes tasks created from a wide range of data sources and presents diverse structural, property-driven, and synthesis-oriented reasoning problems. Tasks in PolyBench are organized from simple to complex analytical reasoning problems, enabling generalization tests and includes diagnostic probes to evaluate model capabilities. Additionally, to support effective domain alignment, we propose a knowledge-augmented reasoning distillation framework that enriches the dataset with structured chain-of-thought supervision derived from expert-informed reasoning strategies.

Small language models (7B-14B parameters) trained on PolyBench substantially outperform comparably sized baselines and, in many cases, exceed the performance of larger closed-source frontier models on polymer reasoning tasks, while also demonstrating improved transfer to external polymer benchmarks. Last, we conduct a diagnostic study that reveals a compositionality gap: despite strong performance on decomposed sub-questions, models struggle to integrate multiple interacting constraints and intermediate reasoning steps, highlighting fundamental limitations in current scientific language models.

Speaker: Dikshya Mohanty

Location: NCS 115/Online

Zoom: https://stonybrook.zoom.us/j/94746001760?pwd=BCAd8gu7cXLn3PXM6kkbh11V6r0Mr7.1
Meeting ID: 947 4600 1760 Passcode: 987917

Abstract: As computing and society become increasingly inseparable, we confront a fundamental design challenge: creating AI systems where human-machine interactions authentically embody our diverse values while thoughtfully evolving our social relationships. The recursive nature of these interactions--where human behavior shapes technology design and technological affordances influence human behavior--presents both profound risks and transformative opportunities as we reimagine our collective digital future. What interaction patterns emerge when algorithmic systems become active participants in societal decision-making? How can we design human-AI collaboration that ensures algorithmic systems align with diverse community values while serving the public interest? Through Public Interest AI, we explore a Pluralistic Design Language that creates interaction models for value-sensitive algorithmic ecosystems, strengthening AI-society alignment in both technology design and policy development. Through collaborative interaction with communities, we create systems that augment human capabilities while embedding ethical principles into the sociotechnical design of AI itself--ultimately redefining possibilities at the intersection of technology, policy, and society. This talk will examine the challenges of designing meaningful human-AI systems within social contexts through real-world applications that combine value-sensitive interaction design, human-inspired computing, and societal development to create technologies that advance our shared commitment to the public good.

Bio: Neil Gaikwad is an Assistant Professor of Data Science and Computer Science at UNC Chapel Hill. Additionally, he serves on the Faculty Advisory Council of the UNC Parr Center for Ethics and is a Fellow at the MIT Dalai Lama Center for Ethics and Transformative Values. Neil holds a Ph.D. in Society-Centered AI from MIT and is an alumnus of Carnegie Mellon University's School of Computer Science. Neil's scholarship, published in prominent AI and HCI conferences, has been recognized with several prestigious honors, including the Facebook Research Fellowship, UIST Best Paper Honorable Mention, MIT Engineering Fellowship, Human Rights & Technology Fellowship, Graduate Teaching Award, and the Karl Taylor Compton Prize, MIT's highest student honor. He has been recognized as a Rising Star by both Stanford University and the University of Chicago. Translating research into real-world impact, Neil is a dedicated educator and mentor who has taught over 500 students throughout his career. He has guided more than 30 students to publish influential papers on AI fairness, secure prestigious fellowships, and contribute to shaping AI policy through public interest research. Neil is also the founder of the AI Policy Global Initiative, which has successfully brought together academia, industry, government, and communities to address critical challenges in AI governance and develop collaborative approaches to responsible AI.

Location: Old Computer Science, room 1310

The Provost's Office is excited to invite you to join in responding to an extraordinary opportunity to enhance our academic and research capabilities in AI at Stony Brook. SUNY recently made funding available to support the creation of departments of AI and Society at its universities. Stony Brook is well-positioned to seize this opportunity to build upon our interdisciplinary strengths in AI.

The office is hosting a forum on Friday, Nov. 15, from 11:30 a.m. to 1:30 p.m., in Ballroom A, SAC. You are invited to attend to learn more about this opportunity and to help us generate ideas to build a compelling proposal for Stony Brook to submit to SUNY. Lunch will be provided.

Please click here to RSVP as soon as possible.

This funding will support innovation in our curriculum, allowing us to create programs that explore the social and societal impact of AI alongside the technological advancements led by researchers in engineering and scientific disciplines.

We believe we can make a significant impact through this SUNY program and look forward to your participation in this initiative.
Face Editing with Machine Learning presented by Zhixin Shu

ABSTRACT: The face is the most informative feature of humans and has been a long-standing research topic in Computer Vision and Graphics. Images of faces are also ubiquitous in photography and social media, and people have devoted significant resources to capturing and editing face images. Face editing can be broadly viewed as the encoding, manipulation and the decoding of some representations for face images. The challenges are that we want to manipulate an image in a controllable way and generate results that are both desirable and as realistic as possible. This thesis explores different Machine Learning-based face-editing approaches. I discuss the role of machine learning for achieving desirable edits by learning both the physical aspects as well as the statistical manifold of human faces. In my work for eye-editing, I discuss the importance of understanding multiple physical elements of a face image, such as shape, illumination, pose, etc. In a deep-learning-based approach, I introduce image formation domain knowledge to the construction and training of a neural network. This network provides transparent access to the disentangled representations of the aforementioned physical properties. With this network, we can achieve various face editing tasks in forms of representation manipulation. After that, I introduce Deforming Autoencoders, a network that learns to disentangle shape and appearance in an unsupervised manner. This disentanglement is beneficial for the learning of some other factors of variations, such as illumination and facial expression. In an extension of Deforming Autoencoders, we incorporate non-rigid structure-from-motion to learn a 3D morphable model for faces that only requires an image set for training. At last, I describe an image-to-image network for 3D face reconstruction, which also utilizes structure-from-motion in deep learning. With real face images in training, this network not only reconstructs 3D faces more accurately than prior art but also has better generalization ability in real-life testing cases.
The overall purpose of this seminar is to bring together people with interests in Computer Vision theory and techniques and to examine current research issues. This course will be appropriate for people who already took a Computer Vision graduate course or already had research experience in Computer Vision. To enroll in this course, you must either: (1) be in the PhD program or (2) receive permission from the instructors.

Each seminar will consist of multiple short talks (around 10 minutes) by multiple people. Students can register for 1 credit for CSE 656. Registered students must attend and present a minimum of 2 or 3 talks. Everyone else is welcome to attend. Fill in https://forms.gle/pCVXovgfMfQwGqG38 to subscribe to our mailing list for further announcement.

Speaker: Gary Kazantsev (Head of Quant Technology Strategy in the Office of the CTO at Bloomberg)

 

Date/Time: Friday, October 15, 2021 10:00AM-11:00AM EST

 

Title: Machine Learning in Finance

Abstract: Machine learning is changing our world at an accelerating pace. In this talk we will discuss the recent developments in how machine learning and artificial intelligence are changing finance, from a perspective of a technology company which is a key  participant in the financial markets. We will give an overview and discuss the evolution of selected flagship Bloomberg ML and AI projects, such as sentiment analysis, question answering, social media analysis, information extraction and prediction of market impact of news stories. We will discuss practical issues in delivering production machine learning solutions to problems of finance, highlighting issues such as interpretability, privacy and nonstationarity. We will also discuss current research directions in machine learning for finance. We will conclude with a Q&A session.

Bio: (https://www.techatbloomberg.com/people/gary-kazantsev/) Gary is the Head of Quant Technology Strategy in the Office of the CTO at Bloomberg. Prior to taking on this role, he created and headed the company's Machine Learning Engineering group, leading projects at the intersection of computational linguistics, machine learning and finance, such as sentiment analysis of financial news, market impact indicators, statistical text classification, social media analytics, question answering, and predictive modeling of financial markets.

Prior to joining Bloomberg in 2007, Gary had earned degrees in physics, mathematics, and computer science from Boston University.

He is engaged in advisory roles with FinTech and Machine Learning startups and has worked at a variety of technology and academic organizations over the last 20 years. In addition to speaking regularly at industry and academic events around the globe, he is a member of the KDD Data Science + Journalism workshop program committee and the advisory board for the AI & Data Science in Trading conference series. He is also a co-organizer of the annual Machine Learning in Finance conference at Columbia University.


Join Zoom Meetinghttps://stonybrook.zoom.us/j/93374426887?pwd=cE9zeW51VXFEN2R0YnNPbHF1WFp0Zz09Meeting ID: 933 7442 6887Passcode: 330347One tap mobile+16468769923,,93374426887# US (New York)+13126266799,,93374426887# US (Chicago)Dial by your location +1 646 876 9923 US (New York) +1 312 626 6799 US (Chicago) +1 301 715 8592 US (Washington DC) +1 346 248 7799 US (Houston) +1 408 638 0968 US (San Jose) +1 669 900 6833 US (San Jose) +1 253 215 8782 US (Tacoma)Meeting ID: 933 7442 6887

The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning.

ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics.

ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.


For more information and registration, visit the official website.

Scaling the NY AI Innovation Ecosystem

The State University of New York at Stony Brook will bring together leading AI experts to promote a future where AI drives responsible progress. This two-day event will provide a significant opportunity to explore the future of AI, exchange ideas, and connect with those at the forefront of research and deployment. We invite faculty, staff, and students from all SUNY institutions and beyond, as well as industry AI practitioners and policymakers to attend.

Recognized AI experts from academia, industry, and government will present on topics such as AI applications, innovative developments in research and technology, workforce development, as well as ethical and societal impacts.

A 90-minute poster session is included in the schedule. If you would like to submit an abstract for consideration, please see the Call for Abstracts. The poster session segment of the symposium will be held in honor of the Inauguration of Dr. Andrea Goldsmith, the State University of New York at Stony Brook's seventh President. Poster printing for all participants will be covered by the Inauguration Planning Committee. SUNY students presenting posters are also eligible for travel reimbursement.

We kindly ask faculty to encourage their students to attend and to submit their work for presentation.

For additional information and to register, visit the symposium website. Please direct any questions to suny-ai-symposium-sbu@stonybrook.edu.

Register.

Abstract: Sub-grid turbulence is challenging to resolve in climate models; therefore, it is parameterized. Traditionally, turbulent parameterizations have relied on physics-based and equation-based approaches. However, ad hoc and uncertain components in these parameterizations introduce uncertainty in future climate predictions. Recently, data-driven techniques have emerged as an alternative for modeling sub-grid fluxes. I will demonstrate the use of machine learning to model vertical turbulent fluxes in the ocean surface boundary layer and its impact on reducing biases in NOAA's Geophysical Fluid Dynamics Laboratory ocean climate model.

I will show how neural networks, trained to predict the eddy diffusivity profile from high-fidelity yet computationally expensive turbulence schemes, enhance the vertical mixing scheme in the climate model. These networks replace ad hoc components while maintaining the conservation principles of the standard ocean model equations. The enhanced scheme outperforms its predecessor by reducing biases in the mixed-layer depth and modestly improving tropical upper-ocean stratification in ocean-only global simulations. Furthermore, simplified equations that can replace the neural networks show similar improvements but with lower computational cost and better interpretability. They point to structural deficiencies in the baseline parameterization. This work is one of the first successful applications of machine learning to improve a sub-grid parameterization of turbulent mixing in ocean climate models.

IACS Seminar Speaker: Aakash Sane, Princeton University

Location: IACS Seminar Room or Zoom

Join Zoom Meeting: https://stonybrook.zoom.us/j/97764942108?pwd=MzCWupCe3L9mKdrgfO2bJg3GBbvXuf.1
Meeting ID: 977 6494 2108
Passcode: 519324