Learn how to prompt AI to help clean datasets and write formulas in Google Sheets.

When you have a messy dataset, it can take a lot of time to clean it up before you can start analyzing. Can AI help? In this workshop, we'll collect live data and then use Gemini AI (the stand alone tool) to help clean up the data. Then, we'll use it to help do some analysis. Because we'll be working with live data live in Gemini, we don't know exactly what will happen, but that's the reality of data and data cleaning!

In this session, you will

  1. Craft effective AI prompts to generate Google Sheets formulas for data analysis and manipulation
  2. Utilize Gemini to develop regular expression formulas to extract, reformat, clean text-based data
  3. Develop formulas for numerical analysis using Gemini AI

https://stonybrookuniversity.co1.qualtrics.com/jfe/form/SV_dht1o3rNzlZhHka?source=event+manager&session=0815250900sheets
Abstract: DeepSeek-R1-Zero has shown that reinforcement learning (RL) at scale can directly enhance the reasoning capabilities of LLMs without supervised fine-tuning. In this work, we critically examine R1-Zero-like training by analyzing its two core components: base models and RL. We investigate a wide range of base models, including DeepSeek-V3-Base, to understand how pretraining characteristics influence RL performance. Our analysis reveals that DeepSeek-V3-Base already exhibit ''Aha moment'', while Qwen2.5 base models demonstrate strong reasoning capabilities even without prompt templates, suggesting potential pretraining biases. Additionally, we identify an optimization bias in Group Relative Policy Optimization (GRPO), which artificially increases response length (especially for incorrect outputs) during training. To address this, we introduce Dr. GRPO, an unbiased optimization method that improves token efficiency while maintaining reasoning performance. Leveraging these insights, we present a minimalist R1-Zero recipe that achieves 43.3% accuracy on AIME 2024 with a 7B base model, establishing a new state-of-the-art.

Speaker: Md. Saqib Hasan

Location: CS2311
Abstract:
Artificial intelligence (AI)-based methods and computational materials science continue to make inroads into accelerated materials design and development. I will review Al-enabled advances made in the subfield of polymer informatics, with a particular focus on the design of application-specific practical polymeric materials. I will describe exemplar design attempts within a few critical and emerging application spaces, including materials designs for storing, producing, and conserving energy, and those that can prepare us for a sustainable economy powered by recyclable and/or biodegradable polymers. Al- powered workflows help efficiently search the staggeringly large chemical and configurational space of materials, using modern machine-learning (ML) algorithms to solve forward and inverse materials design problems. A practical informatics-based design protocol involves creating a set of application-specific target property criteria, building ML model predictors for those relevant target properties, enumerating or generating a tangible population of viable polymers, and selecting candidates that meet design recommendations. The protocol will be demonstrated for several energy and sustainability-related applications. Finally, I will offer an outlook on the lingering obstacles that must be overcome to achieve widespread adoption of informatics-driven protocols in industrial-scale materials development.

Speaker Bio:
Prof. Ramprasad is the Regents' Entrepreneur, Michael E. Tennenbaum Family Chair and Georgia Research Alliance Eminent Scholar in the School of Materials Science & Engineering at the Georgia Institute of Technology. He is also the CEO and co-founder of Matmerize, Inc. His area of expertise is the development and application of computational and machine learning tools to accelerate sustainable materials development aimed at energy production, storage and utilization. Prof. Ramprasad received his B. Tech. in Metallurgical Engineering at the Indian Institute of Technology, Madras, India, an M.S. degree in Materials Science & Engineering at the Washington State University, and a Ph.D. degree also in Materials Science & Engineering at the University of Illinois, Urbana-Champaign.
Prof. Ramprasad is a Fellow of the Materials Research Society, a Fellow of the American Physical Society, an elected member of the Connecticut Academy of Science and Engineering, and the recipient of the Alexander von Humboldt Fellowship and the Max Planck Society Fellowship for Distinguished Scientists. He has authored or co-authored over 300 peer-reviewed journal articles, 8 book chapters and 8 patents, and has delivered over 300 invited talks at Universities and Conferences worldwide. He is a member of the Editorial Advisory Boards of npj Computational Materials, ACS Materials Letters and Journal of Physical Chemistry A/B/C. He created and chaired the inaugural 2022 Gordon Research Conference on Computational Materials Science and Engineering.

Location: Room 301, Engineering Building
Towards Saving Lives with Natural Language Processing Andrew Schwartz Dept. of Computer Science Stony Brook Analyzing language use patterns is proving to be a valuable and unique approach to understanding the psychological, social, and health factors of people. On the individual level, Facebook and Twitter have been found predictive of mental health, personality, demographics, and occupational class (among others). At the community or county-level, Twitter has been found predictive of flu and allergy outbreaks, life satisfaction, atherosclerotic heart disease mortality, health behavioral risk factors, excessive drinking, and HIV prevalence. While these techniques have shown robust links over a plethora of important aspects of human life, it is not clear whether any lives have been saved, at least directly, by such work. At their core, some barriers to improving health care and saving lives are likely not NLP or even AI problems, but others are perhaps technical in nature and suggest changing the way we model data. This seminar will have two parts: a presentation and a discussion. I will start by going over recent and on-going work toward predicting mental health outcomes --- depression, addiction relapse, future psychological distress --- from human language use patterns. Then, I will present an imperfect vision of a future where NLP helps to save lives and open the floor for discussion of technical barriers and whether such a vision is practical. Biography: Andrew Schwartz received his PhD in Computer Science from the University of Central Florida in 2011 with research on acquiring lexical semantic knowledge from the Web. He then joined the University of Pennsylvania where he was a Postdoctoral Research Fellow and later Visiting Assistant Professor in Computer & Information Science. He is Lead Research Scientist for the World Well-Being Project, a multidisciplinary group of Computer Scientists and Psychologists studying physical and psychological well-being based on language in social media.
Mind Brain Lecture: Constructing the World of Taste in Your Head You fork the morsel into your mouth and say yum...chocolate cake. The appreciation of your dessert's taste seems to follow directly, quickly and simply from the placement of the food on your tongue. The truth, however, is far more interesting and complex: your brain actually begins determining whether you will enjoy a bite of food even before the fork approaches your mouth and continues to work the problem well after. Information about your food's color, smell, texture and taste activates multiple parts of your brain, where that information collides with your pre-mouthful beliefs about how it should taste. The coming-together and shuffling of that information around the brain takes time, as networks of neurons work together to help you decide whether the morsel in your mouth is worth swallowing. Referring to work from psychology, biology and computational neuroscience, Professor Katz will de-mystify and reveal the beauty of these complexities of the neuroscience of taste. Donald Katz, Professor of Psychology, Departments of Neuroscience, Psychology, and the Volen National Center for Complex Systems, Brandeis University Free presentation intended for a general audience. Reception to follow. https://www.stonybrook.edu/commcms/mind/

Abstract: Materials used in extreme environments, such as high temperatures, irradiation, and stress, often fail due to rapid defect generation and microstructural evolution, and traditional approaches cannot explore the vast design space needed for next-generation alloys. I will present a machine learning framework powered by massive computing that links individual atomic motion to microstructural evolution. Neural network kinetics models trained on first-principles data map vacancy barrier spectra and capture correlated diffusion in multicomponent alloys, revealing design strategies to suppress radiation damage. At larger scales, simulations uncover dislocation patterning and distinguish between confined and extended slip bands, offering new insight into collective dislocation motion and deformation instabilities. By integrating AI-driven modeling, large-scale computing, and experimental validation, my research goal is to accelerate the discovery of damage-tolerant materials and advance fundamental understanding of defect physics in extreme environments.

Speaker Bio: Penghui Cao is an Associate Professor in Mechanical and Aerospace Engineering at the University of California, Irvine, with a joint appointment in Materials Science and Engineering. He received his PhD in mechanical engineering from Boston University and subsequently worked as a Postdoctoral Associate in the Department of Nuclear Science and Engineering at the Massachusetts Institute of Technology from 2014 to 2018. Dr. Cao's research focuses on understanding the fundamental mechanisms that govern radiation responses and microstructure evolution in materials, and on developing advanced alloys for high-performance nuclear energy systems. His lab advances computational and modeling algorithms, integrates advanced manufacturing techniques to tailor microstructures, and leverages state-of-the-art electron microscopy to characterize and assess underlying mechanisms. He is the recipient of the DOE Early Career Research Program Award and the UCI Samueli School's Mid-Career Award for Faculty Excellence in Research.

Location: Institute for Advanced Computational Science, Seminar Room

*This seminar will be held in-person and online. Zoom link below*

Join Zoom Meeting: https://stonybrook.zoom.us/j/96410717491?pwd=3WGMwbLYNMSbI2IF160VXkvv2JmCQ1.1

Meeting ID: 964 1071 7491
Passcode: 399333

Join us to share your thoughts about teaching, learning, and AI!

The landscape of higher education is rapidly evolving with the integration of Artificial Intelligence (AI). Through the Institute on AI, Pedagogy, and the Curriculum with AAC&U, we are exploring ways that we can better address AI in teaching and learning. We want to hear your experiences, your concerns, and your ideas.

This is an open discussion for all faculty and staff to share their perspectives on the opportunities and challenges AI presents in our academic environment.

We'll be exploring critical questions like:

  • In the age of AI, what are the opportunities you see for enriching the classroom and curriculum? How can it enhance student learning or your professional practice?

  • What are the most significant challenges and concerns that AI raises for you regarding academics, student integrity, or your workload?

  • What resources (tools, training, technical support, policy guidance, etc.) do you need to feel confident and successful in the age of AI?

Dates/Times:

  • Tuesday, 2/3 at 2pm

  • Friday, 2/6 at 9:30am

Please register in advance for the Zoom link.

Can't Make It? Share Your Feedback!

We understand schedules are tight. If you cannot attend the live discussion, you can still share your thoughts! Join our AI Zoom Room to share your thoughts via video recording or email rose.tirotta-esposito@stonybrook.edu with your comments and ideas.

Videos will not be shared publicly and comments will only be shared in aggregate.

Your input is vital. From pedagogy to assessment, your insights will be critical. We look forward to a thoughtful and productive conversation!

  • Dr. Rose Tirotta-Esposito (Assistant Provost; Director of CELT)

  • Dr. Elizabeth Hewitt (Associate Professor in the Department of Technology and Society (DTS) in the College of Engineering and Applied Sciences)

  • Chris Kretz (Associate Librarian and Head of Academic Engagement at SBU Libraries)

  • Prof. Rajiv Lajmi (Assistant Professor in the School of Health Professions and Chair of Applied Health Informatics)

  • Dr. Matthew Salzano (Assistant Professor in the Department of Communication in the School of Communication and Journalism)