Abstract: DeepSeek-R1-Zero has shown that reinforcement learning (RL) at scale can directly enhance the reasoning capabilities of LLMs without supervised fine-tuning. In this work, we critically examine R1-Zero-like training by analyzing its two core components: base models and RL. We investigate a wide range of base models, including DeepSeek-V3-Base, to understand how pretraining characteristics influence RL performance. Our analysis reveals that DeepSeek-V3-Base already exhibit ''Aha moment'', while Qwen2.5 base models demonstrate strong reasoning capabilities even without prompt templates, suggesting potential pretraining biases. Additionally, we identify an optimization bias in Group Relative Policy Optimization (GRPO), which artificially increases response length (especially for incorrect outputs) during training. To address this, we introduce Dr. GRPO, an unbiased optimization method that improves token efficiency while maintaining reasoning performance. Leveraging these insights, we present a minimalist R1-Zero recipe that achieves 43.3% accuracy on AIME 2024 with a 7B base model, establishing a new state-of-the-art.

Speaker: Md. Saqib Hasan

Location: CS2311


Date of Event

Joel H. Saltz, MD, PhD
SUNY Distinguished Professor Cherith Professor and Founding Chair
Department of Biomedical Informatics
Stony Brook University

Apostolos K. Tassiopoulos, MD, FACS
Professor of surgery and vice chair for quality and outcomes Chief of the Division of Vascular and Endovascular Surgery
Director of the Stony Brook Vascular Center Stony Brook Medicine

Title: Clinical applications of artificial intelligence to improve diagnosis and risk stratification for patients with aortic aneurysms

Time: Wednesday, Feb 17, 2021 3 pm - 4 pm

Join Zoom Meeting
https://stonybrook.zoom.us/j/95617197636?pwd=KytzZ2pVRG9SZGpKZUtpNXJISj...
Meeting ID: 956 1719 7636 Passcode: 924293

Abstract: Autonomous systems, whether on Earth or in space, rely on 3D perception to understand and interact with the world around them. Yet traditional techniques for 3D understanding often depend on human designed features, fixed sensors, and conventional imaging modalities. This constrained approach can limit every stage of perception, from sensing to interpretation to decision making.
In this talk, we'll explore an alternative paradigm for imaging: physically based neural representations for 3D scenes and 3D sensing systems. We will discuss how recent advances in large scale learned representations can be used to jointly optimize both 3D scene models and the design of sensing systems for 3D capture, with the goal of enabling task specific perception systems.
Unlike modern AI models trained on internet scale datasets, these specialized 3D representations typically operate in data sparse regimes and therefore require a different kind of prior. We'll examine how grounding these learned representations in the physics of light transport can improve our understanding of scene structure, and inform imaging system design even with limited data. By connecting physical insights with learned representations, we'll highlight new possibilities for robust, efficient, and adaptive perception in challenging environments.

Speaker: Nikhil Behari is a graduate student in the Camera Culture group at the MIT Media Lab, advised by Professor Ramesh Raskar. His research interests include computational imaging, 3D scene understanding, and multi-agent decision-making under uncertainty, with a focus on automating imaging system design for 3D perception in human and planetary health. His research is supported by the NASA Space Technology Graduate Research Fellowship. He received his bachelor's in Computer Science and Statistics from Harvard University in 2022.

Abstract: The development of embodied AI has largely focused on scaling data and computational power, often at the cost of energy efficiency. In contrast, biological intelligence achieves remarkable adaptability with minimal resources, inspiring a shift toward neuromorphic AI, an approach that mimics the structure and dynamics of biological neural systems. In this talk, I will explore the promises and challenges of neuromorphic computer vision from three key perspectives: algorithms, robot actions, and data. First, I will discuss algorithmic advances, including continuous visual hull reconstruction, continuous-time human motion field estimation, and unsupervised independent motion segmentation. Next, I will illustrate how neuromorphic vision enables agile robotic actions by leveraging event-based perception for real-time decision-making. Finally, I will address challenges in training data-driven models with event data, highlighting strategies to enhance data availability and efficiency. By integrating these elements, neuromorphic AI paves the way for energy-efficient, high-performance embodied intelligence in dynamic real-world environments.

Speaker Bio: Ziyun (Claude) Wang is a fifth-year Ph.D. student in the General Robotics, Automation, Sensing & Perception (GRASP) Lab at the University of Pennsylvania, advised by Professor Kostas Daniilidis. His research focuses on developing algorithms for neuromorphic computer vision and integrating them with real hardware to enable agile perception in embodied AI systems. Prior to his Ph.D., he worked at the Samsung AI Center New York, where he developed 3D reconstruction techniques for robotic applications and earned three patents. He also contributed to the Apple Vision Pro team, enhancing user comfort for AR glasses. His research work has been recognized at major computer vision, robotics, and machine learning venues including the AAAI Conference on Artificial Intelligence (AAAI), European Conference on Computer Vision (ECCV), International Conference on Learning Representations (ICLR), Conference on Computer Vision and Pattern Recognition (CVPR) workshops, and IEEE Robotics and Automation Letters (R-AL), with an oral presentation at ECCV placing in the top 2.7%. His research aims to drive the development of next-generation bio-inspired AI systems, enabling more efficient, adaptive, and intelligent embodied perception.
Face Editing with Machine Learning presented by Zhixin Shu

ABSTRACT: The face is the most informative feature of humans and has been a long-standing research topic in Computer Vision and Graphics. Images of faces are also ubiquitous in photography and social media, and people have devoted significant resources to capturing and editing face images. Face editing can be broadly viewed as the encoding, manipulation and the decoding of some representations for face images. The challenges are that we want to manipulate an image in a controllable way and generate results that are both desirable and as realistic as possible. This thesis explores different Machine Learning-based face-editing approaches. I discuss the role of machine learning for achieving desirable edits by learning both the physical aspects as well as the statistical manifold of human faces. In my work for eye-editing, I discuss the importance of understanding multiple physical elements of a face image, such as shape, illumination, pose, etc. In a deep-learning-based approach, I introduce image formation domain knowledge to the construction and training of a neural network. This network provides transparent access to the disentangled representations of the aforementioned physical properties. With this network, we can achieve various face editing tasks in forms of representation manipulation. After that, I introduce Deforming Autoencoders, a network that learns to disentangle shape and appearance in an unsupervised manner. This disentanglement is beneficial for the learning of some other factors of variations, such as illumination and facial expression. In an extension of Deforming Autoencoders, we incorporate non-rigid structure-from-motion to learn a 3D morphable model for faces that only requires an image set for training. At last, I describe an image-to-image network for 3D face reconstruction, which also utilizes structure-from-motion in deep learning. With real face images in training, this network not only reconstructs 3D faces more accurately than prior art but also has better generalization ability in real-life testing cases.
As artificial intelligence continues to transform higher education and the world beyond, how are students engaging with this change? Join us for a student-led discussion that explores how AI is influencing academic integrity, learning practices, and students' perspectives on its role in future workplaces.

Our panelists will share their experiences and reflections on questions such as:
1. What counts as appropriate and inappropriate use of AI in coursework?
2. How do faculty approach AI and talk about its implications in class?
3. What does AI mean for students' learning and ethical decision-making?
4. How are students building their understanding of AI tools and their potential uses in professional contexts?

This conversation offers an authentic look at how students are navigating the promises and challenges of AI--both in their studies and as they look ahead to applying these technologies responsibly in their fields.

Register here.
  • CEWIT's 6th annual hackathon sponsored by Major League Hacking, Hack@CEWIT2022, is taking place virtually on February 18-20, 2022. This year's theme is Hacking Into the Metaverse and will focus on NFT's, Blockchain, Crypto, and the Metaverse. To find out more about the event, mentoring, sponsoring, or to register, visit:

  • https://www.cewit.org/programs/events/hack.php

The International Neuroethics Society (INS) Speaker Series on AI & Consciousness

Abstract: Colln Allen and I noted in our 2008 book Moral Machines: Teaching Robots Right From Wrong, that consciousness, a theory of mind, sociability, situational awareness and embodiment are all supra-rational (beyond reason) capabilities that contribute to making ethical decision Whether any of these can be fully instantiated in machines remains an open question. Nevertheless, moral decision making in the digital age will require an evolution in and refinement of specific skills for both humans and for AI. I call one of these evolutions in moral decision making capabilities tradeoff ethics and another a silent ethics. Aspects of this social, and not just technological evolution, will require research by neuroscientists.

Speaker Bio: Wendell Wallach has an international reputation as an expert on the ethics and governance of emerging technologies, particularly artificial intelligence and biotechnologies. He is also senior advisor to The Hastings Center and a scholar at the Yale University Interdisciplinary Center for Bioethics where he chaired Technology and Ethics studies for eleven years. Wallach's latest book, a primer on emerging technologies, is entitled, A Dangerous Master: How to keep technology from slipping beyond our control. He co-authored (with Colin Allen) Moral Machines: Teaching Robots Right From Wrong. Wallach has been referred to as, a Godfather of AI Ethics.

Pre-register here (required): https://umaryland.zoom.us/meeting/register/sPpiR_drR4-9JYDhI2NhJg

What can you learn from over seven years' worth of Twitter bios? Steven Skiena, Distinguished Teaching Professor of Computer Science and Director of SBU's Institute for AI-Driven Discovery and Innovation, will tell us.

Presenting work done with collaborators Jason Jones, Dakota Handzlik, and Xingzhi Guo, Dr. Skiena will discuss what the team learned about how people portray themselves on social media through their political identities and job status. He'll also show us what you can predict about a person based on their self-description.

If you have a disability and are requesting accommodations in order to fully participate in this event, please email libraryevents@stonybrook.edu or call 631-632-7100.

Register now: https://library.stonybrook.edu/library-events/stem-speaker-series-measuring-self-identity/