Abstract:

It is known that models like large language models (LLMs) can often suggest colloquial plans given verbal descriptions of tasks, yet they are unable to reliably provide executable and verifiable plans given formally specified environments. In this talk, I will discuss a strand of efforts to have LLMs generate accurate and explainable plans in textual simulations. Instead of directly generating the plan or actions, LLMs are prompted to generate Planning Domain Definition Language (PDDL) that specifies the environment (domain file) and the task (problem file), which can then be deterministically solved with an off-the-shelf planner. In a 3-phase study, my collaborators and I first observed that it is possible but very challenging for LLMs to generate long-form code such as PDDL domain and problem files given textual specifications. Next, we devise methodologies for LLMs to iteratively generate and refine problem files while exploring a partially-observed, simulated, textual environment. Finally, we show that domain files are even more difficult to generate correctly, even on well-established planning tasks such as BlocksWorld. Finally, I will discuss ongoing efforts to improve said ability of structured generation and promising frontiers to explore.

Bio:
Li Harry Zhang is an assistant professor at Drexel University, focusing on Natural Language Processing (NLP) and artificial intelligence (AI). He obtained his PhD degree from the University of Pennsylvania advised by Prof. Chris Callison-Burch. Prior, he obtained his Bachelor's degree at the University of Michigan mentored by Prof. Rada Mihalcea and Prof. Dragomir Radev. His current research uses large language models (LLMs) to reason and plan via symbolic and structured representations. He has published more than 20 peer-reviewed papers in NLP and AI conferences, such as ACL, EMNLP, and AACL, that have been cited more than 1,000 times. He also consistently serves as Area Chair, Session Chair, and reviewer in those venues. Being a musician, producer, and content creator having over 50,000 subscribers, he is also passionate in the research of AI music and creativity.

Launching a University-Wide AI Innovation Institute:

Last spring, the Office of the Provost led a group of over 30 faculty, staff, and administrators to consider how we can expand and leverage our strengths in AI research and discovery. The resulting recommendation was to launch a university-wide AI Innovation Institute (AI3), which would expand the Institute for AI-driven Discovery and Innovation established in 2018 from a department-level institute within the College of Engineering and Applied Science (CEAS) to the university-wide AI Innovation Institute reporting to the provost.

As a university-wide enterprise, the AI Innovation Institute (AI3) is intended to accelerate, coordinate, and organize AI innovation and education across Stony Brook. The institute will serve to empower the entire university community and beyond, catalyzing core AI research, curriculum innovation, and societal change in the ever-evolving landscape of knowledge work.

The AI Town Hall, led by AI3 Interim Director Skiena, is an open house event that will provide an overview of the major AI initiatives on campus, including the new AI Seed Grant program and Stony Brook's role in New York State's Empire AI program. The session will include time for questions and discussion about the future of AI at Stony Brook.

Join us at the Center for Excellence in Learning and Teaching (CELT) for an engaging workshop on Generative AI. This Zoom workshop is designed for faculty and staff members seeking to enhance their teaching methods and assessment strategies, foster student engagement, and navigate the evolving landscape of AI tools. Recording and slides will be sent to you.

Register here: https://stonybrook.zoom.us/meeting/register/tJ0qceisrjsrE9w1QtMkvSVw4lmr4h4x_Vqu

Talk Title: Knowledge-enhanced LLMs and Human-AI Collaboration Frameworks for Creativity Support


Abstract:

Large language models (LLMs) constitute a paradigm shift in Natural Language Processing and Artificial Intelligence. To build AI systems that are human-centered, I propose we need knowledge-aware models and human-AI collaboration frameworks to help them solve tasks ultimately aligning these models better with human values. In this talk, I will discuss my research agenda for human-centered AI with a case study on creativity that focuses on how to augment LMs with external knowledge, build effective human-AI collaboration frameworks as well as theoretically grounded robust evaluation protocols for measuring capabilities of NLG systems. I will begin by describing knowledge-enhanced methods for creative text generation such as metaphors. Next, I will describe how content creators can collaborate and benefit from the creative capabilities of text-to-image-based AI models. Finally, I will focus on the design and development of theoretically grounded evaluation protocols to benchmark the creative capabilities of Large Language Models in both producing as well as assessing creative text. I will end this talk by highlighting the current limitations of existing models and future directions toward building better models that will enable efficient and trustworthy human-AI collaboration systems.


Bio:

Tuhin Chakrabarty is a final-year Ph.D. candidate in the Natural Language Processing group within the Computer Science department at Columbia University. His research is supported by the Columbia Center of Artificial Intelligence & Technology (CAIT) & an Amazon Science Ph.D. Fellowship. He was also a Computational Journalism fellow at NYTimes R&D and an intern at the Allen Institute of Artificial Intelligence, Salesforce Research, and Deepmind. His research interests are broadly in Natural Language Processing, Computer Vision, and Human-Computer Interaction with a special focus on Human-Centered Methods for Understanding, Generation, and Evaluation of Creativity. His work has been recognized at top natural language processing and human-computer interaction conferences and journals such as ACL, NAACL, EMNLP, TACL, and CHI. He has been involved in organizing several workshops and tutorials at NLP conferences such Figurative Language Processing workshop at EMNLP 2022, NAACL 2024, and the tutorial on Creative Text Generation at EMNLP 2023. His work on AI and creativity has been mentioned in mainstream news media such as The Hollywood Reporter and more recently The Washington Post.

Join Zoom Meeting
https://stonybrook.zoom.us/j/97103601583?pwd=TnpGMXdpeEd1N0hZcXppS1BLNF… (ID: 97103601583, passcode: 004031)

Join by phone
(US) +1 646-931-3860 (passcode: 004031)

Joining instructions: https://www.google.com/url?q=https://applications.zoom.us/addon/invitat…

Meeting host: H.Andrew.Schwartz@stonybrook.edu

Join Zoom Meeting:
https://stonybrook.zoom.us/j/97103601583?pwd=TnpGMXdpeEd1N0hZcXppS1BLNFJhZz09