Stony Brook University and Columbia researchers discover that the New York Times word game Connections can serve as a challenging benchmark for training Large Language Models in abstract reasoning.

Stony Brook, NY, Nov 1, 2024 - While AI and machine learning regularly beat the world’s greatest chess players, a recent study found that when it comes to the New York Times Connections, even the best-performing Large Learning Model (LLM), Claude 3.5 Sonnect, can fully solve only 18% of the games.

Artificial Intelligence (AI) tools like ChatGPT and Google Gemini are transforming how content is created, and creating new problems around privacy and truth. At the same time, these tools can be incredibly useful in compiling information and drafting messages.

A group of Stony Brook students this spring took a close look at potential benefits and risks of using AI tools, particularly as they could enhance or hinder advocacy efforts for social causes. The students created a guide for advocacy groups that will help those groups consider how, when or if to use AI in their work.

Department of Ecology and Evolution Professor Ross Nehm has been selected to serve on the National Science Foundation funded committee “Advancing AI in Science Education: A Comprehensive Approach to Equity, Inclusion, and Three-Dimensional Learning.”

Stony Brook, NY, Oct 21, 2024 - SBU Professor Ross Nehm, a leading researcher on the use of AI in student assessment, is one of 15 international scholars selected to serve on this NSF-funded committee, which is charged with establishing norms, frameworks, and guidelines for AI-involved science education research.

Harsh Trivedi, a doctoral researcher at Stony Brook University, was recently invited to present his work at the White House. His project, AppWorld, promises to revolutionize how we automate daily tasks in our digital lives.

Stony Brook, NY, Oct 19, 2024 - Stony Brook University Professor Niranjan Balasubramanian and Ph.D. researcher Harsh Trivedi were invited to represent the university at the White House Office of Science and Technology Policy (OSTP) launch event to celebrate the first allocation of the NAIRR Pilot.