Stony Brook University and Columbia researchers discover that the New York Times word game Connections can serve as a challenging benchmark for training Large Language Models in abstract reasoning.

Stony Brook, NY, Nov 1, 2024 - While AI and machine learning regularly beat the world’s greatest chess players, a recent study found that when it comes to the New York Times Connections, even the best-performing Large Learning Model (LLM), Claude 3.5 Sonnect, can fully solve only 18% of the games.