Date: Thursday, September 25, 2025
Time: 11:45 a.m. to 12:45 p.m.
Location: G01 Gates Hall
Speaker: Samy Bengio, Senior Director of Machine Learning Research at Apple
Abstract: In this presentation, I will go over a few recent topics towards understanding the limits of reasoning capabilities of large language models. I will start with an analysis of the hardness of problems like syllogisms when tackling them with the Transformer architecture; I will propose a metric to measure that hardness, and an approach to make it easier for LLMs to reduce such hardness. I will show that these limitations are not specific to the textual domain and also exist for other domains like visual tasks. I will then show how difficult it is to accurately measure the performance of modern reasoning models and will end with a few approaches to reduce the complexity of some reasoning problems.
