Friday, February 20, 202611:00 am - 12:00 pm
Tom Goldstein, Volpi-Cupal Endowed Professor of Computer Science, Director of the Maryland Center for Machine Learning, University of Maryland
Location: Hamilton Hall, Room 702
Abstract: Recent trends in LLM development have focused on “Reasoning” models that expend large amounts of compute to improve their performance at inference time by producing many tokens. In this talk, we consider alternatives to the many-token paradigm. We will focus on models that perform efficient latent reasoning without verbalizing their outputs as tokens. We will also consider new generation strategies that bypass arduous and expensive token generating processes altogether.