BEGIN:VCALENDAR
VERSION:1.0
PRODID:Faculty of Science and Engineering - Research
BEGIN:VEVENT
SUMMARY:ML Seminar - Suvrat Raju - 10/04/26
DESCRIPTION;ENCODING=QUOTED-PRINTABLE: =0D=0A=
Title: A model of errors in transformers=0D=0A=
Abstract: We study the error rate of LLMs on tasks like arithmetic that require a deterministic output, and repetitive processing of tokens drawn from a small set of alternatives. By analyzing the accumulation of errors in the attention mechanism, we theoretically derive a quantitative two-parameter relationship between the accuracy and the complexity of the task. We empirically verify our formula across a range of tasks and state-of-the art LLMs find excellent agreement between the predicted and observed accuracy in many cases. We also identify deviations in some cases that lead us to interesting insights about the functioning of models. We show how this understanding helps to construct prompts to reduce the error rate.=0D=0A=
(work in collaboration with Praneeth Netrapalli)=0D=0A=
 
LOCATION:114 GO Jones Building
DTSTART:20260410T143000
DTEND:20260410T153000
END:VEVENT
END:VCALENDAR
