Date: 23/07/2025 12:00
Location: Seminar room, Departamento de Estadística e I.O.
Group: G.I.R. PEM
Abstract:
Large Language Models (LLMs) have revolutionized natural language processing, but much of their training pipeline remains opaque or framed purely in engineering terms. In this talk, I’ll present a statistical and machine learning perspective on how LLMs are trained, structured around three core phases: autoregressive pre-training, supervised fine-tuning, and reinforcement learning from human feedback (RLHF). We’ll formalize the learning objectives behind each stage, explore consistency guarantees under ideal assumptions, and reinterpret the full pipeline as a sequence of conditional distribution estimators and value-based policy updates. The aim is to demystify LLMs by grounding them in tools familiar to statisticians and ML theorists, without reference to architecture or optimization heuristics.
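
For orientation, a minimal sketch of the kind of objectives the abstract alludes to, written in illustrative notation of my own (the symbols \theta, r, \beta and \pi_{\mathrm{ref}} are assumptions here, not the speaker's formulation): pre-training and supervised fine-tuning both minimize a conditional negative log-likelihood over token sequences, while RLHF maximizes an expected reward regularized toward the fine-tuned reference policy.

  % Autoregressive pre-training / supervised fine-tuning: conditional maximum likelihood
  \hat{\theta} = \arg\min_{\theta} \; \mathbb{E}\Big[ -\sum_{t=1}^{T} \log p_{\theta}(x_t \mid x_{<t}) \Big]

  % RLHF: expected reward with a KL penalty toward the reference (fine-tuned) policy
  \max_{\pi} \; \mathbb{E}_{x \sim \mathcal{D},\, y \sim \pi(\cdot \mid x)}\big[ r(x, y) \big] \;-\; \beta\, \mathrm{KL}\big(\pi(\cdot \mid x) \,\|\, \pi_{\mathrm{ref}}(\cdot \mid x)\big)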