Events
Emma Simpson (UCL): Accounting for Missing Data When Modelling Block Maxima
Centre for Probability, Statistics and Data ScienceDate: 7 May 2026 Time: 14:00 - 15:00
Location: Hybrid: MB503, SMS, QMUL, or via the Teams link below
Modeling block maxima using the generalized extreme value (GEV) distribution is a classical and widely used method for studying univariate extremes. It allows for theoretically-motivated estimation of return levels, including extrapolation beyond the range of observed data. A frequently overlooked challenge in applying this methodology comes from handling datasets containing missing values. In this case, one cannot be sure whether the true maximum has been recorded in each block, and simply ignoring the issue can lead to biased parameter estimators and, crucially, underestimated return levels. We propose an extension of the standard block maxima approach to overcome such missing data issues. This is achieved by explicitly accounting for the proportion of missing values in each block within the GEV model. Inference is carried out using likelihood-based techniques, and we propose an update to commonly used diagnostic plots to assess model fit. We assess the performance of our method via a simulation study, with results that are competitive with the "ideal" case of having no missing values. We demonstrate our methodology on air pollution data from Plymouth, U.K.
This is joint work with Paul Northrop (UCL).
| Contact: | Nicolás Hernández |
| Email: | n.hernandez@qmul.ac.uk |
| Website: |
Updated by: Kostas Papafitsoros