Abstract
We consider extreme value analysis in a semi-supervised setting, where we observe, next to the n data on the target variable, n +m data on one or more covariates. This is called the semi-supervised model with n labeled and m unlabeled data. By exploiting the tail dependence between the target variable and the covariates, we derive an estimator for the extreme value index of the target variable in this setting and establish its asymptotic behavior. Our estimator substantially improves the univariate estimator, based on only the n target variable data, in terms of asymptotic variance whereas the asymptotic bias remains unchanged. We present a simulation study in which the asymptotic results are confirmed and also an extreme quantile estimator is derived and its improved performance is shown. Finally the estimation method is applied to rainfall data in France.
Original language | English |
---|---|
Journal | Journal of the American Statistical Association |
DOIs | |
Publication status | E-pub ahead of print - May 2024 |
Keywords
- asymptotic normality
- extreme value index
- semi-supervised inference
- tail dependence
- variance reduction
Fingerprint
Dive into the research topics of 'Extreme value statistics in semi-supervised models'. Together they form a unique fingerprint.Datasets
-
Extreme Value Statistics in Semi-Supervised Models
Ahmed, H. (Creator), Einmahl, J. H. J. (Creator) & Zhou, C. (Creator), Taylor & Francis Group, 15 May 2024
DOI: 10.6084/m9.figshare.25452217.v2, https://tandf.figshare.com/articles/dataset/Extreme_value_statistics_in_semi-supervised_models/25452217/2
Dataset