On the sampling distribution of an $\ell^2$ distance between Empirical Distribution Functions with applications to nonparametric testing
CARON, Francois
Advanced Learning Evolutionary Algorithms [ALEA]
Institut de Mathématiques de Bordeaux [IMB]
Advanced Learning Evolutionary Algorithms [ALEA]
Institut de Mathématiques de Bordeaux [IMB]
CARON, Francois
Advanced Learning Evolutionary Algorithms [ALEA]
Institut de Mathématiques de Bordeaux [IMB]
< Réduire
Advanced Learning Evolutionary Algorithms [ALEA]
Institut de Mathématiques de Bordeaux [IMB]
Langue
en
Rapport
Ce document a été publié dans
2012-04
Résumé en anglais
We consider a situation where two sample sets of independent real valued observations are obtained from unknown distributions. Under a null hypothesis that the distributions are equal, it is well known that the sample ...Lire la suite >
We consider a situation where two sample sets of independent real valued observations are obtained from unknown distributions. Under a null hypothesis that the distributions are equal, it is well known that the sample variation of the infinity norm, maximum, distance between the two empirical distribution functions has as asymptotic density of standard form independent of the unknown distribution. This result underpins the popular two-sample Kolmogorov-Smirnov test. In this article we show that other distance metrics exist for which the asymptotic sampling distribution is also available in standard form. In particular we describe a weighted squared-distance metric derived from a binary recursion of the real line which is shown to follow a sum of chi-squared random variables. This motivates a nonparametric test based on the average divergence rather than the maximum, which we demonstrate exhibits greater sensitivity to changes in scale and tail characteristics when the distributions are unequal, while maintaining power for changes in central location.< Réduire
Origine
Importé de halUnités de recherche