THE MODEL OF COINCIDENCE ANALYSIS IN THE CALCULATION OF NONPARAMETRIC RANK STATISTICS
https://doi.org/10.26896/1028-6861-2017-83-11-66-72
Abstract
Nonparametric statistic is one of the points of growth of modern mathematical and statistical methods of research. In nonparametric statistics, rank criteria based on the use of the ranks of the sample elements (observation results), rather than numerical values of the sample elements themselves, take an important place. Ranks are the numbers of sample elements in the corresponding variation series, constructed by rearranging the results of observations in the order of nondecreasing. Distributions of rank criteria are obtained on the assumption of continuity of the distribution functions of the observation results, hence, the probability of coincidence of the values of the random variables forming the analyzed samples should be equal to zero. However, in actual data, there are coincidences. Consequently, the assumption of the continuity of the distribution functions of the observation results is incorrect and known theorems on the distribution of rank statistics, strictly speaking, are not applicable. However, with a small number of coincidences, ranks statistics can be recommended for use, albeit with some corrections. Thus, an additional superstructure is mounted on the classical mathematical-statistical theory to take into account the coincidence of the data. Naturally, the validity of different methods used for accounting the coincidence of calculation data should be considered. We propose a probabilistic-statistical model that explains the occurrence of the coincidences and provides algorithms for their analysis. This model is based on the assumption that data coincidences appears as a result of «sticking together» of the slightly different observation results. We propose to introduce small corrections into each elements of the coincident group of observation results and thus to obtain a sample without coincidences and calculate the value of rank statistics. Having considered various variants of amendments, we obtain a «cloud» of values of rank statistics. Analysis of this «cloud» allows us to obtain statistical conclusions. Two-sample Wilcoxon test is considered as an example.
About the Author
A. I. OrlovRussian Federation
References
1. Orlov A. I. The growth points of statistical methods / Politem. Set. Élektron. Nauch. Zh. Kuban. Gos. Agrarn. Univ. 2014. N 103. P. 136 – 162 [in Russian].
2. Orlov A. I. Structure of nonparametric statistics (generalizing paper) / Zavod. Lab. Diagn. Mater. 2015. Vol. 81. N 7. P. 62 – 72 [in Russian].
3. Orlov A. I. Nonparametric goodness-of-fit Kolmogorov, Smirnov, omega-square tests and the errors in their application / Politem. Set. Élektron. Nauch. Zh. Kuban. Gos. Agrarn. Univ. 2014. N 97. P. 32 – 45 [in Russian].
4. Orlov A. I. What hypothesis can be verified using the two-sample Wilcoxon test? / Zavod. Lab. Diagn. Mater. 1999. Vol. 65. N 1. P. 51 – 55 [in Russian].
5. Orlov A. I. Two-sample Wilcoxon test — analysis of two myths / Politem. Set. Élektron. Nauch. Zh. Kuban. Gos. Agrarn. Univ. 2014. N 104. P. 91 – 111 [in Russian].
6. Orlov A. I. Consistent tests of absolute homogeneity for independent samples / Zavod. Lab. Diagn. Mater. 2012. Vol. 78. N 11. P. 66 – 70 [in Russian].
7. Orlov A. I. Methods for testing the homogeneity of the paired samples / Zavod. Lab. Diagn. Mater. 2004. Vol. 70. N 7. P. 57 – 61 [in Russian].
8. Orlov A. I. Testing of homogeneity of the paired samples / Politem. Set. Élektron. Nauch. Zh. Kuban. Gos. Agrarn. Univ. 2016. N 123. P. 708 – 726 [in Russian].
9. Hollander M., Wolfe D. A., Chicken E. Nonparametric Statistical Methods. Third Edition. — Hoboken, New Jersey: John Wiley & Sons, Inc., 2014. — 828 p.
10. Orlov A. I. About the methodology of statistical methods / Politem. Set. Élektron. Nauch. Zh. Kuban. Gos. Agrarn. Univ. 2014. N 104. P. 53 – 80 [in Russian].
11. Orlov A. I. Statistics of interval data (generalizing paper) / Zavod. Lab. Diagn. Mater. 2015. Vol. 81. N 3. P. 61 – 69 [in Russian].
12. Orlov A. I. Stable mathematical methods and models / Zavod. Lab. Diagn. Mater. 2010. Vol. 76. N 3. P. 59 – 67 [in Russian].
13. Hajek Ja., Sidak Zb. Theory of rank tests. — Prague: Academia. Publishing house of the Czechoslovak academy of sciences, 1967. — 376 p.
14. Orlov A. I. Applied statistics. — Moscow: Ékzamen, 2006. — 671 p. [in Russian].
Review
For citations:
Orlov A.I. THE MODEL OF COINCIDENCE ANALYSIS IN THE CALCULATION OF NONPARAMETRIC RANK STATISTICS. Industrial laboratory. Diagnostics of materials. 2017;83(11):66-72. (In Russ.) https://doi.org/10.26896/1028-6861-2017-83-11-66-72