School of Statistics, Capital University of Economics and Business, Beijing, China.
Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China.
PLoS One. 2023 Aug 3;18(8):e0289498. doi: 10.1371/journal.pone.0289498. eCollection 2023.
Testing whether data are from a normal distribution is a traditional problem and is of great concern for data analyses. The normality is the premise of many statistical methods, such as t-test, Hotelling T2 test and ANOVA. There are numerous tests in the literature and the commonly used ones are Anderson-Darling test, Shapiro-Wilk test and Jarque-Bera test. Each test has its own advantageous points since they are developed for specific patterns and there is no method that consistently performs optimally in all situations. Since the data distribution of practical problems can be complex and diverse, we propose a Cauchy Combination Omnibus Test (CCOT) that is robust and valid in most data cases. We also give some theoretical results to analyze the good properties of CCOT. Two obvious advantages of CCOT are that not only does CCOT have a display expression for calculating statistical significance, but extensive simulation results show its robustness regardless of the shape of distribution the data comes from. Applications to South African Heart Disease and Neonatal Hearing Impairment data further illustrate its practicability.
检验数据是否服从正态分布是一个传统问题,也是数据分析中非常关注的问题。正态性是许多统计方法的前提,如 t 检验、Hotelling T2 检验和 ANOVA。文献中有许多检验方法,常用的有 Anderson-Darling 检验、Shapiro-Wilk 检验和 Jarque-Bera 检验。每种检验方法都有其自身的优点,因为它们是针对特定模式开发的,没有一种方法在所有情况下都能始终表现出最优性能。由于实际问题的数据分布可能复杂多样,我们提出了一种在大多数数据情况下稳健且有效的柯西组合总检验(CCOT)。我们还给出了一些理论结果来分析 CCOT 的良好性质。CCOT 的两个明显优点是,CCOT 不仅具有计算统计显著性的显示表达式,而且广泛的模拟结果表明,无论数据来自何种分布形状,CCOT 都具有稳健性。对南非心脏病和新生儿听力障碍数据的应用进一步说明了它的实用性。