Berman N G, Wang C, Paulsen C A
Department of Pediatrics, Harbor-UCLA Medical Center, Torrance, California, USA.
J Androl. 1996 Jan-Feb;17(1):68-73.
We examined two methodological issues in the analysis of sperm concentration data using a large database of sperm concentrations in healthy men that were collected at the University of Washington. We showed that the raw data were skewed and that log transformation should be used to assure that the data meet the assumptions underlying most statistical estimation and testing procedures. We also addressed the issue of the great variability in sperm concentrations within a single individual and the necessity and utility of multiple sampling to reduce variance. We conclude that log-transformed data should be used for statistical analysis of sperm concentration and recommend that such analyses be based on the geometric mean of several samples from each subject to reduce variability, increase accuracy of estimation, and improve statistical power. This is particularly important when the objective is to detect small but important differences or subtle effects.
我们利用华盛顿大学收集的健康男性精子浓度大数据库,研究了精子浓度数据分析中的两个方法学问题。我们发现原始数据呈偏态分布,应进行对数转换以确保数据符合大多数统计估计和检验程序的基本假设。我们还探讨了个体内精子浓度差异极大的问题,以及多次采样以减少方差的必要性和实用性。我们得出结论,对数转换后的数据应用于精子浓度的统计分析,并建议此类分析应基于每个受试者多个样本的几何平均值,以减少变异性、提高估计准确性并增强统计效力。当目的是检测微小但重要的差异或细微效应时,这一点尤为重要。