Gordon Adam D
Department of Anthropology, University at Albany (SUNY), 1400 Washington Avenue, Albany, NY 12222, USA; College of Fellows, Institute of Advanced Study, Durham University, Cosin's Hall, Palace Green, Durham, DH1 3RL, UK; Department of Anthropology, Durham University, Dawson Building, South Road, Durham, DH1 3LE, UK.
J Hum Evol. 2025 Feb;199:103630. doi: 10.1016/j.jhevol.2024.103630. Epub 2024 Dec 26.
The degree of sexual size dimorphism in fossil hominins is important evidence for the evaluation of evolutionary hypotheses, but it is also difficult/impossible to measure directly. Multiple methods have been developed to estimate dimorphism in univariate and multivariate datasets, including when data are missing. This paper introduces 'dimorph', an R package that implements many of these methods and associated resampling-based significance tests and evaluates their performance in terms of Type I error rates and power. Tests evaluated here are those that appear most commonly in the hominin literature: testing whether a fossil sample is significantly more dimorphic than a comparative sample of known dimorphism. Univariate and multivariate methods are applied to metric data from four extant hominoid species: Gorilla gorilla, Homo sapiens, Pan troglodytes, and Hylobates lar. Each species is represented by 47 female and 47 male adult individuals, from which 10 linear postcranial measurements are collected. Data are resampled at a broad range of sample sizes (n = 4 to n = 82), sex ratios (proportion of females range from 0 to 1), and in the case of missing-data methods, proportions of missing data (0-0.9). Type I error rates and power are evaluated by the proportion of tests correctly or incorrectly rejecting null hypotheses regarding dimorphism difference within pairs of samples drawn from these four species, in which one sample stands in for a fossil sample. Results indicate low Type I error rates for all methods, whereas power is variable across methods but often low at sample sizes common to fossil analyses. Recommendations are made for the best significance tests. Additionally, previous work using lack of significant difference as evidence for similarity in dimorphism between fossils and extant species should be re-examined to determine whether those studies have enough power to detect known differences among extant taxa.
化石人类的两性体型差异程度是评估进化假说的重要证据,但直接测量也很困难或几乎不可能。人们已经开发出多种方法来估计单变量和多变量数据集中的两性差异,包括数据缺失时的情况。本文介绍了“dimorph”,这是一个R软件包,它实现了许多此类方法以及基于重采样的显著性检验,并从I类错误率和检验效能方面评估了它们的性能。这里评估的检验是在人类文献中最常出现的那些:检验一个化石样本是否比已知两性差异的比较样本具有显著更高的两性差异。单变量和多变量方法被应用于来自四种现存类人猿物种的测量数据:西部大猩猩、智人、黑猩猩和白掌长臂猿。每个物种由47名成年雌性和47名成年雄性个体代表,从中收集了10项颅后线性测量数据。数据在广泛的样本量(n = 4至n = 82)、性别比(雌性比例范围从0到1)下进行重采样,对于缺失数据方法,还包括缺失数据的比例(0 - 0.9)。I类错误率和检验效能通过正确或错误拒绝关于从这四个物种中抽取的样本对之间两性差异的零假设的检验比例来评估,其中一个样本代表化石样本。结果表明所有方法的I类错误率都很低,而检验效能因方法而异,但在化石分析常见的样本量下通常较低。文中针对最佳显著性检验给出了建议。此外,应重新审视以前将缺乏显著差异作为化石与现存物种两性差异相似性证据的研究,以确定这些研究是否有足够的检验效能来检测现存分类群之间已知的差异。