Kun Eucharist, Javan Emily M, Smith Olivia, Gulamali Faris, de la Fuente Javier, Flynn Brianna I, Vajrala Kushal, Trutner Zoe, Jayakumar Prakash, Tucker-Drob Elliot M, Sohail Mashaal, Singh Tarjinder, Narasimhan Vagheesh M
Department of Integrative Biology, The University of Texas at Austin.
Icahn School of Medicine at Mount Sinai, New York.
bioRxiv. 2023 Jan 3:2023.01.03.521284. doi: 10.1101/2023.01.03.521284.
The human skeletal form underlies our ability to walk on two legs, but unlike standing height, the genetic basis of limb lengths and skeletal proportions is less well understood. Here we applied a deep learning model to 31,221 whole body dual-energy X-ray absorptiometry (DXA) images from the UK Biobank (UKB) to extract 23 different image-derived phenotypes (IDPs) that include all long bone lengths as well as hip and shoulder width, which we analyzed while controlling for height. All skeletal proportions are highly heritable (∼40-50%), and genome-wide association studies (GWAS) of these traits identified 179 independent loci, of which 102 loci were not associated with height. These loci are enriched in genes regulating skeletal development as well as associated with rare human skeletal diseases and abnormal mouse skeletal phenotypes. Genetic correlation and genomic structural equation modeling indicated that limb proportions exhibited strong genetic sharing but were genetically independent of width and torso proportions. Phenotypic and polygenic risk score analyses identified specific associations between osteoarthritis (OA) of the hip and knee, the leading causes of adult disability in the United States, and skeletal proportions of the corresponding regions. We also found genomic evidence of evolutionary change in arm-to-leg and hip-width proportions in humans consistent with striking anatomical changes in these skeletal proportions in the hominin fossil record. In contrast to cardiovascular, auto-immune, metabolic, and other categories of traits, loci associated with these skeletal proportions are significantly enriched in human accelerated regions (HARs), and regulatory elements of genes differentially expressed through development between humans and the great apes. Taken together, our work validates the use of deep learning models on DXA images to identify novel and specific genetic variants affecting the human skeletal form and ties a major evolutionary facet of human anatomical change to pathogenesis.
人类的骨骼形态是我们能够用双腿行走的基础,但与身高不同的是,人们对四肢长度和骨骼比例的遗传基础了解较少。在这里,我们将深度学习模型应用于来自英国生物银行(UKB)的31221张全身双能X线吸收测定(DXA)图像,以提取23种不同的图像衍生表型(IDP),包括所有长骨长度以及髋部和肩部宽度,并在控制身高的情况下对其进行分析。所有骨骼比例都具有高度遗传性(约40%-50%),对这些性状的全基因组关联研究(GWAS)确定了179个独立位点,其中102个位点与身高无关。这些位点在调节骨骼发育的基因中富集,也与罕见的人类骨骼疾病和异常的小鼠骨骼表型相关。遗传相关性和基因组结构方程模型表明,四肢比例表现出很强的遗传共享性,但在基因上与宽度和躯干比例无关。表型和多基因风险评分分析确定了美国成人残疾的主要原因——髋部和膝部骨关节炎(OA)与相应区域的骨骼比例之间的特定关联。我们还发现了人类手臂与腿部以及髋部宽度比例进化变化的基因组证据,这与人类化石记录中这些骨骼比例的显著解剖学变化一致。与心血管、自身免疫、代谢和其他类型的性状不同,与这些骨骼比例相关的位点在人类加速区域(HARs)以及人类与大猩猩在发育过程中差异表达的基因调控元件中显著富集。综上所述,我们的工作验证了在DXA图像上使用深度学习模型来识别影响人类骨骼形态的新的特定遗传变异,并将人类解剖学变化的一个主要进化方面与发病机制联系起来。