Statistics Online Computational Resource (SOCR), Department of Health Behavior and Biological Sciences, University of Michigan, Ann Arbor, MI, USA.
Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA.
Sci Rep. 2019 Apr 12;9(1):6012. doi: 10.1038/s41598-019-41634-y.
The UK Biobank is a rich national health resource that provides enormous opportunities for international researchers to examine, model, and analyze census-like multisource healthcare data. The archive presents several challenges related to aggregation and harmonization of complex data elements, feature heterogeneity and salience, and health analytics. Using 7,614 imaging, clinical, and phenotypic features of 9,914 subjects we performed deep computed phenotyping using unsupervised clustering and derived two distinct sub-cohorts. Using parametric and nonparametric tests, we determined the top 20 most salient features contributing to the cluster separation. Our approach generated decision rules to predict the presence and progression of depression or other mental illnesses by jointly representing and modeling the significant clinical and demographic variables along with the derived salient neuroimaging features. We reported consistency and reliability measures of the derived computed phenotypes and the top salient imaging biomarkers that contributed to the unsupervised clustering. This clinical decision support system identified and utilized holistically the most critical biomarkers for predicting mental health, e.g., depression. External validation of this technique on different populations may lead to reducing healthcare expenses and improving the processes of diagnosis, forecasting, and tracking of normal and pathological aging.
英国生物银行是一个丰富的国家健康资源,为国际研究人员提供了巨大的机会,以检查、建模和分析类似人口普查的多源医疗保健数据。该档案提出了与聚合和协调复杂数据元素、特征异质性和显著性以及健康分析相关的几个挑战。使用 9914 名受试者的 7614 个成像、临床和表型特征,我们使用无监督聚类进行了深度计算表型,并得出了两个不同的子队列。使用参数和非参数检验,我们确定了前 20 个对聚类分离贡献最大的最显著特征。我们的方法通过联合表示和建模重要的临床和人口统计学变量以及得出的显著神经影像学特征,生成了预测抑郁或其他精神疾病存在和进展的决策规则。我们报告了衍生计算表型和对无监督聚类有贡献的顶级显著成像生物标志物的一致性和可靠性度量。这个临床决策支持系统全面地识别和利用了预测心理健康(例如,抑郁)的最关键生物标志物。该技术在不同人群中的外部验证可能会降低医疗保健费用,并改善诊断、预测和跟踪正常和病理性衰老的过程。