Nuffield Department of Population Health, University of Oxford, Oxford, UK.
Nuffield Division of Clinical Laboratory Sciences, Radcliffe Department of Medicine, University of Oxford, Oxford, UK.
Sci Rep. 2021 Mar 8;11(1):5405. doi: 10.1038/s41598-021-84860-z.
The importance of quantifying the distribution and determinants of multimorbidity has prompted novel data-driven classifications of disease. Applications have included improved statistical power and refined prognoses for a range of respiratory, infectious, autoimmune, and neurological diseases, with studies using molecular information, age of disease incidence, and sequences of disease onset ("disease trajectories") to classify disease clusters. Here we consider whether easily measured risk factors such as height and BMI can effectively characterise diseases in UK Biobank data, combining established statistical methods in new but rigorous ways to provide clinically relevant comparisons and clusters of disease. Over 400 common diseases were selected for analysis using clinical and epidemiological criteria, and conventional proportional hazards models were used to estimate associations with 12 established risk factors. Several diseases had strongly sex-dependent associations of disease risk with BMI. Importantly, a large proportion of diseases affecting both sexes could be identified by their risk factors, and equivalent diseases tended to cluster adjacently. These included 10 diseases presently classified as "Symptoms, signs, and abnormal clinical and laboratory findings, not elsewhere classified". Many clusters are associated with a shared, known pathogenesis, others suggest likely but presently unconfirmed causes. The specificity of associations and shared pathogenesis of many clustered diseases provide a new perspective on the interactions between biological pathways, risk factors, and patterns of disease such as multimorbidity.
量化多种疾病的分布和决定因素的重要性促使了新的数据驱动的疾病分类方法的出现。这些方法的应用包括提高了一系列呼吸系统、传染病、自身免疫和神经疾病的统计能力和预后准确性,这些研究使用分子信息、疾病发病年龄和疾病发作顺序(“疾病轨迹”)来对疾病集群进行分类。在这里,我们考虑了身高和 BMI 等易于测量的风险因素是否可以有效地描述英国生物库数据中的疾病,我们将已建立的统计方法以新的但严格的方式结合起来,以提供临床相关的比较和疾病集群。使用临床和流行病学标准选择了 400 多种常见疾病进行分析,并使用传统的比例风险模型估计了与 12 种已确立的风险因素的关联。一些疾病的 BMI 与疾病风险之间存在强烈的性别依赖性关联。重要的是,大多数影响男女两性的疾病都可以通过其风险因素来识别,并且具有相同风险因素的疾病往往相邻聚类。其中包括 10 种目前被归类为“症状、体征和异常临床及实验室发现,无法归类”的疾病。许多聚类与已知的共同发病机制相关联,其他聚类则提示可能存在但目前尚未证实的病因。许多聚类疾病的关联的特异性和共同发病机制为研究生物途径、风险因素和多种疾病(如多种疾病)之间的模式之间的相互作用提供了新的视角。