Yalaev B I, Novikov A V, Minniakhmetov I R, Khusainova R I
Endocrinology Research Centre.
Endocrinology Research Centre; Institute of Biochemistry and Genetics-Subdivision of the Ufa Federal Research Centre of the Russian Academy of Sciences.
Probl Endokrinol (Mosk). 2024 Jan 24;70(6):67-82. doi: 10.14341/probl13421.
Osteoporosis is a common age-related disease with disabling consequences, the early diagnosis of which is difficult due to its long and hidden course, which often leads to diagnosis only after a fracture. In this regard, great expectations are placed on advanced developments in machine learning technologies aimed at predicting osteoporosis at an early stage of development, including the use of large data sets containing information on genetic and clinical predictors of the disease. Nevertheless, the inclusion of DNA markers in prediction models is fraught with a number of difficulties due to the complex polygenic and heterogeneous nature of the disease. Currently, the predictive power of neural network models is insufficient for their incorporation into modern osteoporosis diagnostic protocols. Studies in this area are sporadic, but are widely demanded, as their results are of great importance for preventive medicine. This leads to the need to search for the most effective machine learning approaches and optimise the selection of genetic markers as input parameters to neural network models.
to evaluate the effectiveness of machine learning and neural network analysis to develop predictive risk models for osteoporosis based on clinical predictors and genetic markers of osteoporetic fractures.
The predictive models were trained using a database of genotyping and clinical characteristics of 701 women and 501 men living in the Volga-Ural region of Russia. Anthropometric parameters, data on gender, bone mineral density level, and the results of genotyping of 152 polymorphic loci of candidate genes and replication loci of the GEFOS consortium's full genome-wide association search were included as input parameters.
It was found that the model for predicting low bone mineral density, including 6 polymorphic variants of the OPG gene (rs2073618, rs2073617, rs7844539, rs3102735, rs3134069) and 5 polymorphic variants of microRNA binding sites in the mRNA of genes involved in bone metabolism (COL11A1 - rs1031820, FGF2 - rs6854081, miR-146 - rs2910164, ZNF239 - rs10793442, SPARC - rs1054204 and VDR - rs11540149) (AUC=0.81 for men and AUC=0.82 for women).
The results confirm the promising application of machine learning to predict the risk of osteoporosis at the preclinical stage of the disease based on the analysis of clinical and genetic factors.
骨质疏松症是一种常见的与年龄相关的疾病,会导致残疾后果,因其病程漫长且隐匿,早期诊断困难,常导致骨折后才得以诊断。在这方面,人们对机器学习技术的先进发展寄予厚望,这些技术旨在早期预测骨质疏松症,包括使用包含该疾病遗传和临床预测指标信息的大数据集。然而,由于该疾病复杂的多基因和异质性,将DNA标记纳入预测模型存在诸多困难。目前,神经网络模型的预测能力不足以使其纳入现代骨质疏松症诊断方案。该领域的研究较为零散,但需求广泛,因为其结果对预防医学至关重要。这就需要寻找最有效的机器学习方法,并优化作为神经网络模型输入参数的遗传标记的选择。
评估机器学习和神经网络分析基于骨质疏松性骨折的临床预测指标和遗传标记开发骨质疏松症预测风险模型的有效性。
使用俄罗斯伏尔加-乌拉尔地区701名女性和501名男性的基因分型和临床特征数据库训练预测模型。人体测量参数、性别数据、骨密度水平以及候选基因152个多态性位点和GEFOS联盟全基因组关联搜索的复制位点的基因分型结果作为输入参数。
发现预测低骨密度的模型,包括骨保护素(OPG)基因的6个多态性变体(rs2073618、rs2073617、rs7844539、rs3102735、rs3134069)以及参与骨代谢的基因mRNA中微小RNA结合位点的5个多态性变体(COL11A1 - rs1031820、FGF2 - rs6854081、miR - 146 - rs2910164、ZNF239 - rs10793442、SPARC - rs1054204和VDR - rs11540149)(男性的曲线下面积[AUC] = 0.81,女性的AUC = 0.82)。
结果证实了基于临床和遗传因素分析,机器学习在疾病临床前期预测骨质疏松症风险方面具有广阔的应用前景。