School of Control Science and Engineering, Shandong University, 17923 Jingshi Road, Jinan, Shandong, People's Republic of China.
Department of Orthopedics, Qilu Hospital of Shandong University, Jinan, Shandong, People's Republic of China.
BMC Bioinformatics. 2022 Feb 10;23(1):63. doi: 10.1186/s12859-022-04596-z.
Osteoporosis is a common metabolic skeletal disease and usually lacks obvious symptoms. Many individuals are not diagnosed until osteoporotic fractures occur. Bone mineral density (BMD) measured by dual-energy X-ray absorptiometry (DXA) is the gold standard for osteoporosis detection. However, only a limited percentage of people with osteoporosis risks undergo the DXA test. As a result, it is vital to develop methods to identify individuals at-risk based on methods other than DXA.
We proposed a hierarchical model with three layers to detect osteoporosis using clinical data (including demographic characteristics and routine laboratory tests data) and CT images covering lumbar vertebral bodies rather than DXA data via machine learning. 2210 individuals over age 40 were collected retrospectively, among which 246 individuals' clinical data and CT images are both available. Irrelevant and redundant features were removed via statistical analysis. Consequently, 28 features, including 16 clinical data and 12 texture features demonstrated statistically significant differences (p < 0.05) between osteoporosis and normal groups. Six machine learning algorithms including logistic regression (LR), support vector machine with radial-basis function kernel, artificial neural network, random forests, eXtreme Gradient Boosting and Stacking that combined the above five classifiers were employed as classifiers to assess the performances of the model. Furthermore, to diminish the influence of data partitioning, the dataset was randomly split into training and test set with stratified sampling repeated five times. The results demonstrated that the hierarchical model based on LR showed better performances with an area under the receiver operating characteristic curve of 0.818, 0.838, and 0.962 for three layers, respectively in distinguishing individuals with osteoporosis and normal BMD.
The proposed model showed great potential in opportunistic screening for osteoporosis without additional expense. It is hoped that this model could serve to detect osteoporosis as early as possible and thereby prevent serious complications of osteoporosis, such as osteoporosis fractures.
骨质疏松症是一种常见的代谢性骨骼疾病,通常缺乏明显的症状。许多人直到发生骨质疏松性骨折才被诊断出来。双能 X 射线吸收法(DXA)测量的骨密度(BMD)是骨质疏松症检测的金标准。然而,只有有限比例的骨质疏松症风险人群接受 DXA 测试。因此,开发基于 DXA 以外的方法识别高危人群的方法至关重要。
我们提出了一个具有三个层次的分层模型,通过机器学习使用临床数据(包括人口统计学特征和常规实验室检查数据)和覆盖腰椎体的 CT 图像而不是 DXA 数据来检测骨质疏松症。回顾性收集了 2210 名年龄在 40 岁以上的个体,其中 246 名个体的临床数据和 CT 图像均可用。通过统计分析去除无关和冗余特征。结果,28 个特征,包括 16 个临床数据和 12 个纹理特征,在骨质疏松症组和正常组之间表现出统计学差异(p < 0.05)。使用包括逻辑回归(LR)、具有径向基函数核的支持向量机、人工神经网络、随机森林、极端梯度提升和堆叠在内的六种机器学习算法作为分类器来评估模型的性能,这些分类器结合了上述五个分类器。此外,为了减少数据分割的影响,使用分层抽样重复五次将数据集随机分为训练集和测试集。结果表明,基于 LR 的分层模型在区分骨质疏松症和正常 BMD 的个体方面表现出较好的性能,三个层次的受试者工作特征曲线下面积分别为 0.818、0.838 和 0.962。
该模型在不增加额外费用的情况下,具有在机会性筛查中检测骨质疏松症的巨大潜力。希望该模型能够尽早检测到骨质疏松症,从而预防骨质疏松症严重并发症,如骨质疏松性骨折。