Sultan Laith R, Cary Theodore W, Al-Hasani Maryam, Karmacharya Mrigendra B, Venkatesh Santosh S, Assenmacher Charles-Antoine, Radaelli Enrico, Sehgal Chandra M
Ultrasound Research Laboratory, Department of Radiology, University of Pennsylvania, Philadelphia, PA 19104, USA.
Department of Radiology, Children Hospital of Philadelphia, Philadelphia, PA 19104, USA.
AI (Basel). 2022 Sep;3(3):739-750. doi: 10.3390/ai3030043. Epub 2022 Sep 1.
Machine learning for medical imaging not only requires sufficient amounts of data for training and testing but also that the data be independent. It is common to see highly interdependent data whenever there are inherent correlations between observations. This is especially to be expected for sequential imaging data taken from time series. In this study, we evaluate the use of statistical measures to test the independence of sequential ultrasound image data taken from the same case. A total of 1180 B-mode liver ultrasound images with 5903 regions of interests were analyzed. The ultrasound images were taken from two liver disease groups, fibrosis and steatosis, as well as normal cases. Computer-extracted texture features were then used to train a machine learning (ML) model for computer-aided diagnosis. The experiment resulted in high two-category diagnosis using logistic regression, with AUC of 0.928 and high performance of multicategory classification, using random forest ML, with AUC of 0.917. To evaluate the image region independence for machine learning, Jenson-Shannon (JS) divergence was used. JS distributions showed that images of normal liver were independent from each other, while the images from the two disease pathologies were not independent. To guarantee the generalizability of machine learning models, and to prevent data leakage, multiple frames of image data acquired of the same object should be tested for independence before machine learning. Such tests can be applied to real-world medical image problems to determine if images from the same subject can be used for training.
用于医学成像的机器学习不仅需要足够数量的数据进行训练和测试,还要求数据具有独立性。每当观测值之间存在内在相关性时,就会经常看到高度相互依赖的数据。对于从时间序列中获取的连续成像数据而言,尤其如此。在本研究中,我们评估了使用统计量来检验从同一病例获取的连续超声图像数据的独立性。总共分析了1180幅B型肝脏超声图像,其中包含5903个感兴趣区域。这些超声图像取自两个肝脏疾病组,即纤维化和脂肪变性组,以及正常病例组。然后使用计算机提取的纹理特征来训练用于计算机辅助诊断的机器学习(ML)模型。实验结果表明,使用逻辑回归进行二分类诊断的性能较高,曲线下面积(AUC)为0.928;使用随机森林ML进行多分类诊断的性能也较高,AUC为0.917。为了评估机器学习中图像区域的独立性,使用了詹森 - 香农(JS)散度。JS分布表明,正常肝脏的图像相互独立,而来自两种疾病病理的图像则不独立。为了保证机器学习模型的通用性,并防止数据泄露,在进行机器学习之前,应对从同一对象获取的多帧图像数据进行独立性测试。此类测试可应用于实际的医学图像问题,以确定来自同一受试者的图像是否可用于训练。