Saqib Kiran, Khan Amber Fozia, Butt Zahid Ahmad
School of Public Health Sciences, University of Waterloo, Waterloo, ON, Canada.
JMIR Ment Health. 2021 Nov 24;8(11):e29838. doi: 10.2196/29838.
Machine learning (ML) offers vigorous statistical and probabilistic techniques that can successfully predict certain clinical conditions using large volumes of data. A review of ML and big data research analytics in maternal depression is pertinent and timely, given the rapid technological developments in recent years.
This study aims to synthesize the literature on ML and big data analytics for maternal mental health, particularly the prediction of postpartum depression (PPD).
We used a scoping review methodology using the Arksey and O'Malley framework to rapidly map research activity in ML for predicting PPD. Two independent researchers searched PsycINFO, PubMed, IEEE Xplore, and the ACM Digital Library in September 2020 to identify relevant publications in the past 12 years. Data were extracted from the articles' ML model, data type, and study results.
A total of 14 studies were identified. All studies reported the use of supervised learning techniques to predict PPD. Support vector machine and random forest were the most commonly used algorithms in addition to Naive Bayes, regression, artificial neural network, decision trees, and XGBoost (Extreme Gradient Boosting). There was considerable heterogeneity in the best-performing ML algorithm across the selected studies. The area under the receiver operating characteristic curve values reported for different algorithms were support vector machine (range 0.78-0.86), random forest method (0.88), XGBoost (0.80), and logistic regression (0.93).
ML algorithms can analyze larger data sets and perform more advanced computations, which can significantly improve the detection of PPD at an early stage. Further clinical research collaborations are required to fine-tune ML algorithms for prediction and treatment. ML might become part of evidence-based practice in addition to clinical knowledge and existing research evidence.
机器学习(ML)提供了强大的统计和概率技术,能够利用大量数据成功预测某些临床状况。鉴于近年来技术的快速发展,对机器学习和大数据研究分析在孕产妇抑郁症方面的综述是相关且及时的。
本研究旨在综合关于机器学习和大数据分析在孕产妇心理健康方面的文献,特别是产后抑郁症(PPD)的预测。
我们采用了一种基于Arksey和O'Malley框架的范围综述方法,以快速梳理机器学习在预测产后抑郁症方面的研究活动。两名独立研究人员于2020年9月在PsycINFO、PubMed、IEEE Xplore和ACM数字图书馆中进行检索,以识别过去12年中的相关出版物。数据从文章的机器学习模型、数据类型和研究结果中提取。
共识别出14项研究。所有研究均报告使用监督学习技术来预测产后抑郁症。除朴素贝叶斯、回归、人工神经网络、决策树和XGBoost(极端梯度提升)外,支持向量机和随机森林是最常用的算法。在所选定的研究中,表现最佳的机器学习算法存在相当大的异质性。不同算法报告的受试者工作特征曲线下面积值分别为:支持向量机(范围0.78 - 0.86)、随机森林方法(0.88)、XGBoost(0.80)和逻辑回归(0.93)。
机器学习算法可以分析更大的数据集并进行更高级的计算,这可以显著改善产后抑郁症的早期检测。需要进一步开展临床研究合作,以微调用于预测和治疗的机器学习算法。除了临床知识和现有研究证据外,机器学习可能会成为循证实践的一部分。