• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于问卷预测因子的高危人群乳腺异常的集成机器学习预测方案。

An Integrated Machine Learning Scheme for Predicting Mammographic Anomalies in High-Risk Individuals Using Questionnaire-Based Predictors.

机构信息

Division of Hepatology and Gastroenterology, Department of Internal Medicine, Shin Kong Wu Ho-Su Memorial Hospital, Taipei 11101, Taiwan.

Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 24205, Taiwan.

出版信息

Int J Environ Res Public Health. 2022 Aug 8;19(15):9756. doi: 10.3390/ijerph19159756.

DOI:10.3390/ijerph19159756
PMID:35955112
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9368335/
Abstract

This study aimed to investigate the important predictors related to predicting positive mammographic findings based on questionnaire-based demographic and obstetric/gynecological parameters using the proposed integrated machine learning (ML) scheme. The scheme combines the benefits of two well-known ML algorithms, namely, least absolute shrinkage and selection operator (Lasso) logistic regression and extreme gradient boosting (XGB), to provide adequate prediction for mammographic anomalies in high-risk individuals and the identification of significant risk factors. We collected questionnaire data on 18 breast-cancer-related risk factors from women who participated in a national mammographic screening program between January 2017 and December 2020 at a single tertiary referral hospital to correlate with their mammographic findings. The acquired data were retrospectively analyzed using the proposed integrated ML scheme. Based on the data from 21,107 valid questionnaires, the results showed that the Lasso logistic regression models with variable combinations generated by XGB could provide more effective prediction results. The top five significant predictors for positive mammography results were younger age, breast self-examination, older age at first childbirth, nulliparity, and history of mammography within 2 years, suggesting a need for timely mammographic screening for women with these risk factors.

摘要

本研究旨在探讨基于问卷调查的人口统计学和妇产科参数,利用提出的集成机器学习(ML)方案,预测阳性乳房 X 光检查结果的重要预测因素。该方案结合了两种知名 ML 算法的优势,即最小绝对收缩和选择算子(Lasso)逻辑回归和极端梯度提升(XGB),为高危人群的乳房 X 光异常提供充分预测,并确定显著的风险因素。我们从 2017 年 1 月至 2020 年 12 月在一家三级转诊医院参加全国乳房 X 光筛查计划的女性中收集了 18 个与乳腺癌相关的风险因素的问卷调查数据,并与她们的乳房 X 光检查结果相关联。使用提出的集成 ML 方案对所获得的数据进行回顾性分析。基于 21107 份有效问卷的数据,结果表明,XGB 生成的具有变量组合的 Lasso 逻辑回归模型可以提供更有效的预测结果。阳性乳房 X 光检查结果的五个最重要的预测因素是年龄较小、乳房自我检查、首次分娩年龄较大、未婚和 2 年内有乳房 X 光检查史,这表明这些有风险因素的女性需要及时进行乳房 X 光筛查。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef91/9368335/dc455591859d/ijerph-19-09756-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef91/9368335/8ec4be570d75/ijerph-19-09756-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef91/9368335/f53c61b0202b/ijerph-19-09756-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef91/9368335/ff7aa53c92c9/ijerph-19-09756-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef91/9368335/c62a0baa2067/ijerph-19-09756-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef91/9368335/dc455591859d/ijerph-19-09756-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef91/9368335/8ec4be570d75/ijerph-19-09756-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef91/9368335/f53c61b0202b/ijerph-19-09756-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef91/9368335/ff7aa53c92c9/ijerph-19-09756-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef91/9368335/c62a0baa2067/ijerph-19-09756-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef91/9368335/dc455591859d/ijerph-19-09756-g005.jpg

相似文献

1
An Integrated Machine Learning Scheme for Predicting Mammographic Anomalies in High-Risk Individuals Using Questionnaire-Based Predictors.基于问卷预测因子的高危人群乳腺异常的集成机器学习预测方案。
Int J Environ Res Public Health. 2022 Aug 8;19(15):9756. doi: 10.3390/ijerph19159756.
2
Application of SHAP for Explainable Machine Learning on Age-Based Subgrouping Mammography Questionnaire Data for Positive Mammography Prediction and Risk Factor Identification.SHAP在基于年龄的乳腺钼靶问卷调查数据亚组划分中的应用,用于乳腺钼靶阳性预测和风险因素识别的可解释机器学习。
Healthcare (Basel). 2023 Jul 11;11(14):2000. doi: 10.3390/healthcare11142000.
3
Prediction of breast cancer risk using a machine learning approach embedded with a locality preserving projection algorithm.基于局部保持投影算法的机器学习方法预测乳腺癌风险。
Phys Med Biol. 2018 Jan 30;63(3):035020. doi: 10.1088/1361-6560/aaa1ca.
4
Minimization of annotation work: diagnosis of mammographic masses via active learning.最小化标注工作:通过主动学习进行乳腺肿块诊断。
Phys Med Biol. 2018 May 22;63(11):115003. doi: 10.1088/1361-6560/aac042.
5
Prediction of near-term breast cancer risk based on bilateral mammographic feature asymmetry.基于双侧乳腺影像特征不对称性预测近期乳腺癌风险。
Acad Radiol. 2013 Dec;20(12):1542-50. doi: 10.1016/j.acra.2013.08.020.
6
Using automated texture features to determine the probability for masking of a tumor on mammography, but not ultrasound.利用自动化纹理特征来确定乳腺 X 线摄影中肿瘤的遮蔽概率,但不能用于超声。
Eur J Med Res. 2017 Aug 30;22(1):30. doi: 10.1186/s40001-017-0270-0.
7
Influence of patterns of hormone replacement therapy use and mammographic density on breast cancer detection.激素替代疗法使用模式和乳腺钼靶密度对乳腺癌检测的影响。
Cancer Epidemiol Biomarkers Prev. 2006 Oct;15(10):1856-62. doi: 10.1158/1055-9965.EPI-06-0290.
8
Mammographic surveillance of asymptomatic breast cancer relatives in general practice: rate of re-attendance and GP- and patient-related barriers.全科医疗中对无症状乳腺癌亲属的乳腺钼靶筛查:复诊率以及与全科医生和患者相关的障碍
Fam Pract. 1997 Dec;14(6):450-4. doi: 10.1093/fampra/14.6.450.
9
Screening mammography: value in women 35-39 years old.乳腺钼靶筛查:对35至39岁女性的价值。
AJR Am J Roentgenol. 1993 Jul;161(1):53-6. doi: 10.2214/ajr.161.1.8517320.
10
Effect of estrogen replacement therapy on the specificity and sensitivity of screening mammography.雌激素替代疗法对乳腺钼靶筛查特异性和敏感性的影响。
J Natl Cancer Inst. 1996 May 15;88(10):643-9. doi: 10.1093/jnci/88.10.643.

引用本文的文献

1
An integrated machine learning predictive scheme for longitudinal laboratory data to evaluate the factors determining renal function changes in patients with different chronic kidney disease stages.一种用于纵向实验室数据的集成机器学习预测方案,以评估不同慢性肾病阶段患者肾功能变化的决定因素。
Front Med (Lausanne). 2023 Oct 4;10:1155426. doi: 10.3389/fmed.2023.1155426. eCollection 2023.
2
Application of SHAP for Explainable Machine Learning on Age-Based Subgrouping Mammography Questionnaire Data for Positive Mammography Prediction and Risk Factor Identification.SHAP在基于年龄的乳腺钼靶问卷调查数据亚组划分中的应用,用于乳腺钼靶阳性预测和风险因素识别的可解释机器学习。
Healthcare (Basel). 2023 Jul 11;11(14):2000. doi: 10.3390/healthcare11142000.
3

本文引用的文献

1
Comparing Multiple Linear Regression and Machine Learning in Predicting Diabetic Urine Albumin-Creatinine Ratio in a 4-Year Follow-Up Study.在一项为期4年的随访研究中比较多元线性回归和机器学习对糖尿病患者尿白蛋白-肌酐比值的预测能力
J Clin Med. 2022 Jun 24;11(13):3661. doi: 10.3390/jcm11133661.
2
Important Risk Factors in Patients with Nonvalvular Atrial Fibrillation Taking Dabigatran Using Integrated Machine Learning Scheme-A Post Hoc Analysis.使用集成机器学习方案对服用达比加群的非瓣膜性心房颤动患者的重要危险因素——一项事后分析
J Pers Med. 2022 May 6;12(5):756. doi: 10.3390/jpm12050756.
3
XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer.
Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators.用于评估影响精子数量的风险因素的机器学习预测模型:基于健康筛查指标的预测
J Clin Med. 2023 Feb 3;12(3):1220. doi: 10.3390/jcm12031220.
4
Integrated Machine Learning Decision Tree Model for Risk Evaluation in Patients with Non-Valvular Atrial Fibrillation When Taking Different Doses of Dabigatran.基于整合机器学习决策树模型评估非瓣膜性心房颤动患者应用不同剂量达比加群的风险。
Int J Environ Res Public Health. 2023 Jan 29;20(3):2359. doi: 10.3390/ijerph20032359.
基于 XGBoost 的肿瘤免疫特征基因标志物用于预测乳腺癌的转移状态。
J Transl Med. 2022 Apr 18;20(1):177. doi: 10.1186/s12967-022-03369-9.
4
Predicting the Cochlear Dead Regions Using a Machine Learning-Based Approach with Oversampling Techniques.基于机器学习和过采样技术预测耳蜗死区。
Medicina (Kaunas). 2021 Nov 2;57(11):1192. doi: 10.3390/medicina57111192.
5
Health Data-Driven Machine Learning Algorithms Applied to Risk Indicators Assessment for Chronic Kidney Disease.应用于慢性肾脏病风险指标评估的健康数据驱动型机器学习算法
Risk Manag Healthc Policy. 2021 Oct 27;14:4401-4412. doi: 10.2147/RMHP.S319405. eCollection 2021.
6
A novel gene expression test method of minimizing breast cancer risk in reduced cost and time by improving SVM-RFE gene selection method combined with LASSO.一种新的基因表达测试方法,通过改进 SVM-RFE 基因选择方法并结合 LASSO,以降低成本和时间的方式最小化乳腺癌风险。
J Integr Bioinform. 2020 Dec 29;18(2):139-153. doi: 10.1515/jib-2019-0110.
7
Using machine learning approaches for multi-omics data analysis: A review.使用机器学习方法进行多组学数据分析:综述
Biotechnol Adv. 2021 Jul-Aug;49:107739. doi: 10.1016/j.biotechadv.2021.107739. Epub 2021 Mar 29.
8
Identifying DNA-binding proteins based on multi-features and LASSO feature selection.基于多特征和 LASSO 特征选择鉴定 DNA 结合蛋白。
Biopolymers. 2021 Feb;112(2):e23419. doi: 10.1002/bip.23419. Epub 2021 Jan 21.
9
Evaluating the Effect of Topical Atropine Use for Myopia Control on Intraocular Pressure by Using Machine Learning.运用机器学习评估局部使用阿托品控制近视对眼压的影响。
J Clin Med. 2020 Dec 30;10(1):111. doi: 10.3390/jcm10010111.
10
XGBoost-Based Framework for Smoking-Induced Noncommunicable Disease Prediction.基于 XGBoost 的吸烟相关非传染性疾病预测框架。
Int J Environ Res Public Health. 2020 Sep 7;17(18):6513. doi: 10.3390/ijerph17186513.