• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用尿液生物标志物和随机森林监督分类增强乳腺癌筛查:一项全面的研究。

Enhancing breast cancer screening with urinary biomarkers and Random Forest supervised classification: A comprehensive investigation.

机构信息

Department of Chemistry, University of Turin, Italy; Centro Regionale Antidoping, Orbassano, TO, Italy.

Centro Regionale Antidoping, Orbassano, TO, Italy.

出版信息

J Pharm Biomed Anal. 2024 Jul 15;244:116113. doi: 10.1016/j.jpba.2024.116113. Epub 2024 Mar 20.

DOI:10.1016/j.jpba.2024.116113
PMID:38554554
Abstract

OBJECTIVES

Urinary sex hormones are investigated as potential biomarkers for the early detection of breast cancer, aiming to evaluate their relevance and applicability, in combination with supervised machine-learning data analysis, toward the ultimate goal of extensive screening.

METHODS

Sex hormones were determined on urine samples collected from 250 post-menopausal women (65 healthy - 185 with breast cancer, recruited among the clinical patients of Candiolo Cancer Institute FPO-IRCCS (Torino, Italy). Two analytical procedures based on UHPLC-MS/HRMS were developed and comprehensively validated to quantify 20 free and conjugated sex hormones from urine samples. The quantitative data were processed by seven machine learning algorithms. The efficiency of the resulting models was compared.

RESULTS

Among the tested models aimed to relate urinary estrogen and androgen levels and the occurrence of breast cancer, Random Forest (RF) proved to underscore all the other supervised classification approaches, including Partial Least Squares - Discriminant Analysis (PLS-DA), in terms of effectiveness and robustness. The final optimized model built on only five biomarkers (testosterone-sulphate, alpha-estradiol, 4-methoxyestradiol, DHEA-sulphate, and epitestosterone-sulphate) achieved an approximate 98% diagnostic accuracy on replicated validation sets. To balance the less-represented population of healthy women, a Synthetic Minority Oversampling TEchnique (SMOTE) data oversampling approach was applied.

CONCLUSIONS

By means of tunable hyperparameters optimization, the RF algorithm showed great potential for early breast cancer detection, as it provides clear biomarkers ranking and their relative efficiency, allowing to ground the final diagnostic model on a restricted selection five steroid biomarkers only, as desirable for noninvasive tests with wide screening purposes.

摘要

目的

研究尿性激素作为乳腺癌早期检测的潜在生物标志物,旨在评估其相关性和适用性,结合有监督的机器学习数据分析,最终实现广泛筛查的目标。

方法

从 250 名绝经后妇女(65 名健康女性-185 名乳腺癌患者)的尿液样本中测定性激素,这些患者均为意大利都灵坎迪奥洛癌症研究所 FPO-IRCCS(Candiolo Cancer Institute FPO-IRCCS)的临床患者。开发并全面验证了两种基于 UHPLC-MS/HRMS 的分析程序,以定量尿液样本中的 20 种游离和结合性激素。定量数据由七种机器学习算法进行处理。比较了得到的模型的效率。

结果

在所测试的旨在将尿雌激素和雄激素水平与乳腺癌发生相关联的模型中,随机森林(RF)被证明在有效性和稳健性方面优于所有其他有监督分类方法,包括偏最小二乘判别分析(PLS-DA)。最终基于仅 5 种生物标志物(硫酸睾酮、α-雌二醇、4-甲氧基雌二醇、硫酸去氢表雄酮和硫酸表雄酮)构建的优化模型在复制验证集上达到了约 98%的诊断准确性。为了平衡健康女性人数较少的情况,应用了一种合成少数过采样技术(SMOTE)数据过采样方法。

结论

通过可调超参数优化,RF 算法显示出用于早期乳腺癌检测的巨大潜力,因为它提供了明确的生物标志物排名及其相对效率,允许将最终诊断模型建立在仅 5 种类固醇生物标志物的受限选择上,这对于具有广泛筛查目的的非侵入性测试是理想的。

相似文献

1
Enhancing breast cancer screening with urinary biomarkers and Random Forest supervised classification: A comprehensive investigation.利用尿液生物标志物和随机森林监督分类增强乳腺癌筛查:一项全面的研究。
J Pharm Biomed Anal. 2024 Jul 15;244:116113. doi: 10.1016/j.jpba.2024.116113. Epub 2024 Mar 20.
2
Development of a headspace-solid phase microextraction gas chromatography-high resolution mass spectrometry method for analyzing volatile organic compounds in urine: Application in breast cancer biomarker discovery.建立顶空固相微萃取-气相色谱-高分辨质谱法分析尿液中挥发性有机化合物的方法:在乳腺癌生物标志物发现中的应用。
Clin Chim Acta. 2023 Feb 1;540:117236. doi: 10.1016/j.cca.2023.117236. Epub 2023 Jan 27.
3
Detection of breast cancer of various clinical stages based on serum FT-IR spectroscopy combined with multiple algorithms.基于血清傅里叶变换红外光谱结合多种算法检测不同临床分期的乳腺癌
Photodiagnosis Photodyn Ther. 2021 Mar;33:102199. doi: 10.1016/j.pdpdt.2021.102199. Epub 2021 Jan 27.
4
Proteomic Analysis of Urine to Identify Breast Cancer Biomarker Candidates Using a Label-Free LC-MS/MS Approach.采用无标记液相色谱-串联质谱法对尿液进行蛋白质组学分析以鉴定乳腺癌生物标志物候选物
PLoS One. 2015 Nov 6;10(11):e0141876. doi: 10.1371/journal.pone.0141876. eCollection 2015.
5
Metabolomic biomarkers in cervicovaginal fluid for detecting endometrial cancer through nuclear magnetic resonance spectroscopy.基于磁共振波谱技术的宫颈阴道分泌物代谢组学生物标志物检测子宫内膜癌
Metabolomics. 2019 Oct 29;15(11):146. doi: 10.1007/s11306-019-1609-z.
6
Cyp17, urinary sex steroid levels and breast cancer risk in postmenopausal women.绝经后女性中细胞色素P450 17α酶、尿中性甾体激素水平与乳腺癌风险
Cancer Epidemiol Biomarkers Prev. 2005 Apr;14(4):815-20. doi: 10.1158/1055-9965.EPI-04-0197.
7
Targeted LC-MS/MS analysis of steroid glucuronides in human urine.靶向液相色谱-串联质谱法分析人尿液中的甾体类葡萄糖醛酸苷。
J Steroid Biochem Mol Biol. 2021 Jan;205:105774. doi: 10.1016/j.jsbmb.2020.105774. Epub 2020 Oct 22.
8
Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods.基于转录组谱特征选择和机器学习方法的乳腺癌预测。
BMC Bioinformatics. 2022 Oct 1;23(1):410. doi: 10.1186/s12859-022-04965-8.
9
Biomarker profiling and integrating heterogeneous models for enhanced multi-grade breast cancer prognostication.基于生物标志物特征分析的多等级乳腺癌预后增强的异质模型整合。
Comput Methods Programs Biomed. 2024 Oct;255:108349. doi: 10.1016/j.cmpb.2024.108349. Epub 2024 Jul 22.
10
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.