与化疗治疗临床结局相关的癌症基因表达谱。

Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments.

机构信息

Department of Bioinformatics and Molecular Networks, OmicsWay Corporation, Walnut, CA, 91788, USA.

Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Oblast, 141701, Russia.

出版信息

BMC Med Genomics. 2020 Sep 18;13(Suppl 8):111. doi: 10.1186/s12920-020-00759-0.

DOI:10.1186/s12920-020-00759-0

PMID:32948183

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7499993/

Abstract

BACKGROUND

Machine learning (ML) methods still have limited applicability in personalized oncology due to low numbers of available clinically annotated molecular profiles. This doesn't allow sufficient training of ML classifiers that could be used for improving molecular diagnostics.

METHODS

We reviewed published datasets of high throughput gene expression profiles corresponding to cancer patients with known responses on chemotherapy treatments. We browsed Gene Expression Omnibus (GEO), The Cancer Genome Atlas (TCGA) and Tumor Alterations Relevant for GEnomics-driven Therapy (TARGET) repositories.

RESULTS

We identified data collections suitable to build ML models for predicting responses on certain chemotherapeutic schemes. We identified 26 datasets, ranging from 41 till 508 cases per dataset. All the datasets identified were checked for ML applicability and robustness with leave-one-out cross validation. Twenty-three datasets were found suitable for using ML that had balanced numbers of treatment responder and non-responder cases.

CONCLUSIONS

We collected a database of gene expression profiles associated with clinical responses on chemotherapy for 2786 individual cancer cases. Among them seven datasets included RNA sequencing data (for 645 cases) and the others - microarray expression profiles. The cases represented breast cancer, lung cancer, low-grade glioma, endothelial carcinoma, multiple myeloma, adult leukemia, pediatric leukemia and kidney tumors. Chemotherapeutics included taxanes, bortezomib, vincristine, trastuzumab, letrozole, tipifarnib, temozolomide, busulfan and cyclophosphamide.

摘要

背景

由于可供临床注释的分子谱数量有限，机器学习 (ML) 方法在个性化肿瘤学中的应用仍然有限。这使得用于改善分子诊断的 ML 分类器无法进行充分的训练。

方法

我们回顾了发表的高通量基因表达谱数据集，这些数据集对应于已知对化疗治疗有反应的癌症患者。我们浏览了基因表达综合数据库（GEO）、癌症基因组图谱（TCGA）和肿瘤改变相关基因组驱动治疗（TARGET）数据库。

结果

我们确定了适合构建用于预测特定化疗方案反应的 ML 模型的数据集合。我们确定了 26 个数据集，每个数据集的病例数从 41 到 508 不等。所有确定的数据集都经过了 ML 适用性和稳健性的检查，采用了留一法交叉验证。发现 23 个数据集适合使用 ML，这些数据集具有平衡的治疗反应者和非反应者病例数。

结论

我们收集了一个与 2786 个个体癌症病例的化疗临床反应相关的基因表达谱数据库。其中 7 个数据集包含 RNA 测序数据（用于 645 个病例），其余数据集为微阵列表达谱。这些病例代表乳腺癌、肺癌、低级别胶质瘤、内皮癌、多发性骨髓瘤、成人白血病、儿科白血病和肾肿瘤。化疗药物包括紫杉醇、硼替佐米、长春新碱、曲妥珠单抗、来曲唑、替西罗莫司、替莫唑胺、白消安和环磷酰胺。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c56/7499993/5f0fedcc8b23/12920_2020_759_Fig1_HTML.jpg

相似文献

Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments.与化疗治疗临床结局相关的癌症基因表达谱。

BMC Med Genomics. 2020 Sep 18;13(Suppl 8):111. doi: 10.1186/s12920-020-00759-0.

SNRFCB: sub-network based random forest classifier for predicting chemotherapy benefit on survival for cancer treatment.SNRFCB：基于子网络的随机森林分类器，用于预测癌症治疗中化疗对生存的益处。

Mol Biosyst. 2016 Apr;12(4):1214-23. doi: 10.1039/c5mb00399g. Epub 2016 Feb 11.

Flexible Data Trimming Improves Performance of Global Machine Learning Methods in Omics-Based Personalized Oncology.灵活的数据修剪可提高基于组学的个体化肿瘤学中全局机器学习方法的性能。

Int J Mol Sci. 2020 Jan 22;21(3):713. doi: 10.3390/ijms21030713.

Machine Learning Applicability for Classification of PAD/VCD Chemotherapy Response Using 53 Multiple Myeloma RNA Sequencing Profiles.利用53份多发性骨髓瘤RNA测序图谱，机器学习在PAD/VCD化疗反应分类中的适用性

Front Oncol. 2021 Apr 15;11:652063. doi: 10.3389/fonc.2021.652063. eCollection 2021.

The impact of pharmacokinetic gene profiles across human cancers.人类癌症中药物代谢动力学基因谱的影响。

BMC Cancer. 2018 May 21;18(1):577. doi: 10.1186/s12885-018-4345-2.

Integrated pan-cancer gene expression and drug sensitivity analysis reveals SLFN11 mRNA as a solid tumor biomarker predictive of sensitivity to DNA-damaging chemotherapy.整合泛癌基因表达和药物敏感性分析揭示 SLFN11 mRNA 作为一种实体瘤生物标志物，可预测对 DNA 损伤化疗的敏感性。

PLoS One. 2019 Nov 4;14(11):e0224267. doi: 10.1371/journal.pone.0224267. eCollection 2019.

Molecular pathway activation - New type of biomarkers for tumor morphology and personalized selection of target drugs.分子通路激活——肿瘤形态学新型生物标志物和靶向药物个体化选择。

Semin Cancer Biol. 2018 Dec;53:110-124. doi: 10.1016/j.semcancer.2018.06.003. Epub 2018 Jun 20.

Machine learning with the TCGA-HNSC dataset: improving usability by addressing inconsistency, sparsity, and high-dimensionality.使用 TCGA-HNSC 数据集进行机器学习：通过解决不一致性、稀疏性和高维性来提高可用性。

BMC Bioinformatics. 2019 Jun 17;20(1):339. doi: 10.1186/s12859-019-2929-8.

Molecularly targeted therapy based on tumour molecular profiling versus conventional therapy for advanced cancer (SHIVA): a multicentre, open-label, proof-of-concept, randomised, controlled phase 2 trial.基于肿瘤分子谱的分子靶向治疗与晚期癌症的常规治疗（SHIVA）：一项多中心、开放标签、概念验证、随机、对照的 2 期临床试验。

Lancet Oncol. 2015 Oct;16(13):1324-34. doi: 10.1016/S1470-2045(15)00188-6. Epub 2015 Sep 3.

PreMSIm: An R package for predicting microsatellite instability from the expression profiling of a gene panel in cancer.PreMSIm：一个用于通过癌症中基因面板的表达谱预测微卫星不稳定性的R包。

Comput Struct Biotechnol J. 2020 Mar 19;18:668-675. doi: 10.1016/j.csbj.2020.03.007. eCollection 2020.

引用本文的文献

Gene expression and agent-based modeling improve precision prognosis in breast cancer.基因表达与基于主体的建模改善乳腺癌的精准预后。

Sci Rep. 2025 May 16;15(1):17059. doi: 10.1038/s41598-025-01275-w.

Activation of the ERK1/2 Molecular Pathways and Its Relation to the Pathogenicity of Human Malignant Tumors.ERK1/2分子通路的激活及其与人类恶性肿瘤致病性的关系。

Acta Naturae. 2025 Jan-Mar;17(1):36-51. doi: 10.32607/actanaturae.27497.

Artificial intelligence in lung cancer: current applications, future perspectives, and challenges.人工智能在肺癌中的应用：当前应用、未来展望及挑战

Front Oncol. 2024 Dec 23;14:1486310. doi: 10.3389/fonc.2024.1486310. eCollection 2024.

Uniformly shaped harmonization combines human transcriptomic data from different platforms while retaining their biological properties and differential gene expression patterns.形状一致的归一化整合了来自不同平台的人类转录组数据，同时保留其生物学特性和差异基因表达模式。

Front Mol Biosci. 2023 Sep 6;10:1237129. doi: 10.3389/fmolb.2023.1237129. eCollection 2023.

Reclassification of TCGA Diffuse Glioma Profiles Linked to Transcriptomic, Epigenetic, Genomic and Clinical Data, According to the 2021 WHO CNS Tumor Classification.根据 2021 年 WHO CNS 肿瘤分类，对与转录组、表观遗传学、基因组和临床数据相关的 TCGA 弥漫性神经胶质瘤图谱进行重新分类。

Int J Mol Sci. 2022 Dec 21;24(1):157. doi: 10.3390/ijms24010157.

Transcriptomic Harmonization as the Way for Suppressing Cross-Platform Bias and Batch Effect.转录组协调作为抑制跨平台偏差和批次效应的方法

Biomedicines. 2022 Sep 18;10(9):2318. doi: 10.3390/biomedicines10092318.

Transcriptomic Portraits and Molecular Pathway Activation Features of Adult Spinal Intramedullary Astrocytomas.成人脊髓髓内星形细胞瘤的转录组图谱及分子通路激活特征

Front Oncol. 2022 Mar 21;12:837570. doi: 10.3389/fonc.2022.837570. eCollection 2022.

Gene Expression-Based Signature Can Predict Sorafenib Response in Kidney Cancer.基于基因表达的特征可预测肾癌对索拉非尼的反应。

Front Mol Biosci. 2022 Mar 14;9:753318. doi: 10.3389/fmolb.2022.753318. eCollection 2022.

Experimental and Meta-Analytic Validation of RNA Sequencing Signatures for Predicting Status of Microsatellite Instability.用于预测微卫星不稳定性状态的RNA测序特征的实验及荟萃分析验证

Front Mol Biosci. 2021 Nov 23;8:737821. doi: 10.3389/fmolb.2021.737821. eCollection 2021.

RNA Sequencing Data for FFPE Tumor Blocks Can Be Used for Robust Estimation of Tumor Mutation Burden in Individual Biosamples.福尔马林固定石蜡包埋肿瘤组织块的RNA测序数据可用于可靠估计个体生物样本中的肿瘤突变负荷。

Front Oncol. 2021 Sep 28;11:732644. doi: 10.3389/fonc.2021.732644. eCollection 2021.

本文引用的文献

Int J Mol Sci. 2020 Jan 22;21(3):713. doi: 10.3390/ijms21030713.

RNA sequencing for research and diagnostics in clinical oncology.临床肿瘤学中的研究和诊断用 RNA 测序。

Semin Cancer Biol. 2020 Feb;60:311-323. doi: 10.1016/j.semcancer.2019.07.010. Epub 2019 Aug 11.

New Paradigm of Machine Learning (ML) in Personalized Oncology: Data Trimming for Squeezing More Biomarkers From Clinical Datasets.个性化肿瘤学中机器学习（ML）的新范式：通过数据修剪从临床数据集中挖掘更多生物标志物

Front Oncol. 2019 Jul 17;9:658. doi: 10.3389/fonc.2019.00658. eCollection 2019.

Genomic and transcriptomic profiling expands precision cancer medicine: the WINTHER trial.基因组和转录组谱分析拓展精准肿瘤医学：WINTHER 试验。

Nat Med. 2019 May;25(5):751-758. doi: 10.1038/s41591-019-0424-4. Epub 2019 Apr 22.

Theory of Magnetic Domain Phases in Ferromagnetic Superconductors.铁磁超导体中的磁畴相理论。

Phys Rev Lett. 2019 Mar 22;122(11):117002. doi: 10.1103/PhysRevLett.122.117002.

Clinical intelligence: New machine learning techniques for predicting clinical drug response.临床智能：预测临床药物反应的新机器学习技术。

Comput Biol Med. 2019 Apr;107:302-322. doi: 10.1016/j.compbiomed.2018.12.017. Epub 2019 Jan 3.

Pathway Based Analysis of Mutation Data Is Efficient for Scoring Target Cancer Drugs.基于通路的突变数据分析对癌症靶向药物评分很有效。

Front Pharmacol. 2019 Jan 23;10:1. doi: 10.3389/fphar.2019.00001. eCollection 2019.

Shambhala: a platform-agnostic data harmonizer for gene expression data.香巴拉：一个用于基因表达数据的数据协调器，与平台无关。

BMC Bioinformatics. 2019 Feb 6;20(1):66. doi: 10.1186/s12859-019-2641-8.

FLOating-Window Projective Separator (FloWPS): A Data Trimming Tool for Support Vector Machines (SVM) to Improve Robustness of the Classifier.浮动窗口投影分离器（FloWPS）：一种用于支持向量机（SVM）的数据修剪工具，以提高分类器的鲁棒性。

Front Genet. 2019 Jan 15;9:717. doi: 10.3389/fgene.2018.00717. eCollection 2018.

Pathway Instability Is an Effective New Mutation-Based Type of Cancer Biomarkers.通路不稳定性是一种基于新突变的有效的癌症生物标志物类型。

Front Oncol. 2019 Jan 4;8:658. doi: 10.3389/fonc.2018.00658. eCollection 2018.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

与化疗治疗临床结局相关的癌症基因表达谱。

Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献