Comprehensive discovery of subsample gene expression components by information explanation: therapeutic implications in cancer.

Author information

Pepke Shirley, Ver Steeg Greg

Affiliations

Lyrid LLC, South Pasadena, USA.

Information Sciences Institute, University of Southern California, Marina Del Rey, USA.

Publication information

BMC Med Genomics. 2017 Mar 15;10(1):12. doi: 10.1186/s12920-017-0245-6.

DOI:10.1186/s12920-017-0245-6

PMID:28292312

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5351169/

Abstract

BACKGROUND

De novo inference of clinically relevant gene function relationships from tumor RNA-seq remains a challenging task. Current methods typically either partition patient samples into a few subtypes or rely upon analysis of pairwise gene correlations that will miss some groups in noisy data. Leveraging higher dimensional information can be expected to increase the power to discern targetable pathways, but this is commonly thought to be an intractable computational problem.

METHODS

In this work we adapt a recently developed machine learning algorithm for sensitive detection of complex gene relationships. The algorithm, CorEx, efficiently optimizes over multivariate mutual information and can be iteratively applied to generate a hierarchy of relatively independent latent factors. The learned latent factors are used to stratify patients for survival analysis with respect to both single factors and combinations. These analyses are performed and interpreted in the context of biological function annotations and protein network interactions that might be utilized to match patients to multiple therapies.
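To make "multivariate mutual information" concrete: the quantity is commonly called total correlation, TC(X) = sum_i H(X_i) - H(X), and a latent factor Y is informative to the extent that conditioning on it removes that dependence, TC(X;Y) = TC(X) - TC(X|Y). The sketch below is not the CorEx implementation and is not taken from the paper. It only evaluates these quantities on a hypothetical toy data set of three binary "genes", and every function and variable name is chosen here for illustration.

```python
from collections import Counter
from math import log2


def entropy(rows):
    """Shannon entropy in bits of the empirical distribution over hashable rows."""
    n = len(rows)
    counts = Counter(rows)
    return -sum((c / n) * log2(c / n) for c in counts.values())


def total_correlation(samples):
    """TC(X) = sum of marginal entropies minus the joint entropy."""
    n_vars = len(samples[0])
    marginals = sum(entropy([s[i] for s in samples]) for i in range(n_vars))
    joint = entropy([tuple(s) for s in samples])
    return marginals - joint


def conditional_total_correlation(samples, labels):
    """TC(X | Y): TC inside each value of Y, weighted by that value's frequency."""
    n = len(samples)
    groups = {}
    for s, y in zip(samples, labels):
        groups.setdefault(y, []).append(s)
    return sum(len(g) / n * total_correlation(g) for g in groups.values())


# Hypothetical toy data: three binary "genes" that all copy one hidden factor.
hidden = [0, 1] * 4
coupled = [(y, y, y) for y in hidden]

tc = total_correlation(coupled)                          # dependence among genes
tc_given_y = conditional_total_correlation(coupled, hidden)
explained = tc - tc_given_y                              # TC(X; Y)

# Contrast: all 8 combinations of three bits, i.e. independent genes.
independent = [(a, b, c) for a in (0, 1) for b in (0, 1) for c in (0, 1)]
tc_independent = total_correlation(independent)
```

In this toy case the hidden factor accounts for all of the dependence among the three genes, while the independent set has none to explain. CorEx itself learns the latent factors rather than being given them, and does so without enumerating the joint distribution as this sketch does.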

RESULTS

Analysis of ovarian tumor RNA-seq samples demonstrates the algorithm's power to infer well over one hundred biologically interpretable gene cohorts, several times more than standard methods such as hierarchical clustering and k-means. The CorEx factor hierarchy is also informative, with related but distinct gene clusters grouped by upper nodes. Some latent factors correlate with patient survival, including one for a pathway connected with the epithelial-mesenchymal transition in breast cancer that is regulated by a microRNA that modulates epigenetics. Further, combinations of factors lead to a synergistic survival advantage in some cases.
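A survival association of this kind is often examined by comparing Kaplan-Meier curves between patient strata. The following is a minimal sketch of that estimator, assuming each patient has already been assigned a discrete state for one latent factor. The follow-up times, event flags, and state labels are invented for illustration, and the abstract does not say which survival statistics the authors actually used.

```python
from collections import defaultdict


def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time with at least one event.

    times  -- follow-up time per patient
    events -- 1 if the event (death) was observed, 0 if censored
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all patients sharing the same follow-up time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            surv *= 1 - deaths / at_risk
            curve.append((t, surv))
        at_risk -= removed
    return curve


def curves_by_factor_state(times, events, states):
    """Split patients by latent-factor state and estimate one curve per state."""
    groups = defaultdict(lambda: ([], []))
    for t, e, s in zip(times, events, states):
        groups[s][0].append(t)
        groups[s][1].append(e)
    return {s: kaplan_meier(ts, es) for s, (ts, es) in groups.items()}


# Hypothetical cohort: months of follow-up, event flag, factor state.
times = [5, 8, 12, 12, 20, 30]
events = [1, 0, 1, 1, 0, 0]
states = ["low", "low", "low", "high", "high", "high"]

curves = curves_by_factor_state(times, events, states)
```

A real analysis would add a significance test between strata and a correction for testing many factors. This sketch covers only the curve estimate.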

CONCLUSIONS

In contrast to studies that attempt to partition patients into a small number of subtypes (typically 4 or fewer) for treatment purposes, our approach utilizes subgroup information for combinatoric transcriptional phenotyping. Considering only the 66 gene expression groups that both have significant Gene Ontology enrichment and are small enough to indicate specific drug targets implies a computational phenotype for ovarian cancer that allows for 3^66 possible patient profiles, enabling truly personalized treatment. The findings here demonstrate a new technique that sheds light on the complexity of gene expression dependencies in tumors and could eventually enable the use of patient RNA-seq profiles for selection of personalized and effective cancer treatments.
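The size of this combinatoric phenotype space is simple arithmetic. Assuming each of the 66 groups is scored independently at one of three levels (the level names below are hypothetical), the number of distinct profiles is 3 raised to the 66th power, roughly 3 x 10^31. A short sketch, with illustrative function names:

```python
from itertools import product


def count_profiles(n_groups, n_states=3):
    """Number of distinct profiles when each group takes one of n_states levels."""
    return n_states ** n_groups


def enumerate_profiles(n_groups, states=("low", "mid", "high")):
    """List every profile explicitly; only feasible for a handful of groups."""
    return list(product(states, repeat=n_groups))


small = enumerate_profiles(2)   # all profiles over just two groups
total = count_profiles(66)      # exact integer; Python ints do not overflow
```

Enumerating is only possible for a few groups. For 66 groups the count alone is a 32-digit integer, which is why such profiles would be compared patient by patient rather than tabulated as subtypes.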





