• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于乳腺癌分类的多组学主题建模

Multiomics Topic Modeling for Breast Cancer Classification.

作者信息

Valle Filippo, Osella Matteo, Caselle Michele

机构信息

Physics Department, University of Turin and INFN, via P. Giuria 1, 10125 Turin, Italy.

出版信息

Cancers (Basel). 2022 Feb 23;14(5):1150. doi: 10.3390/cancers14051150.

DOI:10.3390/cancers14051150
PMID:35267458
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8909787/
Abstract

The integration of transcriptional data with other layers of information, such as the post-transcriptional regulation mediated by microRNAs, can be crucial to identify the driver genes and the subtypes of complex and heterogeneous diseases such as cancer. This paper presents an approach based on topic modeling to accomplish this integration task. More specifically, we show how an algorithm based on a hierarchical version of stochastic block modeling can be naturally extended to integrate any combination of 'omics data. We test this approach on breast cancer samples from the TCGA database, integrating data on messenger RNA, microRNAs, and copy number variations. We show that the inclusion of the microRNA layer significantly improves the accuracy of subtype classification. Moreover, some of the hidden structures or "topics" that the algorithm extracts actually correspond to genes and microRNAs involved in breast cancer development and are associated to the survival probability.

摘要

将转录数据与其他信息层(如由微小RNA介导的转录后调控)整合,对于识别驱动基因以及复杂异质性疾病(如癌症)的亚型可能至关重要。本文提出了一种基于主题建模的方法来完成此整合任务。更具体地说,我们展示了如何自然地扩展基于随机块建模分层版本的算法,以整合任何“组学”数据组合。我们在来自TCGA数据库的乳腺癌样本上测试了这种方法,整合了信使RNA、微小RNA和拷贝数变异的数据。我们表明,纳入微小RNA层显著提高了亚型分类的准确性。此外,该算法提取的一些隐藏结构或“主题”实际上对应于参与乳腺癌发展的基因和微小RNA,并与生存概率相关。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/f0bc1cdfb74f/cancers-14-01150-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/749d3bfb51e0/cancers-14-01150-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/487803271ef4/cancers-14-01150-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/b95681cc6702/cancers-14-01150-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/90696c5ac9ae/cancers-14-01150-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/4091c8f54b63/cancers-14-01150-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/00c2f3f37924/cancers-14-01150-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/0634245d5721/cancers-14-01150-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/e3000c826fcd/cancers-14-01150-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/f8ecd0846bf5/cancers-14-01150-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/f0bc1cdfb74f/cancers-14-01150-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/749d3bfb51e0/cancers-14-01150-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/487803271ef4/cancers-14-01150-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/b95681cc6702/cancers-14-01150-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/90696c5ac9ae/cancers-14-01150-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/4091c8f54b63/cancers-14-01150-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/00c2f3f37924/cancers-14-01150-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/0634245d5721/cancers-14-01150-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/e3000c826fcd/cancers-14-01150-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/f8ecd0846bf5/cancers-14-01150-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17e8/8909787/f0bc1cdfb74f/cancers-14-01150-g010.jpg

相似文献

1
Multiomics Topic Modeling for Breast Cancer Classification.用于乳腺癌分类的多组学主题建模
Cancers (Basel). 2022 Feb 23;14(5):1150. doi: 10.3390/cancers14051150.
2
A hierarchical integration deep flexible neural forest framework for cancer subtype classification by integrating multi-omics data.一种通过整合多组学数据进行癌症亚型分类的层次化集成深度灵活神经森林框架。
BMC Bioinformatics. 2019 Oct 28;20(1):527. doi: 10.1186/s12859-019-3116-7.
3
Attention-based GCN integrates multi-omics data for breast cancer subtype classification and patient-specific gene marker identification.基于注意力的 GCN 集成了多组学数据,用于乳腺癌亚型分类和患者特异性基因标志物识别。
Brief Funct Genomics. 2023 Nov 10;22(5):463-474. doi: 10.1093/bfgp/elad013.
4
A deep learning approach based on multi-omics data integration to construct a risk stratification prediction model for skin cutaneous melanoma.基于多组学数据整合的深度学习方法构建皮肤黑色素瘤风险分层预测模型。
J Cancer Res Clin Oncol. 2023 Nov;149(17):15923-15938. doi: 10.1007/s00432-023-05358-x. Epub 2023 Sep 7.
5
Identification of Interpretable Clusters and Associated Signatures in Breast Cancer Single-Cell Data: A Topic Modeling Approach.乳腺癌单细胞数据中可解释聚类和相关特征的识别:一种主题建模方法。
Cancers (Basel). 2024 Mar 29;16(7):1350. doi: 10.3390/cancers16071350.
6
Identifying the driver miRNAs with somatic copy number alterations driving dysregulated ceRNA networks in cancers.鉴定具有体细胞拷贝数改变的驱动 miRNA,驱动癌症中失调的 ceRNA 网络。
Biol Direct. 2023 Nov 22;18(1):79. doi: 10.1186/s13062-023-00438-x.
7
Integrative clustering reveals a novel split in the luminal A subtype of breast cancer with impact on outcome.综合聚类分析揭示了乳腺癌腔面A型亚型的一种新分类,这对预后有影响。
Breast Cancer Res. 2017 Mar 29;19(1):44. doi: 10.1186/s13058-017-0812-y.
8
Deep learning based feature-level integration of multi-omics data for breast cancer patients survival analysis.基于深度学习的多组学生物标志物数据特征层融合在乳腺癌患者生存分析中的应用。
BMC Med Inform Decis Mak. 2020 Sep 15;20(1):225. doi: 10.1186/s12911-020-01225-8.
9
Identification of Novel Breast Cancer Subtype-Specific Biomarkers by Integrating Genomics Analysis of DNA Copy Number Aberrations and miRNA-mRNA Dual Expression Profiling.通过整合DNA拷贝数变异的基因组分析和miRNA-mRNA双表达谱来鉴定新型乳腺癌亚型特异性生物标志物。
Biomed Res Int. 2015;2015:746970. doi: 10.1155/2015/746970. Epub 2015 Apr 15.
10
Machine learning and multi-omics data reveal driver gene-based molecular subtypes in hepatocellular carcinoma for precision treatment.机器学习和多组学数据揭示了基于驱动基因的肝细胞癌分子亚型,以实现精准治疗。
PLoS Comput Biol. 2024 May 10;20(5):e1012113. doi: 10.1371/journal.pcbi.1012113. eCollection 2024 May.

引用本文的文献

1
Novel cancer subtyping method guided by tumor-normal sample in latent space of transcriptomic variational autoencoder.基于转录组变分自编码器潜在空间中肿瘤-正常样本引导的新型癌症亚型分类方法。
Sci Rep. 2025 Jul 21;15(1):26444. doi: 10.1038/s41598-025-07813-w.
2
Gene expression and agent-based modeling improve precision prognosis in breast cancer.基因表达与基于主体的建模改善乳腺癌的精准预后。
Sci Rep. 2025 May 16;15(1):17059. doi: 10.1038/s41598-025-01275-w.
3
Exploring the latent space of transcriptomic data with topic modeling.运用主题模型探索转录组数据的潜在空间。

本文引用的文献

1
Emergent statistical laws in single-cell transcriptomic data.单细胞转录组数据中的紧急统计规律。
Phys Rev E. 2023 Apr;107(4-1):044403. doi: 10.1103/PhysRevE.107.044403.
2
Nested Stochastic Block Models applied to the analysis of single cell data.嵌套随机块模型在单细胞数据分析中的应用。
BMC Bioinformatics. 2021 Nov 30;22(1):576. doi: 10.1186/s12859-021-04489-7.
3
Breast Cancer Consensus Subtypes: A system for subtyping breast cancer tumors based on gene expression.乳腺癌共识亚型:一种基于基因表达对乳腺癌肿瘤进行亚型分类的系统。
NAR Genom Bioinform. 2025 Apr 22;7(2):lqaf049. doi: 10.1093/nargab/lqaf049. eCollection 2025 Jun.
4
Topic modeling analysis of the Allen Human Brain Atlas.艾伦人类大脑图谱的主题建模分析
Sci Rep. 2025 Feb 26;15(1):6928. doi: 10.1038/s41598-025-91079-9.
5
Identification of Interpretable Clusters and Associated Signatures in Breast Cancer Single-Cell Data: A Topic Modeling Approach.乳腺癌单细胞数据中可解释聚类和相关特征的识别:一种主题建模方法。
Cancers (Basel). 2024 Mar 29;16(7):1350. doi: 10.3390/cancers16071350.
6
Molecular Classifiers in Skin Cancers: Challenges and Promises.皮肤癌中的分子分类器:挑战与前景
Cancers (Basel). 2023 Sep 7;15(18):4463. doi: 10.3390/cancers15184463.
7
Latent disease similarities and therapeutic repurposing possibilities uncovered by multi-modal generative topic modeling of human diseases.通过人类疾病的多模态生成主题建模发现潜在疾病相似性和治疗方法重新利用的可能性。
Bioinform Adv. 2023 Apr 12;3(1):vbad047. doi: 10.1093/bioadv/vbad047. eCollection 2023.
8
Pancancer survival prediction using a deep learning architecture with multimodal representation and integration.使用具有多模态表示和整合的深度学习架构进行泛癌生存预测。
Bioinform Adv. 2023 Jan 23;3(1):vbad006. doi: 10.1093/bioadv/vbad006. eCollection 2023.
NPJ Breast Cancer. 2021 Oct 12;7(1):136. doi: 10.1038/s41523-021-00345-2.
4
Transcriptomic and Genetic Associations between Alzheimer's Disease, Parkinson's Disease, and Cancer.阿尔茨海默病、帕金森病和癌症之间的转录组学与遗传学关联
Cancers (Basel). 2021 Jun 15;13(12):2990. doi: 10.3390/cancers13122990.
5
A Topic Modeling Analysis of TCGA Breast and Lung Cancer Transcriptomic Data.对TCGA乳腺癌和肺癌转录组数据的主题建模分析。
Cancers (Basel). 2020 Dec 16;12(12):3799. doi: 10.3390/cancers12123799.
6
Community detection in bipartite networks with stochastic block models.基于随机块模型的二分网络中的社区检测
Phys Rev E. 2020 Sep;102(3-1):032309. doi: 10.1103/PhysRevE.102.032309.
7
Merge-split Markov chain Monte Carlo for community detection.用于社区检测的合并-分裂马尔可夫链蒙特卡罗方法
Phys Rev E. 2020 Jul;102(1-1):012305. doi: 10.1103/PhysRevE.102.012305.
8
Investigating the epi-miRNome: identification of epi-miRNAs using transfection experiments.研究 epi-miRNome:使用转染实验鉴定 epi-miRNAs。
Epigenomics. 2019 Nov;11(14):1581-1599. doi: 10.2217/epi-2019-0050. Epub 2019 Nov 6.
9
Distinct signatures of lung cancer types: aberrant mucin O-glycosylation and compromised immune response.肺癌类型的独特特征:异常的粘蛋白 O-糖基化和受损的免疫反应。
BMC Cancer. 2019 Aug 20;19(1):824. doi: 10.1186/s12885-019-5965-x.
10
Molecular Inverse Comorbidity between Alzheimer's Disease and Lung Cancer: New Insights from Matrix Factorization.阿尔茨海默病与肺癌的分子反向共病:矩阵分解的新见解。
Int J Mol Sci. 2019 Jun 26;20(13):3114. doi: 10.3390/ijms20133114.