• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于网络的双聚类分析用于发现癌症亚型。

Network-aided Bi-Clustering for discovering cancer subtypes.

机构信息

College of Computer and Information Science, Southwest University, Chongqing, China.

出版信息

Sci Rep. 2017 Apr 21;7(1):1046. doi: 10.1038/s41598-017-01064-0.

DOI:10.1038/s41598-017-01064-0
PMID:28432308
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5430742/
Abstract

Bi-clustering is a widely used data mining technique for analyzing gene expression data. It simultaneously groups genes and samples of an input gene expression data matrix to discover bi-clusters that relevant samples exhibit similar gene expression profiles over a subset of genes. The discovered bi-clusters bring insights for categorization of cancer subtypes, gene treatments and others. Most existing bi-clustering approaches can only enumerate bi-clusters with constant values. Gene interaction networks can help to understand the pattern of cancer subtypes, but they are rarely integrated with gene expression data for exploring cancer subtypes. In this paper, we propose a novel method called Network-aided Bi-Clustering (NetBC). NetBC assigns weights to genes based on the structure of gene interaction network, and it iteratively optimizes sum-squared residue to obtain the row and column indicative matrices of bi-clusters by matrix factorization. NetBC can not only efficiently discover bi-clusters with constant values, but also bi-clusters with coherent trends. Empirical study on large-scale cancer gene expression datasets demonstrates that NetBC can more accurately discover cancer subtypes than other related algorithms.

摘要

双聚类是一种广泛使用的数据挖掘技术,用于分析基因表达数据。它同时对基因和输入基因表达数据矩阵的样本进行分组,以发现相关样本在基因子集上表现出相似的基因表达谱的双聚类。发现的双聚类为癌症亚型的分类、基因治疗等提供了深入的了解。大多数现有的双聚类方法只能枚举具有恒定值的双聚类。基因相互作用网络有助于理解癌症亚型的模式,但它们很少与基因表达数据集成以探索癌症亚型。在本文中,我们提出了一种称为基于网络的双聚类(NetBC)的新方法。NetBC 根据基因相互作用网络的结构为基因分配权重,并通过矩阵分解迭代优化平方和残差,以获得双聚类的行和列指示矩阵。NetBC 不仅可以有效地发现具有恒定值的双聚类,还可以发现具有一致趋势的双聚类。在大规模癌症基因表达数据集上的实证研究表明,NetBC 可以比其他相关算法更准确地发现癌症亚型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/9f258f272252/41598_2017_1064_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/b32e683a4e97/41598_2017_1064_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/caba2caf9671/41598_2017_1064_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/fb5d0f0cdaee/41598_2017_1064_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/8ef5142603ba/41598_2017_1064_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/c661514ad70d/41598_2017_1064_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/c1156afbda59/41598_2017_1064_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/6024db51d646/41598_2017_1064_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/e0b9e1d1ab78/41598_2017_1064_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/4e8ec75bea46/41598_2017_1064_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/b025b15360cf/41598_2017_1064_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/9f258f272252/41598_2017_1064_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/b32e683a4e97/41598_2017_1064_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/caba2caf9671/41598_2017_1064_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/fb5d0f0cdaee/41598_2017_1064_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/8ef5142603ba/41598_2017_1064_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/c661514ad70d/41598_2017_1064_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/c1156afbda59/41598_2017_1064_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/6024db51d646/41598_2017_1064_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/e0b9e1d1ab78/41598_2017_1064_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/4e8ec75bea46/41598_2017_1064_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/b025b15360cf/41598_2017_1064_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4c/5430742/9f258f272252/41598_2017_1064_Fig11_HTML.jpg

相似文献

1
Network-aided Bi-Clustering for discovering cancer subtypes.基于网络的双聚类分析用于发现癌症亚型。
Sci Rep. 2017 Apr 21;7(1):1046. doi: 10.1038/s41598-017-01064-0.
2
A network-assisted co-clustering algorithm to discover cancer subtypes based on gene expression.基于基因表达的网络辅助协同聚类算法发现癌症亚型。
BMC Bioinformatics. 2014 Feb 4;15:37. doi: 10.1186/1471-2105-15-37.
3
Clustering approaches for visual knowledge exploration in molecular interaction networks.分子相互作用网络中视觉知识探索的聚类方法。
BMC Bioinformatics. 2018 Aug 29;19(1):308. doi: 10.1186/s12859-018-2314-z.
4
Feature related multi-view nonnegative matrix factorization for identifying conserved functional modules in multiple biological networks.基于特征相关的多视图非负矩阵分解识别多个生物网络中的保守功能模块。
BMC Bioinformatics. 2018 Oct 29;19(1):394. doi: 10.1186/s12859-018-2434-5.
5
Network based stratification of major cancers by integrating somatic mutation and gene expression data.通过整合体细胞突变和基因表达数据对主要癌症进行基于网络的分层。
PLoS One. 2017 May 16;12(5):e0177662. doi: 10.1371/journal.pone.0177662. eCollection 2017.
6
Bi-clustering of metabolic data using matrix factorization tools.基于矩阵分解工具的代谢数据双聚类分析。
Methods. 2018 Dec 1;151:12-20. doi: 10.1016/j.ymeth.2018.02.004. Epub 2018 Feb 10.
7
SCNrank: spectral clustering for network-based ranking to reveal potential drug targets and its application in pancreatic ductal adenocarcinoma.SCNrank:基于网络的排序的谱聚类揭示潜在的药物靶点及其在胰腺导管腺癌中的应用。
BMC Med Genomics. 2020 Apr 3;13(Suppl 5):50. doi: 10.1186/s12920-020-0681-6.
8
Integrative clustering by nonnegative matrix factorization can reveal coherent functional groups from gene profile data.非负矩阵分解的集成聚类可以从基因谱数据中揭示出一致的功能组。
IEEE J Biomed Health Inform. 2015 Mar;19(2):698-708. doi: 10.1109/JBHI.2014.2316508. Epub 2014 Apr 10.
9
Incorporating topological information for predicting robust cancer subnetwork markers in human protein-protein interaction network.整合拓扑信息以预测人类蛋白质-蛋白质相互作用网络中稳健的癌症子网标志物。
BMC Bioinformatics. 2016 Oct 6;17(Suppl 13):351. doi: 10.1186/s12859-016-1224-1.
10
Multi-layer Bundling as a New Approach for Determining Multi-scale Correlations Within a High-Dimensional Dataset.多层捆绑作为一种新方法,用于确定高维数据集内的多尺度相关性。
Bull Math Biol. 2024 Jul 12;86(9):105. doi: 10.1007/s11538-024-01335-8.

引用本文的文献

1
Network Approaches for Precision Oncology.网络方法在精准肿瘤学中的应用。
Adv Exp Med Biol. 2022;1361:199-213. doi: 10.1007/978-3-030-91836-1_11.
2
Knowledge-Guided Statistical Learning Methods for Analysis of High-Dimensional -Omics Data in Precision Oncology.用于精准肿瘤学中高维组学数据分析的知识引导统计学习方法
JCO Precis Oncol. 2019 Oct 24;3. doi: 10.1200/PO.19.00018. eCollection 2019 Oct.
3
Unsupervised Algorithms for Microarray Sample Stratification.非监督算法在微阵列样本分层中的应用。

本文引用的文献

1
UniBic: Sequential row-based biclustering algorithm for analysis of gene expression data.UniBic:用于基因表达数据分析的基于行的序列双聚类算法。
Sci Rep. 2016 Mar 22;6:23466. doi: 10.1038/srep23466.
2
STRING v10: protein-protein interaction networks, integrated over the tree of life.STRING v10:整合了整个生命之树的蛋白质-蛋白质相互作用网络。
Nucleic Acids Res. 2015 Jan;43(Database issue):D447-52. doi: 10.1093/nar/gku1003. Epub 2014 Oct 28.
3
Bi-Force: large-scale bicluster editing and its application to gene expression data biclustering.
Methods Mol Biol. 2022;2401:121-146. doi: 10.1007/978-1-0716-1839-4_9.
4
Framework for classification of cancer gene expression data using Bayesian hyper-parameter optimization.基于贝叶斯超参数优化的癌症基因表达数据分类框架。
Med Biol Eng Comput. 2021 Nov;59(11-12):2353-2371. doi: 10.1007/s11517-021-02442-7. Epub 2021 Oct 5.
5
Knowledge-Guided Biclustering via Sparse Variational EM Algorithm.基于稀疏变分期望最大化算法的知识引导双聚类
10th IEEE Int Conf Big Knowl (2019). 2019 Nov;2019:25-32. doi: 10.1109/icbk.2019.00012. Epub 2019 Dec 30.
6
Short Time-Series Expression Transcriptome Data Reveal the Gene Expression Patterns of Dairy Cow Mammary Gland as Milk Yield Decreased Process.短时间序列表达转录组数据揭示了奶牛乳腺产奶量下降过程中的基因表达模式。
Genes (Basel). 2021 Jun 20;12(6):942. doi: 10.3390/genes12060942.
7
Knowledge-guided analysis of "omics" data using the KnowEnG cloud platform.基于 KnowEnG 云平台的“组学”数据知识引导分析。
PLoS Biol. 2020 Jan 23;18(1):e3000583. doi: 10.1371/journal.pbio.3000583. eCollection 2020 Jan.
8
Bayesian generalized biclustering analysis via adaptive structured shrinkage.基于自适应结构收缩的贝叶斯广义双聚类分析。
Biostatistics. 2020 Jul 1;21(3):610-624. doi: 10.1093/biostatistics/kxy081.
9
The Emerging Potential for Network Analysis to Inform Precision Cancer Medicine.网络分析在精准肿瘤医学中的新兴应用潜力
J Mol Biol. 2018 Sep 14;430(18 Pt A):2875-2899. doi: 10.1016/j.jmb.2018.06.016. Epub 2018 Jun 15.
双力法:大规模双聚类编辑及其在基因表达数据双聚类中的应用。
Nucleic Acids Res. 2014 May;42(9):e78. doi: 10.1093/nar/gku201. Epub 2014 Mar 20.
4
BiCluE - Exact and heuristic algorithms for weighted bi-cluster editing of biomedical data.BiCluE - 用于生物医学数据加权双簇编辑的精确算法和启发式算法。
BMC Proc. 2013 Dec 20;7(Suppl 7):S9. doi: 10.1186/1753-6561-7-S7-S9.
5
A network-assisted co-clustering algorithm to discover cancer subtypes based on gene expression.基于基因表达的网络辅助协同聚类算法发现癌症亚型。
BMC Bioinformatics. 2014 Feb 4;15:37. doi: 10.1186/1471-2105-15-37.
6
Network-based stratification of tumor mutations.基于网络的肿瘤突变分层。
Nat Methods. 2013 Nov;10(11):1108-15. doi: 10.1038/nmeth.2651. Epub 2013 Sep 15.
7
Comprehensive molecular portraits of human breast tumours.人类乳腺肿瘤的全面分子特征图谱。
Nature. 2012 Oct 4;490(7418):61-70. doi: 10.1038/nature11412. Epub 2012 Sep 23.
8
A comparative analysis of biclustering algorithms for gene expression data.基于基因表达数据的对比分析双聚类算法。
Brief Bioinform. 2013 May;14(3):279-92. doi: 10.1093/bib/bbs032. Epub 2012 Jul 6.
9
Network medicine: a network-based approach to human disease.网络医学:一种基于网络的人类疾病研究方法。
Nat Rev Genet. 2011 Jan;12(1):56-68. doi: 10.1038/nrg2918.
10
FABIA: factor analysis for bicluster acquisition.FABIA:双聚类因子分析。
Bioinformatics. 2010 Jun 15;26(12):1520-7. doi: 10.1093/bioinformatics/btq227. Epub 2010 Apr 23.