• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过网络平滑 T 统计量进行生物标志物特征发现的网络和数据集成。

Network and data integration for biomarker signature discovery via network smoothed T-statistics.

机构信息

Algorithmic Bioinformatics, Bonn-Aachen International Center for IT, Bonn, Germany.

出版信息

PLoS One. 2013 Sep 3;8(9):e73074. doi: 10.1371/journal.pone.0073074. eCollection 2013.

DOI:10.1371/journal.pone.0073074
PMID:24019896
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3760887/
Abstract

Predictive, stable and interpretable gene signatures are generally seen as an important step towards a better personalized medicine. During the last decade various methods have been proposed for that purpose. However, one important obstacle for making gene signatures a standard tool in clinics is the typical low reproducibility of signatures combined with the difficulty to achieve a clear biological interpretation. For that purpose in the last years there has been a growing interest in approaches that try to integrate information from molecular interaction networks. We here propose a technique that integrates network information as well as different kinds of experimental data (here exemplified by mRNA and miRNA expression) into one classifier. This is done by smoothing t-statistics of individual genes or miRNAs over the structure of a combined protein-protein interaction (PPI) and miRNA-target gene network. A permutation test is conducted to select features in a highly consistent manner, and subsequently a Support Vector Machine (SVM) classifier is trained. Compared to several other competing methods our algorithm reveals an overall better prediction performance for early versus late disease relapse and a higher signature stability. Moreover, obtained gene lists can be clearly associated to biological knowledge, such as known disease genes and KEGG pathways. We demonstrate that our data integration strategy can improve classification performance compared to using a single data source only. Our method, called stSVM, is available in R-package netClass on CRAN (http://cran.r-project.org).

摘要

预测性、稳定性和可解释性的基因特征通常被视为迈向更好的个性化医学的重要一步。在过去的十年中,已经提出了各种方法来实现这一目标。然而,使基因特征成为临床标准工具的一个重要障碍是特征的典型低重现性,加上难以实现明确的生物学解释。为此,近年来人们越来越关注试图整合分子相互作用网络信息的方法。我们在这里提出了一种技术,该技术将网络信息以及不同类型的实验数据(这里以 mRNA 和 miRNA 表达为例)整合到一个分类器中。这是通过在组合的蛋白质-蛋白质相互作用(PPI)和 miRNA 靶基因网络的结构上对单个基因或 miRNA 的 t 统计进行平滑处理来实现的。通过置换检验以高度一致的方式选择特征,随后训练支持向量机(SVM)分类器。与其他几种竞争方法相比,我们的算法在早期与晚期疾病复发的预测性能以及更高的特征稳定性方面均表现出整体更好的性能。此外,获得的基因列表可以清楚地与生物学知识相关联,例如已知的疾病基因和 KEGG 途径。我们证明,与仅使用单一数据源相比,我们的数据集成策略可以提高分类性能。我们的方法称为 stSVM,可在 CRAN 上的 R 包 netClass 中获得(http://cran.r-project.org)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/d1dddfd623b4/pone.0073074.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/e94d9eee9aa7/pone.0073074.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/e1a9fbc113cf/pone.0073074.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/69569be45cef/pone.0073074.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/82aa7de45b31/pone.0073074.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/31cd2db814f4/pone.0073074.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/d1dddfd623b4/pone.0073074.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/e94d9eee9aa7/pone.0073074.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/e1a9fbc113cf/pone.0073074.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/69569be45cef/pone.0073074.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/82aa7de45b31/pone.0073074.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/31cd2db814f4/pone.0073074.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/798d/3760887/d1dddfd623b4/pone.0073074.g006.jpg

相似文献

1
Network and data integration for biomarker signature discovery via network smoothed T-statistics.通过网络平滑 T 统计量进行生物标志物特征发现的网络和数据集成。
PLoS One. 2013 Sep 3;8(9):e73074. doi: 10.1371/journal.pone.0073074. eCollection 2013.
2
netClass: an R-package for network based, integrative biomarker signature discovery.netClass:一个基于网络的综合生物标志物特征发现的 R 包。
Bioinformatics. 2014 May 1;30(9):1325-6. doi: 10.1093/bioinformatics/btu025. Epub 2014 Jan 17.
3
Biomarker gene signature discovery integrating network knowledge.整合网络知识的生物标志物基因特征发现。
Biology (Basel). 2012 Feb 27;1(1):5-17. doi: 10.3390/biology1010005.
4
Including network knowledge into Cox regression models for biomarker signature discovery.将网络知识纳入Cox回归模型以发现生物标志物特征。
Biom J. 2014 Mar;56(2):287-306. doi: 10.1002/bimj.201300035. Epub 2014 Jan 15.
5
Network based consensus gene signatures for biomarker discovery in breast cancer.基于网络的乳腺癌生物标志物发现的共识基因特征。
PLoS One. 2011;6(10):e25364. doi: 10.1371/journal.pone.0025364. Epub 2011 Oct 25.
6
pathClass: an R-package for integration of pathway knowledge into support vector machines for biomarker discovery.pathClass:一个 R 包,用于将途径知识集成到支持向量机中,以发现生物标志物。
Bioinformatics. 2011 May 15;27(10):1442-3. doi: 10.1093/bioinformatics/btr157. Epub 2011 Mar 30.
7
Prognostic gene signatures for patient stratification in breast cancer: accuracy, stability and interpretability of gene selection approaches using prior knowledge on protein-protein interactions.用于乳腺癌患者分层的预后基因特征:利用蛋白质 - 蛋白质相互作用的先验知识选择基因的方法的准确性、稳定性和可解释性。
BMC Bioinformatics. 2012 May 1;13:69. doi: 10.1186/1471-2105-13-69.
8
Robust edge-based biomarker discovery improves prediction of breast cancer metastasis.基于稳健边缘的生物标志物发现可提高乳腺癌转移的预测能力。
BMC Bioinformatics. 2020 Sep 30;21(Suppl 14):359. doi: 10.1186/s12859-020-03692-2.
9
Cancer survival classification using integrated data sets and intermediate information.基于整合数据集和中间信息的癌症生存分类。
Artif Intell Med. 2014 Sep;62(1):23-31. doi: 10.1016/j.artmed.2014.06.003. Epub 2014 Jun 21.
10
Regularized logistic regression with network-based pairwise interaction for biomarker identification in breast cancer.用于乳腺癌生物标志物识别的基于网络的成对相互作用的正则化逻辑回归
BMC Bioinformatics. 2016 Feb 27;17:108. doi: 10.1186/s12859-016-0951-7.

引用本文的文献

1
Identification of Prognostic Biomarkers in Gene Expression Profile of Neuroblastoma Via Machine Learning.通过机器学习鉴定神经母细胞瘤基因表达谱中的预后生物标志物
Pediatr Discov. 2025 May 27;3(2):e70009. doi: 10.1002/pdi3.70009. eCollection 2025 Jun.
2
Potential Mechanisms Underlying the Minimal Impact of Cry1Ab1 Protein on .Cry1Ab1蛋白产生最小影响的潜在机制
Int J Mol Sci. 2025 Mar 24;26(7):2924. doi: 10.3390/ijms26072924.
3
CTRR-ncRNA: A Knowledgebase for Cancer Therapy Resistance and Recurrence Associated Non-coding RNAs.

本文引用的文献

1
Biomarker gene signature discovery integrating network knowledge.整合网络知识的生物标志物基因特征发现。
Biology (Basel). 2012 Feb 27;1(1):5-17. doi: 10.3390/biology1010005.
2
Interactogeneous: disease gene prioritization using heterogeneous networks and full topology scores.互适性:使用异质网络和全拓扑评分进行疾病基因优先级排序。
PLoS One. 2012;7(11):e49634. doi: 10.1371/journal.pone.0049634. Epub 2012 Nov 19.
3
Computational tools for prioritizing candidate genes: boosting disease gene discovery.计算工具在候选基因优先级排序中的应用:提高疾病基因发现的效率。
CTRR-ncRNA:一个与癌症治疗抵抗和复发相关的非编码 RNA 知识库。
Genomics Proteomics Bioinformatics. 2023 Apr;21(2):292-299. doi: 10.1016/j.gpb.2022.10.003. Epub 2022 Oct 17.
4
Network-based integration of multi-omics data for clinical outcome prediction in neuroblastoma.基于网络的多组学数据整合用于神经母细胞瘤的临床预后预测。
Sci Rep. 2022 Sep 14;12(1):15425. doi: 10.1038/s41598-022-19019-5.
5
Loss of CHGA Protein as a Potential Biomarker for Colon Cancer Diagnosis: A Study on Biomarker Discovery by Machine Learning and Confirmation by Immunohistochemistry in Colorectal Cancer Tissue Microarrays.CHGA蛋白缺失作为结肠癌诊断的潜在生物标志物:一项通过机器学习发现生物标志物并在结直肠癌组织微阵列中进行免疫组化验证的研究
Cancers (Basel). 2022 May 27;14(11):2664. doi: 10.3390/cancers14112664.
6
Feature selection and classification over the network with missing node observations.基于带有节点观测缺失的网络进行特征选择和分类。
Stat Med. 2022 Mar 30;41(7):1242-1262. doi: 10.1002/sim.9267. Epub 2021 Nov 23.
7
Computational Biology and Machine Learning Approaches to Understand Mechanistic Microbiome-Host Interactions.理解微生物组与宿主相互作用机制的计算生物学和机器学习方法
Front Microbiol. 2021 May 11;12:618856. doi: 10.3389/fmicb.2021.618856. eCollection 2021.
8
EARN: an ensemble machine learning algorithm to predict driver genes in metastatic breast cancer.EARN:一种集成机器学习算法,用于预测转移性乳腺癌的驱动基因。
BMC Med Genomics. 2021 May 7;14(1):122. doi: 10.1186/s12920-021-00974-3.
9
The effect of statistical normalization on network propagation scores.统计归一化对网络传播评分的影响。
Bioinformatics. 2021 May 5;37(6):845-852. doi: 10.1093/bioinformatics/btaa896.
10
On fusion methods for knowledge discovery from multi-omics datasets.关于从多组学数据集中进行知识发现的融合方法
Comput Struct Biotechnol J. 2020 Mar 5;18:509-517. doi: 10.1016/j.csbj.2020.02.011. eCollection 2020.
Nat Rev Genet. 2012 Jul 3;13(8):523-36. doi: 10.1038/nrg3253.
4
Google goes cancer: improving outcome prediction for cancer patients by network-based ranking of marker genes.谷歌对抗癌症:通过基于网络的标记基因排名改善癌症患者的预后预测。
PLoS Comput Biol. 2012;8(5):e1002511. doi: 10.1371/journal.pcbi.1002511. Epub 2012 May 17.
5
Exome sequencing identifies recurrent SPOP, FOXA1 and MED12 mutations in prostate cancer.外显子组测序鉴定前列腺癌中 SPOP、FOXA1 和 MED12 的高频突变。
Nat Genet. 2012 May 20;44(6):685-9. doi: 10.1038/ng.2279.
6
A critical evaluation of network and pathway-based classifiers for outcome prediction in breast cancer.基于网络和途径的分类器在乳腺癌预后预测中的评价。
PLoS One. 2012;7(4):e34796. doi: 10.1371/journal.pone.0034796. Epub 2012 Apr 27.
7
Prognostic gene signatures for patient stratification in breast cancer: accuracy, stability and interpretability of gene selection approaches using prior knowledge on protein-protein interactions.用于乳腺癌患者分层的预后基因特征:利用蛋白质 - 蛋白质相互作用的先验知识选择基因的方法的准确性、稳定性和可解释性。
BMC Bioinformatics. 2012 May 1;13:69. doi: 10.1186/1471-2105-13-69.
8
Cancer genomics: technology, discovery, and translation.癌症基因组学:技术、发现与转化。
J Clin Oncol. 2012 Feb 20;30(6):647-60. doi: 10.1200/JCO.2011.39.2316. Epub 2012 Jan 23.
9
Graph based fusion of miRNA and mRNA expression data improves clinical outcome prediction in prostate cancer.基于图的 miRNA 和 mRNA 表达数据融合可改善前列腺癌的临床预后预测。
BMC Bioinformatics. 2011 Dec 21;12:488. doi: 10.1186/1471-2105-12-488.
10
Adipocytes promote ovarian cancer metastasis and provide energy for rapid tumor growth.脂肪细胞促进卵巢癌转移,并为肿瘤的快速生长提供能量。
Nat Med. 2011 Oct 30;17(11):1498-503. doi: 10.1038/nm.2492.