• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于协同成分分析的知识引导基因排序。

Knowledge-guided gene ranking by coordinative component analysis.

机构信息

Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, VA, USA.

出版信息

BMC Bioinformatics. 2010 Mar 30;11:162. doi: 10.1186/1471-2105-11-162.

DOI:10.1186/1471-2105-11-162
PMID:20353603
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2865494/
Abstract

BACKGROUND

In cancer, gene networks and pathways often exhibit dynamic behavior, particularly during the process of carcinogenesis. Thus, it is important to prioritize those genes that are strongly associated with the functionality of a network. Traditional statistical methods are often inept to identify biologically relevant member genes, motivating researchers to incorporate biological knowledge into gene ranking methods. However, current integration strategies are often heuristic and fail to incorporate fully the true interplay between biological knowledge and gene expression data.

RESULTS

To improve knowledge-guided gene ranking, we propose a novel method called coordinative component analysis (COCA) in this paper. COCA explicitly captures those genes within a specific biological context that are likely to be expressed in a coordinative manner. Formulated as an optimization problem to maximize the coordinative effort, COCA is designed to first extract the coordinative components based on a partial guidance from knowledge genes and then rank the genes according to their participation strengths. An embedded bootstrapping procedure is implemented to improve statistical robustness of the solutions. COCA was initially tested on simulation data and then on published gene expression microarray data to demonstrate its improved performance as compared to traditional statistical methods. Finally, the COCA approach has been applied to stem cell data to identify biologically relevant genes in signaling pathways. As a result, the COCA approach uncovers novel pathway members that may shed light into the pathway deregulation in cancers.

CONCLUSION

We have developed a new integrative strategy to combine biological knowledge and microarray data for gene ranking. The method utilizes knowledge genes for a guidance to first extract coordinative components, and then rank the genes according to their contribution related to a network or pathway. The experimental results show that such a knowledge-guided strategy can provide context-specific gene ranking with an improved performance in pathway member identification.

摘要

背景

在癌症中,基因网络和途径通常表现出动态行为,特别是在癌变过程中。因此,重要的是要优先考虑那些与网络功能密切相关的基因。传统的统计方法通常无法识别具有生物学意义的成员基因,这促使研究人员将生物学知识纳入基因排名方法中。然而,当前的整合策略往往是启发式的,未能充分纳入生物学知识与基因表达数据之间的真实相互作用。

结果

为了改进知识引导的基因排名,我们在本文中提出了一种名为协同成分分析(COCA)的新方法。COCA 明确捕获了特定生物学背景下可能以协调方式表达的那些基因。作为最大化协同作用的优化问题,COCA 旨在首先根据知识基因的部分指导提取协同成分,然后根据它们的参与强度对基因进行排名。实施了一个嵌入式自举程序来提高解决方案的统计稳健性。COCA 最初在模拟数据上进行了测试,然后在已发表的基因表达微阵列数据上进行了测试,以证明与传统统计方法相比其性能得到了提高。最后,COCA 方法已应用于干细胞数据,以鉴定信号通路中的生物学相关基因。结果,COCA 方法揭示了可能揭示癌症中途径失调的新途径成员。

结论

我们开发了一种新的整合策略,将生物学知识和微阵列数据结合起来进行基因排名。该方法利用知识基因进行指导,首先提取协同成分,然后根据它们与网络或途径的贡献对基因进行排名。实验结果表明,这种知识引导的策略可以提供特定于上下文的基因排名,并在途径成员识别方面提高性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/885e/2865494/29282fac588b/1471-2105-11-162-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/885e/2865494/451b26f4e4ec/1471-2105-11-162-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/885e/2865494/edc83e369259/1471-2105-11-162-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/885e/2865494/21d562081578/1471-2105-11-162-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/885e/2865494/06630e1f6392/1471-2105-11-162-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/885e/2865494/29282fac588b/1471-2105-11-162-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/885e/2865494/451b26f4e4ec/1471-2105-11-162-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/885e/2865494/edc83e369259/1471-2105-11-162-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/885e/2865494/21d562081578/1471-2105-11-162-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/885e/2865494/06630e1f6392/1471-2105-11-162-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/885e/2865494/29282fac588b/1471-2105-11-162-5.jpg

相似文献

1
Knowledge-guided gene ranking by coordinative component analysis.基于协同成分分析的知识引导基因排序。
BMC Bioinformatics. 2010 Mar 30;11:162. doi: 10.1186/1471-2105-11-162.
2
Knowledge-guided multi-scale independent component analysis for biomarker identification.用于生物标志物识别的知识引导多尺度独立成分分析
BMC Bioinformatics. 2008 Oct 6;9:416. doi: 10.1186/1471-2105-9-416.
3
Incorporating biological knowledge into distance-based clustering analysis of microarray gene expression data.将生物学知识融入基于距离的微阵列基因表达数据聚类分析中。
Bioinformatics. 2006 May 15;22(10):1259-68. doi: 10.1093/bioinformatics/btl065. Epub 2006 Feb 24.
4
Is cross-validation better than resubstitution for ranking genes?在对基因进行排名时,交叉验证是否比重替代法更好?
Bioinformatics. 2004 Jan 22;20(2):253-8. doi: 10.1093/bioinformatics/btg399.
5
Biological pathway selection through nonlinear dimension reduction.通过非线性降维选择生物途径。
Biostatistics. 2011 Jul;12(3):429-44. doi: 10.1093/biostatistics/kxq081. Epub 2011 Jan 20.
6
Mining gene expression data based on template theory.基于模板理论挖掘基因表达数据。
Bioinformatics. 2004 Nov 1;20(16):2759-66. doi: 10.1093/bioinformatics/bth323. Epub 2004 May 27.
7
Robust PCA based method for discovering differentially expressed genes.基于鲁棒主成分分析的差异表达基因发现方法。
BMC Bioinformatics. 2013;14 Suppl 8(Suppl 8):S3. doi: 10.1186/1471-2105-14-S8-S3. Epub 2013 May 9.
8
Reordering based integrative expression profiling for microarray classification.基于重排的整合表达谱分析用于微阵列分类。
BMC Bioinformatics. 2012 Mar 13;13 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2105-13-S2-S1.
9
Very Important Pool (VIP) genes--an application for microarray-based molecular signatures.非常重要的基因池(VIP)基因——基于微阵列的分子特征的一种应用。
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S9. doi: 10.1186/1471-2105-9-S9-S9.
10
Significance of gene ranking for classification of microarray samples.基因排序在微阵列样本分类中的意义。
IEEE/ACM Trans Comput Biol Bioinform. 2006 Jul-Sep;3(3):312-20. doi: 10.1109/TCBB.2006.42.

引用本文的文献

1
Network tuned multiple rank aggregation and applications to gene ranking.网络调谐的多重排序聚合及其在基因排序中的应用。
BMC Bioinformatics. 2015;16 Suppl 1(Suppl 1):S6. doi: 10.1186/1471-2105-16-S1-S6. Epub 2015 Jan 21.
2
Gene set bagging for estimating the probability a statistically significant result will replicate.基因集装袋法用于估计统计显著结果再现的概率。
BMC Bioinformatics. 2013 Dec 12;14:360. doi: 10.1186/1471-2105-14-360.
3
Physical and genetic-interaction density reveals functional organization and informs significance cutoffs in genome-wide screens.

本文引用的文献

1
Gene module identification from microarray data using nonnegative independent component analysis.使用非负独立成分分析从微阵列数据中识别基因模块。
Gene Regul Syst Bio. 2008 Jan 15;1:349-63.
2
Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.利用DAVID生物信息学资源对大型基因列表进行系统和综合分析。
Nat Protoc. 2009;4(1):44-57. doi: 10.1038/nprot.2008.211.
3
Nonnegative matrix factorization: an analytical and interpretive tool in computational biology.非负矩阵分解:计算生物学中的一种分析和解释工具。
物理和遗传相互作用密度揭示了功能组织,并为全基因组筛选中的显著截断提供了信息。
Proc Natl Acad Sci U S A. 2013 Apr 30;110(18):7389-94. doi: 10.1073/pnas.1219582110. Epub 2013 Apr 15.
4
Computational Analysis of Muscular Dystrophy Sub-types Using A Novel Integrative Scheme.使用新型综合方案对肌营养不良亚型进行计算分析。
Neurocomputing (Amst). 2012 Sep 1;92:9-17. doi: 10.1016/j.neucom.2011.08.037.
PLoS Comput Biol. 2008 Jul 25;4(7):e1000029. doi: 10.1371/journal.pcbi.1000029.
4
Unraveling transcriptional regulatory programs by integrative analysis of microarray and transcription factor binding data.通过微阵列和转录因子结合数据的综合分析来揭示转录调控程序。
Bioinformatics. 2008 Sep 1;24(17):1874-80. doi: 10.1093/bioinformatics/btn332. Epub 2008 Jun 27.
5
Motif-directed network component analysis for regulatory network inference.用于调控网络推断的基序导向网络组件分析。
BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S21. doi: 10.1186/1471-2105-9-S1-S21.
6
Genomic studies to explore self-renewal and differentiation properties of embryonic stem cells.探索胚胎干细胞自我更新和分化特性的基因组学研究。
Front Biosci. 2008 Jan 1;13:276-83. doi: 10.2741/2678.
7
Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data.微阵列数据的可重复性:对微阵列质量控制(MAQC)数据的进一步分析。
BMC Bioinformatics. 2007 Oct 25;8:412. doi: 10.1186/1471-2105-8-412.
8
Elucidating the altered transcriptional programs in breast cancer using independent component analysis.使用独立成分分析阐明乳腺癌中改变的转录程序。
PLoS Comput Biol. 2007 Aug;3(8):e161. doi: 10.1371/journal.pcbi.0030161. Epub 2007 Jun 29.
9
A comparison of bootstrap methods and an adjusted bootstrap approach for estimating the prediction error in microarray classification.用于估计微阵列分类中预测误差的自助法与调整后的自助法的比较。
Stat Med. 2007 Dec 20;26(29):5320-34. doi: 10.1002/sim.2968.
10
GeneTrail--advanced gene set enrichment analysis.GeneTrail——高级基因集富集分析
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W186-92. doi: 10.1093/nar/gkm323. Epub 2007 May 25.