• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在 DREAM3 基因表达预测挑战赛中表现优异的算法。

A top-performing algorithm for the DREAM3 gene expression prediction challenge.

机构信息

Department of Computer Science, University of Texas at San Antonio, San Antonio, Texas, United States of America.

出版信息

PLoS One. 2010 Feb 4;5(2):e8944. doi: 10.1371/journal.pone.0008944.

DOI:10.1371/journal.pone.0008944
PMID:20140212
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2816205/
Abstract

A wealth of computational methods has been developed to address problems in systems biology, such as modeling gene expression. However, to objectively evaluate and compare such methods is notoriously difficult. The DREAM (Dialogue on Reverse Engineering Assessments and Methods) project is a community-wide effort to assess the relative strengths and weaknesses of different computational methods for a set of core problems in systems biology. This article presents a top-performing algorithm for one of the challenge problems in the third annual DREAM (DREAM3), namely the gene expression prediction challenge. In this challenge, participants are asked to predict the expression levels of a small set of genes in a yeast deletion strain, given the expression levels of all other genes in the same strain and complete gene expression data for several other yeast strains. I propose a simple -nearest-neighbor (KNN) method to solve this problem. Despite its simplicity, this method works well for this challenge, sharing the "top performer" honor with a much more sophisticated method. I also describe several alternative, simple strategies, including a modified KNN algorithm that further improves the performance of the standard KNN method. The success of these methods suggests that complex methods attempting to integrate multiple data sets do not necessarily lead to better performance than simple yet robust methods. Furthermore, none of these top-performing methods, including the one by a different team, are based on gene regulatory networks, which seems to suggest that accurately modeling gene expression using gene regulatory networks is unfortunately still a difficult task.

摘要

已经开发了大量的计算方法来解决系统生物学中的问题,例如基因表达建模。然而,客观地评估和比较这些方法是非常困难的。DREAM(反向工程评估和方法对话)项目是一个社区范围内的努力,旨在评估不同计算方法在系统生物学核心问题上的相对优势和劣势。本文介绍了第三届 DREAM(DREAM3)挑战赛中的一个核心问题的表现最佳算法,即基因表达预测挑战赛。在这个挑战中,参与者被要求根据同一菌株中所有其他基因的表达水平以及其他几个酵母菌株的完整基因表达数据,预测一小部分基因在酵母缺失菌株中的表达水平。我提出了一种简单的最近邻(KNN)方法来解决这个问题。尽管这种方法很简单,但它在这个挑战中表现良好,与一种更加复杂的方法共同获得了“表现最佳”的荣誉。我还描述了几种替代的简单策略,包括一种改进的 KNN 算法,它进一步提高了标准 KNN 方法的性能。这些方法的成功表明,试图整合多个数据集的复杂方法不一定比简单但稳健的方法表现更好。此外,包括一个不同团队提出的方法在内,这些表现最佳的方法都没有基于基因调控网络,这似乎表明,使用基因调控网络准确地建模基因表达仍然是一项困难的任务。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0010/2816205/2b4d34454b93/pone.0008944.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0010/2816205/9693425dc052/pone.0008944.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0010/2816205/e541d04017d8/pone.0008944.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0010/2816205/127e9613cddf/pone.0008944.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0010/2816205/2b4d34454b93/pone.0008944.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0010/2816205/9693425dc052/pone.0008944.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0010/2816205/e541d04017d8/pone.0008944.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0010/2816205/127e9613cddf/pone.0008944.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0010/2816205/2b4d34454b93/pone.0008944.g004.jpg

相似文献

1
A top-performing algorithm for the DREAM3 gene expression prediction challenge.在 DREAM3 基因表达预测挑战赛中表现优异的算法。
PLoS One. 2010 Feb 4;5(2):e8944. doi: 10.1371/journal.pone.0008944.
2
Towards a rigorous assessment of systems biology models: the DREAM3 challenges.迈向系统生物学模型的严格评估:DREAM3 挑战。
PLoS One. 2010 Feb 23;5(2):e9202. doi: 10.1371/journal.pone.0009202.
3
Inferring gene expression from ribosomal promoter sequences, a crowdsourcing approach.从核糖体启动子序列推断基因表达,一种众包方法。
Genome Res. 2013 Nov;23(11):1928-37. doi: 10.1101/gr.157420.113. Epub 2013 Aug 15.
4
Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.基因表达的软整合预测和 DREAM3 基因表达挑战的弹性网络-最佳性能。
PLoS One. 2010 Feb 16;5(2):e9134. doi: 10.1371/journal.pone.0009134.
5
Generating realistic in silico gene networks for performance assessment of reverse engineering methods.生成用于逆向工程方法性能评估的逼真的计算机模拟基因网络。
J Comput Biol. 2009 Feb;16(2):229-39. doi: 10.1089/cmb.2008.09TT.
6
Inferring regulatory networks from expression data using tree-based methods.基于树的方法从表达数据推断调控网络。
PLoS One. 2010 Sep 28;5(9):e12776. doi: 10.1371/journal.pone.0012776.
7
Gene interaction networks based on kernel correlation metrics.基于核相关度量的基因相互作用网络。
Int J Comput Biol Drug Des. 2013;6(1-2):72-92. doi: 10.1504/IJCBDD.2013.052203. Epub 2013 Feb 21.
8
Revealing strengths and weaknesses of methods for gene network inference.揭示基因网络推断方法的优缺点。
Proc Natl Acad Sci U S A. 2010 Apr 6;107(14):6286-91. doi: 10.1073/pnas.0913357107. Epub 2010 Mar 22.
9
Sample scale-free gene regulatory network using gene ontology.使用基因本体论的无标度基因调控网络示例。
Conf Proc IEEE Eng Med Biol Soc. 2006;2006:5523-6. doi: 10.1109/IEMBS.2006.259261.
10
Lessons from the DREAM2 Challenges.来自DREAM2挑战赛的经验教训。
Ann N Y Acad Sci. 2009 Mar;1158:159-95. doi: 10.1111/j.1749-6632.2009.04497.x.

引用本文的文献

1
Experimental noise cutoff boosts inferability of transcriptional networks in large-scale gene-deletion studies.实验性噪声截止提升了大规模基因敲除研究中转录网络的可推断性。
Nat Commun. 2018 Jan 9;9(1):133. doi: 10.1038/s41467-017-02489-x.
2
Data- and knowledge-based modeling of gene regulatory networks: an update.基于数据和知识的基因调控网络建模:最新进展
EXCLI J. 2015 Mar 2;14:346-78. doi: 10.17179/excli2015-168. eCollection 2015.
3
Differential expression analysis with global network adjustment.基于全局网络调整的差异表达分析。

本文引用的文献

1
Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.基因表达的软整合预测和 DREAM3 基因表达挑战的弹性网络-最佳性能。
PLoS One. 2010 Feb 16;5(2):e9134. doi: 10.1371/journal.pone.0009134.
2
Transcriptional regulatory circuits: predicting numbers from alphabets.转录调控回路:从字母预测数量
Science. 2009 Jul 24;325(5939):429-32. doi: 10.1126/science.1171347.
3
An ensemble learning approach to reverse-engineering transcriptional regulatory networks from time-series gene expression data.
BMC Bioinformatics. 2013 Aug 23;14:258. doi: 10.1186/1471-2105-14-258.
4
Towards a rigorous assessment of systems biology models: the DREAM3 challenges.迈向系统生物学模型的严格评估:DREAM3 挑战。
PLoS One. 2010 Feb 23;5(2):e9202. doi: 10.1371/journal.pone.0009202.
5
Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.基因表达的软整合预测和 DREAM3 基因表达挑战的弹性网络-最佳性能。
PLoS One. 2010 Feb 16;5(2):e9134. doi: 10.1371/journal.pone.0009134.
一种从时间序列基因表达数据反向构建转录调控网络的集成学习方法。
BMC Genomics. 2009 Jul 7;10 Suppl 1(Suppl 1):S8. doi: 10.1186/1471-2164-10-S1-S8.
4
Computational representation of biological systems.生物系统的计算表示
Methods Mol Biol. 2009;541:535-49. doi: 10.1007/978-1-59745-243-4_23.
5
Lessons from the DREAM2 Challenges.来自DREAM2挑战赛的经验教训。
Ann N Y Acad Sci. 2009 Mar;1158:159-95. doi: 10.1111/j.1749-6632.2009.04497.x.
6
Gene regulatory network inference: data integration in dynamic models-a review.基因调控网络推断:动态模型中的数据整合——综述
Biosystems. 2009 Apr;96(1):86-103. doi: 10.1016/j.biosystems.2008.12.004. Epub 2008 Dec 27.
7
Predicting expression patterns from regulatory sequence in Drosophila segmentation.从果蝇体节调控序列预测表达模式。
Nature. 2008 Jan 31;451(7178):535-40. doi: 10.1038/nature06496. Epub 2008 Jan 2.
8
Dialogue on reverse-engineering assessment and methods: the DREAM of high-throughput pathway inference.逆向工程评估与方法对话:高通量通路推断之梦
Ann N Y Acad Sci. 2007 Dec;1115:1-22. doi: 10.1196/annals.1407.021. Epub 2007 Oct 9.
9
Computational methodologies for modelling, analysis and simulation of signalling networks.用于信号网络建模、分析和模拟的计算方法。
Brief Bioinform. 2006 Dec;7(4):339-53. doi: 10.1093/bib/bbl043. Epub 2006 Nov 20.
10
From in vivo to in silico biology and back.从体内生物学到计算机生物学,再回归。
Nature. 2006 Oct 5;443(7111):527-33. doi: 10.1038/nature05127.