• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合基因组数据集以预测酵母中的蛋白质复合物。

Integration of genomic datasets to predict protein complexes in yeast.

作者信息

Jansen Ronald, Lan Ning, Qian Jiang, Gerstein Mark

机构信息

Department of Molecular Biophysics & Biochemistry, 266 Whitney Avenue, Yale University, PO Box 208114, New Haven, CT 06520, USA.

出版信息

J Struct Funct Genomics. 2002;2(2):71-81. doi: 10.1023/a:1020495201615.

DOI:10.1023/a:1020495201615
PMID:12836664
Abstract

The ultimate goal of functional genomics is to define the function of all the genes in the genome of an organism. A large body of information of the biological roles of genes has been accumulated and aggregated in the past decades of research, both from traditional experiments detailing the role of individual genes and proteins, and from newer experimental strategies that aim to characterize gene function on a genomic scale. It is clear that the goal of functional genomics can only be achieved by integrating information and data sources from the variety of these different experiments. Integration of different data is thus an important challenge for bioinformatics. The integration of different data sources often helps to uncover non-obvious relationships between genes, but there are also two further benefits. First, it is likely that whenever information from multiple independent sources agrees, it should be more valid and reliable. Secondly, by looking at the union of multiple sources, one can cover larger parts of the genome. This is obvious for integrating results from multiple single gene or protein experiments, but also necessary for many of the results from genome-wide experiments since they are often confined to certain (although sizable) subsets of the genome. In this paper, we explore an example of such a data integration procedure. We focus on the prediction of membership in protein complexes for individual genes. For this, we recruit six different data sources that include expression profiles, interaction data, essentiality and localization information. Each of these data sources individually contains some weakly predictive information with respect to protein complexes, but we show how this prediction can be improved by combining all of them. Supplementary information is available at http:// bioinfo.mbb.yale.edu/integrate/interactions/.

摘要

功能基因组学的最终目标是确定生物体基因组中所有基因的功能。在过去几十年的研究中,已经积累并汇总了大量有关基因生物学作用的信息,这些信息既来自详细阐述单个基因和蛋白质作用的传统实验,也来自旨在在基因组规模上表征基因功能的更新实验策略。显然,只有通过整合来自这些不同实验的信息和数据源,才能实现功能基因组学的目标。因此,整合不同数据是生物信息学面临的一项重要挑战。整合不同数据源通常有助于揭示基因之间不明显的关系,但还有另外两个好处。首先,每当来自多个独立来源的信息一致时,它可能会更有效且可靠。其次,通过查看多个来源的并集,可以覆盖基因组的更大区域。这对于整合多个单基因或蛋白质实验的结果很明显,但对于许多全基因组实验的结果也是必要的,因为它们通常局限于基因组的某些(尽管规模较大)子集。在本文中,我们探讨了这样一个数据整合过程的示例。我们专注于预测单个基因在蛋白质复合物中的成员身份。为此,我们收集了六个不同的数据源,包括表达谱、相互作用数据、必需性和定位信息。这些数据源中的每一个单独对于蛋白质复合物都包含一些预测性较弱的信息,但我们展示了如何通过将所有这些信息结合起来改善这种预测。补充信息可在http://bioinfo.mbb.yale.edu/integrate/interactions/获取。

相似文献

1
Integration of genomic datasets to predict protein complexes in yeast.整合基因组数据集以预测酵母中的蛋白质复合物。
J Struct Funct Genomics. 2002;2(2):71-81. doi: 10.1023/a:1020495201615.
2
Beyond synexpression relationships: local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions.超越共表达关系:时移和反向基因表达谱的局部聚类可识别新的生物学相关相互作用。
J Mol Biol. 2001 Dec 14;314(5):1053-66. doi: 10.1006/jmbi.2000.5219.
3
Genome-scale gene function prediction using multiple sources of high-throughput data in yeast Saccharomyces cerevisiae.利用多种高通量数据来源对酿酒酵母进行全基因组规模的基因功能预测。
OMICS. 2004 Winter;8(4):322-33. doi: 10.1089/omi.2004.8.322.
4
Combining multisource information through functional-annotation-based weighting: gene function prediction in yeast.通过基于功能注释的加权整合多源信息:酵母中的基因功能预测
IEEE Trans Biomed Eng. 2009 Feb;56(2):229-36. doi: 10.1109/TBME.2008.2005955. Epub 2008 Sep 30.
5
A weighted power framework for integrating multisource information: gene function prediction in yeast.加权幂框架用于整合多源信息:酵母基因功能预测。
IEEE Trans Biomed Eng. 2012 Apr;59(4):1162-8. doi: 10.1109/TBME.2012.2186689. Epub 2012 Feb 3.
6
Struct2net: integrating structure into protein-protein interaction prediction.Struct2net:将结构整合到蛋白质-蛋白质相互作用预测中。
Pac Symp Biocomput. 2006:403-14.
7
AVID: an integrative framework for discovering functional relationships among proteins.AVID:一个用于发现蛋白质间功能关系的综合框架。
BMC Bioinformatics. 2005 Jun 1;6:136. doi: 10.1186/1471-2105-6-136.
8
[Decade of genomics--methods for genome investigation in yeast Saccharomyces cerevisiae].基因组学十年——酿酒酵母基因组研究方法
Postepy Biochem. 2006;52(4):435-47.
9
Predicting protein-protein interactions from protein sequences using meta predictor.利用元预测器从蛋白质序列预测蛋白质-蛋白质相互作用。
Amino Acids. 2010 Nov;39(5):1595-9. doi: 10.1007/s00726-010-0588-1. Epub 2010 Apr 13.
10
Relating whole-genome expression data with protein-protein interactions.将全基因组表达数据与蛋白质-蛋白质相互作用相关联。
Genome Res. 2002 Jan;12(1):37-46. doi: 10.1101/gr.205602.

引用本文的文献

1
Bayesian data integration and variable selection for pan-cancer survival prediction using protein expression data.基于蛋白质表达数据的泛癌生存预测的贝叶斯数据整合和变量选择。
Biometrics. 2020 Mar;76(1):316-325. doi: 10.1111/biom.13132. Epub 2019 Oct 3.
2
Cutaneous transcriptome analysis in NIH hairless mice.国立卫生研究院无毛小鼠的皮肤转录组分析
PLoS One. 2017 Aug 7;12(8):e0182463. doi: 10.1371/journal.pone.0182463. eCollection 2017.
3
Single-Cell Co-expression Analysis Reveals Distinct Functional Modules, Co-regulation Mechanisms and Clinical Outcomes.

本文引用的文献

1
Proteomics. Integrating interactomes.蛋白质组学。整合相互作用组。
Science. 2002 Jan 11;295(5553):284-7. doi: 10.1126/science.1068664.
2
Relating whole-genome expression data with protein-protein interactions.将全基因组表达数据与蛋白质-蛋白质相互作用相关联。
Genome Res. 2002 Jan;12(1):37-46. doi: 10.1101/gr.205602.
3
Beyond synexpression relationships: local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions.超越共表达关系:时移和反向基因表达谱的局部聚类可识别新的生物学相关相互作用。
单细胞共表达分析揭示了不同的功能模块、协同调控机制和临床结果。
PLoS Comput Biol. 2016 Apr 21;12(4):e1004892. doi: 10.1371/journal.pcbi.1004892. eCollection 2016 Apr.
4
Incorporating Ontology-Driven Similarity Knowledge into Functional Genomics: An Exploratory Study.将本体驱动的相似性知识融入功能基因组学:一项探索性研究。
BIBE 2004. 2004 May;2004:317-324. doi: 10.1109/BIBE.2004.1317360.
5
Rank-based genome-wide analysis reveals the association of ryanodine receptor-2 gene variants with childhood asthma among human populations.基于等级的全基因组分析揭示了兰尼碱受体 2 基因变异与人群中儿童哮喘的关联。
Hum Genomics. 2013 Jul 5;7(1):16. doi: 10.1186/1479-7364-7-16.
6
Data integration in genetics and genomics: methods and challenges.遗传学与基因组学中的数据整合:方法与挑战
Hum Genomics Proteomics. 2009 Jan 12;2009:869093. doi: 10.4061/2009/869093.
7
Spectral affinity in protein networks.蛋白质网络中的光谱亲和力。
BMC Syst Biol. 2009 Nov 29;3:112. doi: 10.1186/1752-0509-3-112.
8
Detection of mammalian virulence determinants in highly pathogenic avian influenza H5N1 viruses: multivariate analysis of published data.高致病性禽流感H5N1病毒中哺乳动物毒力决定因素的检测:已发表数据的多变量分析
J Virol. 2009 Oct;83(19):9901-10. doi: 10.1128/JVI.00608-09. Epub 2009 Jul 22.
9
Predicting and validating protein interactions using network structure.利用网络结构预测和验证蛋白质相互作用。
PLoS Comput Biol. 2008 Jul 25;4(7):e1000118. doi: 10.1371/journal.pcbi.1000118.
10
Mining and state-space modeling and verification of sub-networks from large-scale biomolecular networks.从大规模生物分子网络中挖掘子网并进行状态空间建模与验证。
BMC Bioinformatics. 2007 Aug 31;8:324. doi: 10.1186/1471-2105-8-324.
J Mol Biol. 2001 Dec 14;314(5):1053-66. doi: 10.1006/jmbi.2000.5219.
4
Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae.酿酒酵母转录组与相互作用组图谱数据之间的相关性
Nat Genet. 2001 Dec;29(4):482-6. doi: 10.1038/ng776.
5
Interrelating different types of genomic data, from proteome to secretome: 'oming in on function.关联不同类型的基因组数据,从蛋白质组到分泌蛋白质组:深入探究功能。
Genome Res. 2001 Sep;11(9):1463-8. doi: 10.1101/gr.207401.
6
Global analysis of protein activities using proteome chips.使用蛋白质组芯片对蛋白质活性进行全局分析。
Science. 2001 Sep 14;293(5537):2101-5. doi: 10.1126/science.1062191. Epub 2001 Jul 26.
7
PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information.部件列表:一个基于网络的系统,用于根据不同属性(包括全基因组表达和相互作用信息)对蛋白质折叠进行动态排名。
Nucleic Acids Res. 2001 Apr 15;29(8):1750-64. doi: 10.1093/nar/29.8.1750.
8
A comprehensive two-hybrid analysis to explore the yeast protein interactome.一项探索酵母蛋白质相互作用组的全面双杂交分析。
Proc Natl Acad Sci U S A. 2001 Apr 10;98(8):4569-74. doi: 10.1073/pnas.061034498. Epub 2001 Mar 13.
9
Regulatory element detection using correlation with expression.利用与表达的相关性进行调控元件检测。
Nat Genet. 2001 Feb;27(2):167-71. doi: 10.1038/84792.
10
General nonlinear framework for the analysis of gene interaction via multivariate expression arrays.通过多变量表达阵列分析基因相互作用的一般非线性框架。
J Biomed Opt. 2000 Oct;5(4):411-24. doi: 10.1117/1.1289142.