Suppr超能文献

拟南芥中潜在分子相互作用的计算识别

Computational identification of potential molecular interactions in Arabidopsis.

作者信息

Lin Mingzhi, Hu Bin, Chen Lijuan, Sun Peng, Fan Yi, Wu Ping, Chen Xin

机构信息

Department of Bioinformatics, Zhejiang University, Hangzhou, People's Republic of China, 310058.

出版信息

Plant Physiol. 2009 Sep;151(1):34-46. doi: 10.1104/pp.109.141317. Epub 2009 Jul 10.

Abstract

Knowledge of the protein interaction network is useful to assist molecular mechanism studies. Several major repositories have been established to collect and organize reported protein interactions. Many interactions have been reported in several model organisms, yet a very limited number of plant interactions can thus far be found in these major databases. Computational identification of potential plant interactions, therefore, is desired to facilitate relevant research. In this work, we constructed a support vector machine model to predict potential Arabidopsis (Arabidopsis thaliana) protein interactions based on a variety of indirect evidence. In a 100-iteration bootstrap evaluation, the confidence of our predicted interactions was estimated to be 48.67%, and these interactions were expected to cover 29.02% of the entire interactome. The sensitivity of our model was validated with an independent evaluation data set consisting of newly reported interactions that did not overlap with the examples used in model training and testing. Results showed that our model successfully recognized 28.91% of the new interactions, similar to its expected sensitivity (29.02%). Applying this model to all possible Arabidopsis protein pairs resulted in 224,206 potential interactions, which is the largest and most accurate set of predicted Arabidopsis interactions at present. In order to facilitate the use of our results, we present the Predicted Arabidopsis Interactome Resource, with detailed annotations and more specific per interaction confidence measurements. This database and related documents are freely accessible at http://www.cls.zju.edu.cn/pair/.

摘要

蛋白质相互作用网络的知识有助于辅助分子机制研究。已经建立了几个主要的数据库来收集和整理已报道的蛋白质相互作用。在几种模式生物中已经报道了许多相互作用,但目前在这些主要数据库中发现的植物相互作用数量非常有限。因此,需要通过计算来识别潜在的植物相互作用,以促进相关研究。在这项工作中,我们基于各种间接证据构建了一个支持向量机模型,用于预测拟南芥(Arabidopsis thaliana)潜在的蛋白质相互作用。在100次迭代的自助评估中,我们预测的相互作用的置信度估计为48.67%,并且这些相互作用预计覆盖整个相互作用组的29.02%。我们的模型的敏感性通过一个独立的评估数据集进行了验证,该数据集由新报道的相互作用组成,这些相互作用与模型训练和测试中使用的示例不重叠。结果表明,我们的模型成功识别了28.91%的新相互作用,与其预期的敏感性(29.02%)相似。将该模型应用于所有可能的拟南芥蛋白质对,得到了224,206个潜在的相互作用,这是目前最大且最准确的一组预测的拟南芥相互作用。为了便于使用我们的结果,我们提供了预测的拟南芥相互作用组资源,带有详细的注释和更具体的每个相互作用的置信度测量。该数据库和相关文档可在http://www.cls.zju.edu.cn/pair/免费获取。

相似文献

2
PAIR: the predicted Arabidopsis interactome resource.PAIR:预测的拟南芥相互作用组资源。
Nucleic Acids Res. 2011 Jan;39(Database issue):D1134-40. doi: 10.1093/nar/gkq938. Epub 2010 Oct 15.
4
A predicted interactome for Arabidopsis.拟南芥的预测相互作用组。
Plant Physiol. 2007 Oct;145(2):317-29. doi: 10.1104/pp.107.103465. Epub 2007 Aug 3.
7
The TAIR database.TAIR数据库。
Methods Mol Biol. 2007;406:179-212. doi: 10.1007/978-1-59745-535-0_8.

引用本文的文献

5
FunCoup 3.0: database of genome-wide functional coupling networks.FunCoup 3.0:全基因组功能耦合网络数据库。
Nucleic Acids Res. 2014 Jan;42(Database issue):D380-8. doi: 10.1093/nar/gkt984. Epub 2013 Oct 31.
9
Os11Gsk gene from a wild rice, Oryza rufipogon improves yield in rice.野生稻 Os11Gsk 基因可提高水稻产量。
Funct Integr Genomics. 2012 Jun;12(2):277-89. doi: 10.1007/s10142-012-0265-4. Epub 2012 Feb 25.

本文引用的文献

4
ATTED-II provides coexpressed gene networks for Arabidopsis.ATTED-II为拟南芥提供共表达基因网络。
Nucleic Acids Res. 2009 Jan;37(Database issue):D987-91. doi: 10.1093/nar/gkn807. Epub 2008 Oct 25.
5
Identifying protein domains with the Pfam database.使用Pfam数据库鉴定蛋白质结构域。
Curr Protoc Bioinformatics. 2008 Sep;Chapter 2:2.5.1-2.5.17. doi: 10.1002/0471250953.bi0205s23.
8
Predicting co-complexed protein pairs from heterogeneous data.从异构数据中预测共复合蛋白质对。
PLoS Comput Biol. 2008 Apr 18;4(4):e1000054. doi: 10.1371/journal.pcbi.1000054.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验