Training set expansion: an approach to improving the reconstruction of biological networks from limited and uneven reliable interactions.

Authors

Yip Kevin Y, Gerstein Mark

Affiliation

Department of Computer Science, Yale University, New Haven, CT 06511, USA.

Publication information

Bioinformatics. 2009 Jan 15;25(2):243-50. doi: 10.1093/bioinformatics/btn602. Epub 2008 Nov 17.

DOI:10.1093/bioinformatics/btn602
PMID:19015141
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2639005/

Abstract

MOTIVATION

An important problem in systems biology is reconstructing complete networks of interactions between biological objects by extrapolating from a few known interactions as examples. While there are many computational techniques proposed for this network reconstruction task, their accuracy is consistently limited by the small number of high-confidence examples, and the uneven distribution of these examples across the potential interaction space, with some objects having many known interactions and others few.

RESULTS

To address this issue, we propose two computational methods based on the concept of training set expansion. They work particularly effectively in conjunction with kernel approaches, which are a popular class of approaches for fusing together many disparate types of features. Both our methods are based on semi-supervised learning and involve augmenting the limited number of gold-standard training instances with carefully chosen and highly confident auxiliary examples. The first method, prediction propagation, propagates highly confident predictions of one local model to another as the auxiliary examples, thus learning from information-rich regions of the training network to help predict the information-poor regions. The second method, kernel initialization, takes the most similar and most dissimilar objects of each object in a global kernel as the auxiliary examples. Using several sets of experimentally verified protein-protein interactions from yeast, we show that training set expansion gives a measurable performance gain over a number of representative, state-of-the-art network reconstruction methods, and it can correctly identify some interactions that are ranked low by other methods due to the lack of training examples of the involved proteins.
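As a rough illustration of the second method, kernel initialization, the sketch below picks the most similar and most dissimilar objects of each object in a global kernel matrix as auxiliary positive and negative examples. The function name `kernel_initialization`, the parameter `k` (auxiliary examples per object on each side), and the toy kernel values are assumptions made for this example; the abstract does not specify how many auxiliary examples are chosen or how the kernel is built, and the local models that would then be trained on the expanded sets are not shown.

```python
import numpy as np

def kernel_initialization(K, k=1):
    """Pick auxiliary examples for every object from a global kernel matrix.

    K : (n, n) symmetric similarity (kernel) matrix between objects.
    k : hypothetical parameter, number of auxiliary examples per side.

    Returns two dicts mapping each object index to
      - its k most similar other objects (auxiliary positives), and
      - its k most dissimilar other objects (auxiliary negatives).
    """
    K = np.asarray(K, dtype=float)
    n = K.shape[0]
    aux_pos, aux_neg = {}, {}
    for i in range(n):
        # Exclude the object itself; it is trivially its own nearest neighbour.
        others = np.array([j for j in range(n) if j != i])
        # Sort the remaining objects by ascending similarity to object i.
        order = others[np.argsort(K[i, others], kind="stable")]
        aux_neg[i] = order[:k].tolist()         # least similar first
        aux_pos[i] = order[-k:][::-1].tolist()  # most similar first
    return aux_pos, aux_neg

# Toy 4-object kernel (made-up values, for illustration only).
K_toy = np.array([
    [1.0, 0.9, 0.1, 0.5],
    [0.9, 1.0, 0.2, 0.4],
    [0.1, 0.2, 1.0, 0.8],
    [0.5, 0.4, 0.8, 1.0],
])
aux_pos, aux_neg = kernel_initialization(K_toy, k=1)
print(aux_pos)
print(aux_neg)
```

In a full pipeline these auxiliary pairs would be added to the gold-standard interactions before training. The sketch treats all selected pairs alike, so any confidence threshold on the kernel values would need to be added separately.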

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5053/2639005/65a68a77c37b/btn602f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5053/2639005/7609c876db16/btn602f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5053/2639005/e2bec076cc9d/btn602f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5053/2639005/46ce966b9574/btn602f4.jpg