• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过成对有理核进行代谢网络预测。

Metabolic network prediction through pairwise rational kernels.

作者信息

Roche-Lima Abiel, Domaratzki Michael, Fristensky Brian

机构信息

Department of Computer Science, University of Manitoba, Winnipeg, Manitoba, Canada.

出版信息

BMC Bioinformatics. 2014 Sep 26;15(1):318. doi: 10.1186/1471-2105-15-318.

DOI:10.1186/1471-2105-15-318
PMID:25260372
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4261252/
Abstract

BACKGROUND

Metabolic networks are represented by the set of metabolic pathways. Metabolic pathways are a series of biochemical reactions, in which the product (output) from one reaction serves as the substrate (input) to another reaction. Many pathways remain incompletely characterized. One of the major challenges of computational biology is to obtain better models of metabolic pathways. Existing models are dependent on the annotation of the genes. This propagates error accumulation when the pathways are predicted by incorrectly annotated genes. Pairwise classification methods are supervised learning methods used to classify new pair of entities. Some of these classification methods, e.g., Pairwise Support Vector Machines (SVMs), use pairwise kernels. Pairwise kernels describe similarity measures between two pairs of entities. Using pairwise kernels to handle sequence data requires long processing times and large storage. Rational kernels are kernels based on weighted finite-state transducers that represent similarity measures between sequences or automata. They have been effectively used in problems that handle large amount of sequence information such as protein essentiality, natural language processing and machine translations.

RESULTS

We create a new family of pairwise kernels using weighted finite-state transducers (called Pairwise Rational Kernel (PRK)) to predict metabolic pathways from a variety of biological data. PRKs take advantage of the simpler representations and faster algorithms of transducers. Because raw sequence data can be used, the predictor model avoids the errors introduced by incorrect gene annotations. We then developed several experiments with PRKs and Pairwise SVM to validate our methods using the metabolic network of Saccharomyces cerevisiae. As a result, when PRKs are used, our method executes faster in comparison with other pairwise kernels. Also, when we use PRKs combined with other simple kernels that include evolutionary information, the accuracy values have been improved, while maintaining lower construction and execution times.

CONCLUSIONS

The power of using kernels is that almost any sort of data can be represented using kernels. Therefore, completely disparate types of data can be combined to add power to kernel-based machine learning methods. When we compared our proposal using PRKs with other similar kernel, the execution times were decreased, with no compromise of accuracy. We also proved that by combining PRKs with other kernels that include evolutionary information, the accuracy can also also be improved. As our proposal can use any type of sequence data, genes do not need to be properly annotated, avoiding accumulation errors because of incorrect previous annotations.

摘要

背景

代谢网络由一系列代谢途径表示。代谢途径是一系列生化反应,其中一个反应的产物(输出)作为另一个反应的底物(输入)。许多途径的特征仍不完全清楚。计算生物学的主要挑战之一是获得更好的代谢途径模型。现有模型依赖于基因注释。当通过错误注释的基因预测途径时,这会导致错误积累。成对分类方法是用于对新的实体对进行分类的监督学习方法。其中一些分类方法,例如成对支持向量机(SVM),使用成对核。成对核描述了两对实体之间的相似性度量。使用成对核处理序列数据需要较长的处理时间和大量存储。有理核是基于加权有限状态变换器的核,用于表示序列或自动机之间的相似性度量。它们已有效地应用于处理大量序列信息的问题,如蛋白质必需性、自然语言处理和机器翻译。

结果

我们使用加权有限状态变换器创建了一个新的成对核家族(称为成对有理核(PRK)),以从各种生物数据预测代谢途径。PRK利用了变换器更简单的表示和更快的算法。由于可以使用原始序列数据,预测模型避免了因基因注释错误而引入的误差。然后,我们使用PRK和成对SVM进行了几个实验,以使用酿酒酵母的代谢网络验证我们的方法。结果,当使用PRK时,我们的方法与其他成对核相比执行速度更快。此外,当我们将PRK与其他包含进化信息的简单核结合使用时,准确率有所提高,同时保持较低的构建和执行时间。

结论

使用核的强大之处在于几乎任何类型的数据都可以用核来表示。因此,完全不同类型的数据可以组合起来,为基于核的机器学习方法增添力量。当我们将使用PRK的提议与其他类似核进行比较时,执行时间减少,而不影响准确性。我们还证明,通过将PRK与其他包含进化信息的核结合,可以提高准确率。由于我们的提议可以使用任何类型的序列数据,因此不需要对基因进行正确注释,避免了由于先前错误注释而导致的误差积累。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a8d/4261252/4b886b342cf0/12859_2014_6638_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a8d/4261252/6ec440fbda27/12859_2014_6638_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a8d/4261252/5adb22eb40c1/12859_2014_6638_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a8d/4261252/5bfcc3db5f1c/12859_2014_6638_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a8d/4261252/4b886b342cf0/12859_2014_6638_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a8d/4261252/6ec440fbda27/12859_2014_6638_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a8d/4261252/5adb22eb40c1/12859_2014_6638_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a8d/4261252/5bfcc3db5f1c/12859_2014_6638_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a8d/4261252/4b886b342cf0/12859_2014_6638_Fig4_HTML.jpg

相似文献

1
Metabolic network prediction through pairwise rational kernels.通过成对有理核进行代谢网络预测。
BMC Bioinformatics. 2014 Sep 26;15(1):318. doi: 10.1186/1471-2105-15-318.
2
A new pairwise kernel for biological network inference with support vector machines.一种用于支持向量机生物网络推理的新型成对核。
BMC Bioinformatics. 2007;8 Suppl 10(Suppl 10):S8. doi: 10.1186/1471-2105-8-S10-S8.
3
CompareSVM: supervised, Support Vector Machine (SVM) inference of gene regularity networks.CompareSVM:监督式支持向量机(SVM)对基因调控网络的推断
BMC Bioinformatics. 2014 Nov 30;15(1):395. doi: 10.1186/s12859-014-0395-x.
4
Learning with multiple pairwise kernels for drug bioactivity prediction.使用多种成对核函数进行药物生物活性预测。
Bioinformatics. 2018 Jul 1;34(13):i509-i518. doi: 10.1093/bioinformatics/bty277.
5
LZW-Kernel: fast kernel utilizing variable length code blocks from LZW compressors for protein sequence classification.LZW-Kernel:快速内核,利用 LZW 压缩器中的变长码块对蛋白质序列进行分类。
Bioinformatics. 2018 Oct 1;34(19):3281-3288. doi: 10.1093/bioinformatics/bty349.
6
Implementation and comparison of kernel-based learning methods to predict metabolic networks.用于预测代谢网络的基于核的学习方法的实现与比较
Netw Model Anal Health Inform Bioinform. 2016;5(1):26. doi: 10.1007/s13721-016-0134-5. Epub 2016 Jul 15.
7
Improving prediction of heterodimeric protein complexes using combination with pairwise kernel.利用组合对预测异源二聚体蛋白复合物进行改进,使用成对核函数。
BMC Bioinformatics. 2018 Feb 19;19(Suppl 1):39. doi: 10.1186/s12859-018-2017-5.
8
Kernel methods for predicting protein-protein interactions.用于预测蛋白质-蛋白质相互作用的核方法。
Bioinformatics. 2005 Jun;21 Suppl 1:i38-46. doi: 10.1093/bioinformatics/bti1016.
9
Learned random-walk kernels and empirical-map kernels for protein sequence classification.用于蛋白质序列分类的学习型随机游走核和经验映射核。
J Comput Biol. 2009 Mar;16(3):457-74. doi: 10.1089/cmb.2008.0031.
10
Profile-based direct kernels for remote homology detection and fold recognition.用于远程同源性检测和折叠识别的基于轮廓的直接内核。
Bioinformatics. 2005 Dec 1;21(23):4239-47. doi: 10.1093/bioinformatics/bti687. Epub 2005 Sep 27.

引用本文的文献

1
Machine Learning Study of Metabolic Networks ChEMBL Data of Antibacterial Compounds.基于抗菌化合物 ChEMBL 数据的代谢网络机器学习研究
Mol Pharm. 2022 Jul 4;19(7):2151-2163. doi: 10.1021/acs.molpharmaceut.2c00029. Epub 2022 Jun 7.
2
Implementation and comparison of kernel-based learning methods to predict metabolic networks.用于预测代谢网络的基于核的学习方法的实现与比较
Netw Model Anal Health Inform Bioinform. 2016;5(1):26. doi: 10.1007/s13721-016-0134-5. Epub 2016 Jul 15.

本文引用的文献

1
Metabolic network reconstruction and their topological analysis.代谢网络重建及其拓扑分析。
Methods Mol Biol. 2014;1090:19-38. doi: 10.1007/978-1-62703-688-7_2.
2
Stoichiometric identification with maximum likelihood principal component analysis.基于最大似然主成分分析的化学计量学识别
J Math Biol. 2013 Oct;67(4):739-65. doi: 10.1007/s00285-012-0559-0. Epub 2012 Jul 21.
3
The pathway tools pathway prediction algorithm.通路工具通路预测算法。
Stand Genomic Sci. 2011 Dec 31;5(3):424-9. doi: 10.4056/sigs.1794338. Epub 2011 Dec 23.
4
GENIES: gene network inference engine based on supervised analysis.GENIES:基于监督分析的基因网络推理引擎。
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W162-7. doi: 10.1093/nar/gks459. Epub 2012 May 18.
5
Browsing metabolic and regulatory networks with BioCyc.使用BioCyc浏览代谢和调控网络。
Methods Mol Biol. 2012;804:197-216. doi: 10.1007/978-1-61779-361-5_11.
6
Predicting metabolic pathways by sub-network extraction.通过子网络提取预测代谢途径。
Methods Mol Biol. 2012;804:107-30. doi: 10.1007/978-1-61779-361-5_7.
7
Simple sequence-based kernels do not predict protein-protein interactions.基于简单序列的核函数不能预测蛋白质-蛋白质相互作用。
Bioinformatics. 2010 Oct 15;26(20):2610-4. doi: 10.1093/bioinformatics/btq483. Epub 2010 Aug 27.
8
Note on the sampling error of the difference between correlated proportions or percentages.关于相关比例或百分比差异的抽样误差说明。
Psychometrika. 1947 Jun;12(2):153-7. doi: 10.1007/BF02295996.
9
Support vector machines and kernels for computational biology.用于计算生物学的支持向量机和核函数。
PLoS Comput Biol. 2008 Oct;4(10):e1000173. doi: 10.1371/journal.pcbi.1000173. Epub 2008 Oct 31.
10
A pseudo-stoichiometric dynamic model of anaerobic hydrogen production from molasses.基于糖蜜的厌氧产氢拟化学计量动力学模型。
Water Res. 2008 May;42(10-11):2539-50. doi: 10.1016/j.watres.2008.02.018. Epub 2008 Feb 29.