• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iMPTCE-Hnetwork:一种基于异构网络的用于识别化学物质和酶代谢途径类型的多标签分类器。

iMPTCE-Hnetwork: A Multilabel Classifier for Identifying Metabolic Pathway Types of Chemicals and Enzymes with a Heterogeneous Network.

机构信息

College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China.

State Key Laboratory of Livestock and Poultry Breeding, Guangdong Public Laboratory of Animal Breeding and Nutrition, Guangdong Provincial Key Laboratory of Animal Breeding and Nutrition, Institute of Animal Science, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China.

出版信息

Comput Math Methods Med. 2021 Jan 4;2021:6683051. doi: 10.1155/2021/6683051. eCollection 2021.

DOI:10.1155/2021/6683051
PMID:33488764
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7803417/
Abstract

Metabolic pathway is an important type of biological pathways. It produces essential molecules and energies to maintain the life of living organisms. Each metabolic pathway consists of a chain of chemical reactions, which always need enzymes to participate in. Thus, chemicals and enzymes are two major components for each metabolic pathway. Although several metabolic pathways have been uncovered, the metabolic pathway system is still far from complete. Some hidden chemicals or enzymes are not discovered in a certain metabolic pathway. Besides the traditional experiments to detect hidden chemicals or enzymes, an alternative pipeline is to design efficient computational methods. In this study, we proposed a powerful multilabel classifier, called iMPTCE-Hnetwork, to uniformly assign chemicals and enzymes to metabolic pathway types reported in KEGG. Such classifier adopted the embedding features derived from a heterogeneous network, which defined chemicals and enzymes as nodes and the interactions between chemicals and enzymes as edges, through a powerful network embedding algorithm, Mashup. The popular RAndom k-labELsets (RAKEL) algorithm was employed to construct the classifier, which incorporated the support vector machine (polynomial kernel) as the basic classifier. The ten-fold cross-validation results indicated that such a classifier had good performance with accuracy higher than 0.800 and exact match higher than 0.750. Several comparisons were done to indicate the superiority of the iMPTCE-Hnetwork.

摘要

代谢途径是一种重要的生物途径类型。它产生维持生物生命所需的基本分子和能量。每条代谢途径由一系列化学反应组成,这些反应通常需要酶的参与。因此,化学物质和酶是每条代谢途径的两个主要组成部分。尽管已经发现了几种代谢途径,但代谢途径系统还远未完善。某些代谢途径中仍有一些隐藏的化学物质或酶未被发现。除了传统的检测隐藏化学物质或酶的实验方法外,还可以设计高效的计算方法。在这项研究中,我们提出了一种强大的多标签分类器,称为 iMPTCE-Hnetwork,用于将 KEGG 中报道的代谢途径类型统一分配给化学物质和酶。该分类器采用了来自异构网络的嵌入特征,通过强大的网络嵌入算法 Mashup 将化学物质和酶定义为节点,将化学物质和酶之间的相互作用定义为边。流行的 RAndom k-labELsets (RAKEL) 算法被用于构建分类器,该算法将支持向量机(多项式核)作为基本分类器。十折交叉验证结果表明,该分类器具有良好的性能,准确率高于 0.800,精确匹配率高于 0.750。进行了几次比较以表明 iMPTCE-Hnetwork 的优越性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27bd/7803417/0f00a5812e92/CMMM2021-6683051.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27bd/7803417/709b55688b1f/CMMM2021-6683051.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27bd/7803417/e32d5f28014a/CMMM2021-6683051.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27bd/7803417/60c0b822a6b8/CMMM2021-6683051.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27bd/7803417/1d5734dcad85/CMMM2021-6683051.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27bd/7803417/0f00a5812e92/CMMM2021-6683051.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27bd/7803417/709b55688b1f/CMMM2021-6683051.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27bd/7803417/e32d5f28014a/CMMM2021-6683051.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27bd/7803417/60c0b822a6b8/CMMM2021-6683051.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27bd/7803417/1d5734dcad85/CMMM2021-6683051.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27bd/7803417/0f00a5812e92/CMMM2021-6683051.005.jpg

相似文献

1
iMPTCE-Hnetwork: A Multilabel Classifier for Identifying Metabolic Pathway Types of Chemicals and Enzymes with a Heterogeneous Network.iMPTCE-Hnetwork:一种基于异构网络的用于识别化学物质和酶代谢途径类型的多标签分类器。
Comput Math Methods Med. 2021 Jan 4;2021:6683051. doi: 10.1155/2021/6683051. eCollection 2021.
2
A Binary Classifier for Prediction of the Types of Metabolic Pathway of Chemicals.用于预测化学物质代谢途径类型的二元分类器。
Comb Chem High Throughput Screen. 2017;20(2):140-146. doi: 10.2174/1386207319666161215142130.
3
A Network Integration Method for Deciphering the Types of Metabolic Pathway of Chemicals with Heterogeneous Information.一种用于利用异构信息解读化学物质代谢途径类型的网络整合方法。
Comb Chem High Throughput Screen. 2018;21(9):670-680. doi: 10.2174/1386207322666181206112641.
4
iATC-NRAKEL: an efficient multi-label classifier for recognizing anatomical therapeutic chemical classes of drugs.iATC-NRAKEL:一种用于识别药物解剖治疗化学类别的高效多标签分类器。
Bioinformatics. 2020 Mar 1;36(5):1391-1396. doi: 10.1093/bioinformatics/btz757.
5
Drug Target Group Prediction with Multiple Drug Networks.基于多个药物网络的药物靶标群组预测。
Comb Chem High Throughput Screen. 2020;23(4):274-284. doi: 10.2174/1386207322666190702103927.
6
PredictEFC: a fast and efficient multi-label classifier for predicting enzyme family classes.PredictEFC:一种用于预测酶家族类别的快速高效的多标签分类器。
BMC Bioinformatics. 2024 Jan 30;25(1):50. doi: 10.1186/s12859-024-05665-1.
7
iMPT-FDNPL: Identification of Membrane Protein Types with Functional Domains and a Natural Language Processing Approach.iMPT-FDNPL:基于功能域和自然语言处理方法识别膜蛋白类型。
Comput Math Methods Med. 2021 Oct 11;2021:7681497. doi: 10.1155/2021/7681497. eCollection 2021.
8
Predicting the types of metabolic pathway of compounds using molecular fragments and sequential minimal optimization.利用分子片段和序列最小优化预测化合物的代谢途径类型
Comb Chem High Throughput Screen. 2016;19(2):136-43. doi: 10.2174/1386207319666151110122453.
9
An efficient algorithm for de novo predictions of biochemical pathways between chemical compounds.一种用于化合物间生化途径从头预测的高效算法。
BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S8. doi: 10.1186/1471-2105-13-S17-S8. Epub 2012 Dec 13.
10
Efficient searching and annotation of metabolic networks using chemical similarity.利用化学相似性对代谢网络进行高效搜索和注释。
Bioinformatics. 2015 Apr 1;31(7):1016-24. doi: 10.1093/bioinformatics/btu760. Epub 2014 Nov 21.

引用本文的文献

1
Prediction of plant secondary metabolic pathways using deep transfer learning.基于深度迁移学习的植物次生代谢途径预测。
BMC Bioinformatics. 2023 Sep 19;24(1):348. doi: 10.1186/s12859-023-05485-9.
2
Identification of cortical interneuron cell markers in mouse embryos based on machine learning analysis of single-cell transcriptomics.基于单细胞转录组学的机器学习分析鉴定小鼠胚胎中的皮质中间神经元细胞标记物。
Front Neurosci. 2022 Jul 15;16:841145. doi: 10.3389/fnins.2022.841145. eCollection 2022.
3
MLGL-MP: a Multi-Label Graph Learning framework enhanced by pathway interdependence for Metabolic Pathway prediction.

本文引用的文献

1
Discriminating Origin Tissues of Tumor Cell Lines by Methylation Signatures and Dys-Methylated Rules.通过甲基化特征和异常甲基化规则鉴别肿瘤细胞系的起源组织
Front Bioeng Biotechnol. 2020 May 26;8:507. doi: 10.3389/fbioe.2020.00507. eCollection 2020.
2
Prediction of Drug Side Effects with a Refined Negative Sample Selection Strategy.采用改进的负样本选择策略预测药物副作用。
Comput Math Methods Med. 2020 May 9;2020:1573543. doi: 10.1155/2020/1573543. eCollection 2020.
3
iATC-FRAKEL: a simple multi-label web server for recognizing anatomical therapeutic chemical classes of drugs with their fingerprints only.
MLGL-MP:一种通过途径相互依赖性增强的多标签图学习框架,用于代谢途径预测。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i325-i332. doi: 10.1093/bioinformatics/btac222.
4
Identification of Type 2 Diabetes Biomarkers From Mixed Single-Cell Sequencing Data With Feature Selection Methods.利用特征选择方法从混合单细胞测序数据中鉴定2型糖尿病生物标志物
Front Bioeng Biotechnol. 2022 Jun 2;10:890901. doi: 10.3389/fbioe.2022.890901. eCollection 2022.
5
Identifying Functions of Proteins in Mice With Functional Embedding Features.利用功能嵌入特征鉴定小鼠体内蛋白质的功能
Front Genet. 2022 May 16;13:909040. doi: 10.3389/fgene.2022.909040. eCollection 2022.
6
Detecting Brain Structure-Specific Methylation Signatures and Rules for Alzheimer's Disease.检测阿尔茨海默病的脑结构特异性甲基化特征及规律
Front Neurosci. 2022 May 2;16:895181. doi: 10.3389/fnins.2022.895181. eCollection 2022.
7
Identifying luminal and basal mammary cell specific genes and their expression patterns during pregnancy.鉴定妊娠期间腔上皮和基底乳腺细胞特异基因及其表达模式。
PLoS One. 2022 Apr 29;17(4):e0267211. doi: 10.1371/journal.pone.0267211. eCollection 2022.
8
Similarity-Based Method with Multiple-Feature Sampling for Predicting Drug Side Effects.基于相似性的多特征采样方法预测药物副作用。
Comput Math Methods Med. 2022 Apr 1;2022:9547317. doi: 10.1155/2022/9547317. eCollection 2022.
9
Identification of Novel Lung Cancer Driver Genes Connecting Different Omics Levels With a Heat Diffusion Algorithm.利用热扩散算法鉴定连接不同组学水平的新型肺癌驱动基因
Front Cell Dev Biol. 2022 Jan 26;10:825272. doi: 10.3389/fcell.2022.825272. eCollection 2022.
10
Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions.基于关键序列特征和分布预测 RNA 5-甲基胞嘧啶位点
Biomed Res Int. 2022 Jan 13;2022:4035462. doi: 10.1155/2022/4035462. eCollection 2022.
iATC-FRAKEL:一个简单的多标签网络服务器,仅使用药物的指纹识别其解剖治疗化学类别。
Bioinformatics. 2020 Jun 1;36(11):3568-3569. doi: 10.1093/bioinformatics/btaa166.
4
A deep learning architecture for metabolic pathway prediction.一种用于代谢途径预测的深度学习架构。
Bioinformatics. 2020 Apr 15;36(8):2547-2553. doi: 10.1093/bioinformatics/btz954.
5
iATC-NRAKEL: an efficient multi-label classifier for recognizing anatomical therapeutic chemical classes of drugs.iATC-NRAKEL:一种用于识别药物解剖治疗化学类别的高效多标签分类器。
Bioinformatics. 2020 Mar 1;36(5):1391-1396. doi: 10.1093/bioinformatics/btz757.
6
Drug Target Group Prediction with Multiple Drug Networks.基于多个药物网络的药物靶标群组预测。
Comb Chem High Throughput Screen. 2020;23(4):274-284. doi: 10.2174/1386207322666190702103927.
7
A Network Integration Method for Deciphering the Types of Metabolic Pathway of Chemicals with Heterogeneous Information.一种用于利用异构信息解读化学物质代谢途径类型的网络整合方法。
Comb Chem High Throughput Screen. 2018;21(9):670-680. doi: 10.2174/1386207322666181206112641.
8
A similarity-based method for prediction of drug side effects with heterogeneous information.基于相似性的方法预测具有异构信息的药物副作用。
Math Biosci. 2018 Dec;306:136-144. doi: 10.1016/j.mbs.2018.09.010. Epub 2018 Oct 5.
9
Predicting overlapping protein complexes based on core-attachment and a local modularity structure.基于核心附着和局部模块结构预测重叠蛋白质复合物。
BMC Bioinformatics. 2018 Aug 22;19(1):305. doi: 10.1186/s12859-018-2309-9.
10
Predicting membrane protein types by incorporating a novel feature set into Chou's general PseAAC.通过将一种新的特征集纳入到周的通用 PseAAC 中,来预测膜蛋白类型。
J Theor Biol. 2018 Oct 14;455:319-328. doi: 10.1016/j.jtbi.2018.07.032. Epub 2018 Jul 27.