iTAGPred：一种用于识别血管生成和肿瘤血管生成生物标志物的两级预测模型。

iTAGPred: A Two-Level Prediction Model for Identification of Angiogenesis and Tumor Angiogenesis Biomarkers.

作者信息

Allehaibi Khalid, Daanial Khan Yaser, Khan Sher Afzal

机构信息

Department of Computer Science, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia.

Department of Computer Science, University of Management and Technology, Lahore, Pakistan.

出版信息

Appl Bionics Biomech. 2021 Sep 27;2021:2803147. doi: 10.1155/2021/2803147. eCollection 2021.

DOI:10.1155/2021/2803147

PMID:34616486

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8490072/

Abstract

A crucial biological process called angiogenesis plays a vital role in migration, growth, and wound healing of endothelial cells and other processes that are controlled by chemical signals. Angiogenesis is the process that controls the growth of blood vessels within tissues while angiogenesis proteins play a significant role in the proper working of this process. The balancing of these signals is necessary for the proper working of angiogenesis. Unbalancing of these signals increases blood vessel formation, which causes abnormal growth or several diseases including cancer. The proposed work focuses on developing a two-layered prediction model using different classifiers like random forest (RF), neural network, and support vector machine. The first level performs in silico identification of angiogenesis proteins based on the primary structure. In the case the protein is an angiogenesis protein, then the second level predicts whether the protein is linked with tumor angiogenesis or not. The performance of the model is evaluated through various validation techniques. The model was evaluated using -fold cross-validation, independent, self-consistency, and jackknife testing. The overall accuracy using an RF classifier for angiogenesis at the first level was 97.8% and for tumor angiogenesis at the second level was 99.5%, ANN showed 94.1% accuracy for angiogenesis and 79.9% for tumor angiogenesis, and the accuracy of SVM for angiogenesis was 78.8% and for tumor angiogenesis was 65.19%.

摘要

一种名为血管生成的关键生物学过程在内皮细胞的迁移、生长和伤口愈合以及其他由化学信号控制的过程中起着至关重要的作用。血管生成是控制组织内血管生长的过程，而血管生成蛋白在这一过程的正常运作中发挥着重要作用。这些信号的平衡对于血管生成的正常运作是必要的。这些信号的失衡会增加血管形成，从而导致异常生长或包括癌症在内的多种疾病。所提出的工作重点是使用随机森林（RF）、神经网络和支持向量机等不同分类器开发一个两层预测模型。第一级基于一级结构对血管生成蛋白进行计算机识别。如果该蛋白质是血管生成蛋白，那么第二级预测该蛋白质是否与肿瘤血管生成有关。通过各种验证技术对模型的性能进行评估。该模型使用 - 折交叉验证、独立、自一致性和留一法测试进行评估。在第一级使用RF分类器对血管生成的总体准确率为97.8%，对肿瘤血管生成在第二级的准确率为99.5%，人工神经网络对血管生成的准确率为94.1%，对肿瘤血管生成的准确率为79.9%，支持向量机对血管生成的准确率为78.8%，对肿瘤血管生成的准确率为65.19%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eba8/8490072/78f6b5b6f3c8/ABB2021-2803147.001.jpg

相似文献

iTAGPred: A Two-Level Prediction Model for Identification of Angiogenesis and Tumor Angiogenesis Biomarkers.iTAGPred：一种用于识别血管生成和肿瘤血管生成生物标志物的两级预测模型。

Appl Bionics Biomech. 2021 Sep 27;2021:2803147. doi: 10.1155/2021/2803147. eCollection 2021.

DP-BINDER: machine learning model for prediction of DNA-binding proteins by fusing evolutionary and physicochemical information.DP-BINDER：一种通过融合进化和物理化学信息来预测 DNA 结合蛋白的机器学习模型。

J Comput Aided Mol Des. 2019 Jul;33(7):645-658. doi: 10.1007/s10822-019-00207-x. Epub 2019 May 23.

Predictive Modeling for Frailty Conditions in Elderly People: Machine Learning Approaches.老年人衰弱状况的预测建模：机器学习方法

JMIR Med Inform. 2020 Jun 4;8(6):e16678. doi: 10.2196/16678.

iHyd-LysSite (EPSV): Identifying Hydroxylysine Sites in Protein Using Statistical Formulation by Extracting Enhanced Position and Sequence Variant Feature Technique.iHyd-LysSite（EPSV）：通过提取增强位置和序列变异特征技术，使用统计公式识别蛋白质中的羟赖氨酸位点。

Curr Genomics. 2020 Nov;21(7):536-545. doi: 10.2174/1389202921999200831142629.

A comparative study of support vector machine, artificial neural network and bayesian classifier for mutagenicity prediction.支持向量机、人工神经网络和贝叶斯分类器在致突变性预测中的比较研究。

Interdiscip Sci. 2011 Sep;3(3):232-9. doi: 10.1007/s12539-011-0102-9. Epub 2011 Jun 14.

Research on an Identification Method for Gas Disaster Risk Based on the Selective Ensemble Classification Model.基于选择性集成分类模型的瓦斯灾害风险识别方法研究

ACS Omega. 2021 May 25;6(22):14059-14067. doi: 10.1021/acsomega.1c00426. eCollection 2021 Jun 8.

DNAPred_Prot: Identification of DNA-Binding Proteins Using Composition- and Position-Based Features.DNAPred_Prot：利用基于组成和位置的特征识别DNA结合蛋白。

Appl Bionics Biomech. 2022 Apr 13;2022:5483115. doi: 10.1155/2022/5483115. eCollection 2022.

AOPs-SVM: A Sequence-Based Classifier of Antioxidant Proteins Using a Support Vector Machine.AOPs-SVM：一种基于序列的使用支持向量机的抗氧化蛋白分类器。

Front Bioeng Biotechnol. 2019 Sep 18;7:224. doi: 10.3389/fbioe.2019.00224. eCollection 2019.

Prediction of students' awareness level towards ICT and mobile technology in Indian and Hungarian University for the real-time: preliminary results.印度和匈牙利大学学生对信息通信技术和移动技术实时认知水平的预测：初步结果。

Heliyon. 2019 Jun 18;5(6):e01806. doi: 10.1016/j.heliyon.2019.e01806. eCollection 2019 Jun.

MACHINE LEARNING ALGORITHMS FOR IDENTIFICATION OF ABNORMAL GLOW CURVES AND ASSOCIATED ABNORMALITY IN CaSO4:DY-BASED PERSONNEL MONITORING DOSIMETERS.机器算法识别基于 CaSO4:Dy 的个人剂量计异常发光曲线及相关异常。

Radiat Prot Dosimetry. 2020 Sep 16;190(3):342-351. doi: 10.1093/rpd/ncaa108.

引用本文的文献

PADG-Pred: Exploring Ensemble Approaches for Identifying Parkinson's Disease Associated Biomarkers Using Genomic Sequences Analysis.PADG-Pred：利用基因组序列分析探索用于识别帕金森病相关生物标志物的集成方法。

IET Syst Biol. 2025 Jan-Dec;19(1):e70006. doi: 10.1049/syb2.70006.

eNSMBL-PASD: Spearheading early autism spectrum disorder detection through advanced genomic computational frameworks utilizing ensemble learning models.欧洲生物信息学研究所自闭症谱系障碍预测分析系统（eNSMBL-PASD）：通过利用集成学习模型的先进基因组计算框架引领早期自闭症谱系障碍检测。

Digit Health. 2025 Jan 27;11:20552076241313407. doi: 10.1177/20552076241313407. eCollection 2025 Jan-Dec.

RCCC_Pred: A Novel Method for Sequence-Based Identification of Renal Clear Cell Carcinoma Genes through DNA Mutations and a Blend of Features.RCCC_Pred：一种通过DNA突变和特征融合基于序列鉴定肾透明细胞癌基因的新方法。

Diagnostics (Basel). 2022 Dec 3;12(12):3036. doi: 10.3390/diagnostics12123036.

Evaluation of deep learning techniques for identification of sarcoma-causing carcinogenic mutations.用于识别肉瘤致癌突变的深度学习技术评估

Digit Health. 2022 Oct 22;8:20552076221133703. doi: 10.1177/20552076221133703. eCollection 2022 Jan-Dec.

本文引用的文献

iSUMOK-PseAAC: prediction of lysine sumoylation sites using statistical moments and Chou's PseAAC.iSUMOK-PseAAC：利用统计矩和周氏伪氨基酸组成预测赖氨酸的类泛素化位点

PeerJ. 2021 Aug 4;9:e11581. doi: 10.7717/peerj.11581. eCollection 2021.

iGluK-Deep: computational identification of lysine glutarylation sites using deep neural networks with general pseudo amino acid compositions.iGluK-Deep：利用具有通用伪氨基酸组成的深度神经网络对赖氨酸戊二酰化位点进行计算识别。

J Biomol Struct Dyn. 2022;40(22):11691-11704. doi: 10.1080/07391102.2021.1962738. Epub 2021 Aug 16.

Evaluating machine learning methodologies for identification of cancer driver genes.评估用于识别癌症驱动基因的机器学习方法。

Sci Rep. 2021 Jun 10;11(1):12281. doi: 10.1038/s41598-021-91656-8.

Optimization of serine phosphorylation prediction in proteins by comparing human engineered features and deep representations.通过比较人类工程特征和深度表示来优化蛋白质丝氨酸磷酸化预测。

Anal Biochem. 2021 Feb 15;615:114069. doi: 10.1016/j.ab.2020.114069. Epub 2020 Dec 16.

Curr Genomics. 2020 Nov;21(7):536-545. doi: 10.2174/1389202921999200831142629.

Identification of 4-carboxyglutamate residue sites based on position based statistical feature and multiple classification.基于位置的统计特征和多分类识别 4-羧基谷氨酸残基位点

Sci Rep. 2020 Oct 9;10(1):16913. doi: 10.1038/s41598-020-73107-y.

A Sequence-Based Predictor of Zika Virus Proteins Developed by Integration of PseAAC and Statistical Moments.基于序列的 Zika 病毒蛋白预测器的开发，通过 PseAAC 与统计矩的整合。

Comb Chem High Throughput Screen. 2020;23(8):797-804. doi: 10.2174/1386207323666200428115449.

Using CHOU'S 5-Steps Rule to Predict O-Linked Serine Glycosylation Sites by Blending Position Relative Features and Statistical Moment.使用 CHOU'S 5 步规则，通过混合位置相对特征和统计矩来预测 O-链接丝氨酸糖基化位点。

IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):2045-2056. doi: 10.1109/TCBB.2020.2968441. Epub 2021 Oct 11.

TargetAntiAngio: A Sequence-Based Tool for the Prediction and Analysis of Anti-Angiogenic Peptides.TargetAntiAngio：一种基于序列的抗血管生成肽预测和分析工具。

Int J Mol Sci. 2019 Jun 17;20(12):2950. doi: 10.3390/ijms20122950.

Multiscale modeling reveals angiogenesis-induced drug resistance in brain tumors and predicts a synergistic drug combination targeting EGFR and VEGFR pathways.多尺度建模揭示了血管生成诱导的脑肿瘤耐药性，并预测了针对 EGFR 和 VEGFR 通路的协同药物组合。

BMC Bioinformatics. 2019 May 1;20(Suppl 7):203. doi: 10.1186/s12859-019-2737-1.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

iTAGPred：一种用于识别血管生成和肿瘤血管生成生物标志物的两级预测模型。

iTAGPred: A Two-Level Prediction Model for Identification of Angiogenesis and Tumor Angiogenesis Biomarkers.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献