• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ApoPred:具有多种特征的载脂蛋白及其亚家族的鉴定

ApoPred: Identification of Apolipoproteins and Their Subfamilies With Multifarious Features.

作者信息

Liu Ting, Chen Jia-Mao, Zhang Dan, Zhang Qian, Peng Bowen, Xu Lei, Tang Hua

机构信息

School of Basic Medical Sciences, Southwest Medical University, Luzhou, China.

Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.

出版信息

Front Cell Dev Biol. 2021 Jan 8;8:621144. doi: 10.3389/fcell.2020.621144. eCollection 2020.

DOI:10.3389/fcell.2020.621144
PMID:33490085
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7820372/
Abstract

Apolipoprotein is a group of plasma proteins that are associated with a variety of diseases, such as hyperlipidemia, atherosclerosis, Alzheimer's disease, and diabetes. In order to investigate the function of apolipoproteins and to develop effective targets for related diseases, it is necessary to accurately identify and classify apolipoproteins. Although it is possible to identify apolipoproteins accurately through biochemical experiments, they are expensive and time-consuming. This work aims to establish a high-efficiency and high-accuracy prediction model for recognition of apolipoproteins and their subfamilies. We firstly constructed a high-quality benchmark dataset including 270 apolipoproteins and 535 non-apolipoproteins. Based on the dataset, pseudo-amino acid composition (PseAAC) and composition of k-spaced amino acid pairs (CKSAAP) were used as input vectors. To improve the prediction accuracy and eliminate redundant information, analysis of variance (ANOVA) was used to rank the features. And the incremental feature selection was utilized to obtain the best feature subset. Support vector machine (SVM) was proposed to construct the classification model, which could produce the accuracy of 97.27%, sensitivity of 96.30%, and specificity of 97.76% for discriminating apolipoprotein from non-apolipoprotein in 10-fold cross-validation. In addition, the same process was repeated to generate a new model for predicting apolipoprotein subfamilies. The new model could achieve an overall accuracy of 95.93% in 10-fold cross-validation. According to our proposed model, a convenient webserver called ApoPred was established, which can be freely accessed at http://tang-biolab.com/server/ApoPred/service.html. We expect that this work will contribute to apolipoprotein function research and drug development in relevant diseases.

摘要

载脂蛋白是一类血浆蛋白,与多种疾病相关,如高脂血症、动脉粥样硬化、阿尔茨海默病和糖尿病。为了研究载脂蛋白的功能并开发针对相关疾病的有效靶点,准确识别和分类载脂蛋白是必要的。虽然通过生化实验可以准确识别载脂蛋白,但这些实验成本高且耗时。这项工作旨在建立一个高效、准确的载脂蛋白及其亚家族识别预测模型。我们首先构建了一个高质量的基准数据集,包括270种载脂蛋白和535种非载脂蛋白。基于该数据集,将伪氨基酸组成(PseAAC)和k间隔氨基酸对组成(CKSAAP)用作输入向量。为了提高预测准确性并消除冗余信息,使用方差分析(ANOVA)对特征进行排序。并利用增量特征选择获得最佳特征子集。提出支持向量机(SVM)构建分类模型,在10折交叉验证中,该模型区分载脂蛋白和非载脂蛋白的准确率为97.27%,灵敏度为96.30%,特异性为97.76%。此外,重复相同的过程以生成预测载脂蛋白亚家族的新模型。新模型在10折交叉验证中可实现95.93%的总体准确率。根据我们提出的模型,建立了一个名为ApoPred的便捷网络服务器,可通过http://tang-biolab.com/server/ApoPred/service.html免费访问。我们期望这项工作将有助于载脂蛋白功能研究和相关疾病的药物开发。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/ab8a7df30979/fcell-08-621144-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/c54f4b7b3a59/fcell-08-621144-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/16f434d9a6a9/fcell-08-621144-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/859426a382bb/fcell-08-621144-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/7a3bb3bff6bb/fcell-08-621144-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/849c0d934ab8/fcell-08-621144-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/55dd5cf02351/fcell-08-621144-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/ab8a7df30979/fcell-08-621144-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/c54f4b7b3a59/fcell-08-621144-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/16f434d9a6a9/fcell-08-621144-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/859426a382bb/fcell-08-621144-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/7a3bb3bff6bb/fcell-08-621144-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/849c0d934ab8/fcell-08-621144-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/55dd5cf02351/fcell-08-621144-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8aa3/7820372/ab8a7df30979/fcell-08-621144-g007.jpg

相似文献

1
ApoPred: Identification of Apolipoproteins and Their Subfamilies With Multifarious Features.ApoPred:具有多种特征的载脂蛋白及其亚家族的鉴定
Front Cell Dev Biol. 2021 Jan 8;8:621144. doi: 10.3389/fcell.2020.621144. eCollection 2020.
2
Identification of apolipoprotein using feature selection technique.利用特征选择技术鉴定载脂蛋白。
Sci Rep. 2016 Jul 22;6:30441. doi: 10.1038/srep30441.
3
Comprehensive Prediction of Lipocalin Proteins Using Artificial Intelligence Strategy.基于人工智能策略的载脂蛋白全面预测
Front Biosci (Landmark Ed). 2022 Mar 5;27(3):84. doi: 10.31083/j.fbl2703084.
4
PMTPred: machine-learning-based prediction of protein methyltransferases using the composition of k-spaced amino acid pairs.PMTPred:基于k间隔氨基酸对组成的蛋白质甲基转移酶的机器学习预测
Mol Divers. 2024 Aug;28(4):2301-2315. doi: 10.1007/s11030-024-10937-2. Epub 2024 Jul 21.
5
HBPred: a tool to identify growth hormone-binding proteins.HBPred:一种识别生长激素结合蛋白的工具。
Int J Biol Sci. 2018 May 22;14(8):957-964. doi: 10.7150/ijbs.24174. eCollection 2018.
6
DPP-PseAAC: A DNA-binding protein prediction model using Chou's general PseAAC.DPP-PseAAC:一种基于 Chou 的通用 PseAAC 的 DNA 结合蛋白预测模型。
J Theor Biol. 2018 Sep 7;452:22-34. doi: 10.1016/j.jtbi.2018.05.006. Epub 2018 May 16.
7
PVP-SVM: Sequence-Based Prediction of Phage Virion Proteins Using a Support Vector Machine.PVP-SVM:使用支持向量机基于序列预测噬菌体病毒粒子蛋白
Front Microbiol. 2018 Mar 16;9:476. doi: 10.3389/fmicb.2018.00476. eCollection 2018.
8
ECAmyloid: An amyloid predictor based on ensemble learning and comprehensive sequence-derived features.ECAmyloid:一种基于集成学习和综合序列衍生特征的淀粉样蛋白预测器。
Comput Biol Chem. 2023 Jun;104:107853. doi: 10.1016/j.compbiolchem.2023.107853. Epub 2023 Mar 23.
9
Prediction of mucin-type O-glycosylation sites in mammalian proteins using the composition of k-spaced amino acid pairs.利用k间隔氨基酸对的组成预测哺乳动物蛋白质中的粘蛋白型O-糖基化位点
BMC Bioinformatics. 2008 Feb 18;9:101. doi: 10.1186/1471-2105-9-101.
10
Identification of adaptor proteins using the ANOVA feature selection technique.利用方差分析特征选择技术鉴定衔接蛋白。
Methods. 2022 Dec;208:42-47. doi: 10.1016/j.ymeth.2022.10.008. Epub 2022 Oct 29.

引用本文的文献

1
NeXtMD: a new generation of machine learning and deep learning stacked hybrid framework for accurate identification of anti-inflammatory peptides.NeXtMD:用于精确识别抗炎肽的新一代机器学习与深度学习堆叠混合框架。
BMC Biol. 2025 Jul 15;23(1):212. doi: 10.1186/s12915-025-02314-8.
2
Apolipoprotein and sphingolipid measurements: Can be used in the clinical practice of atrial fibrillation diagnosing and evaluating the cryoablation effectiveness?载脂蛋白和鞘脂测量:可用于心房颤动诊断及评估冷冻消融疗效的临床实践吗?
PLoS One. 2025 Mar 4;20(3):e0315905. doi: 10.1371/journal.pone.0315905. eCollection 2025.
3
Effect of yogurt probiotic liquid and powder supplementation on hematology and biochemistry blood levels of layer-phase laying hens.

本文引用的文献

1
IMS-CDA: Prediction of CircRNA-Disease Associations From the Integration of Multisource Similarity Information With Deep Stacked Autoencoder Model.IMS-CDA:基于深度堆叠自编码器模型的多源相似性信息融合预测 circRNA-疾病关联
IEEE Trans Cybern. 2021 Nov;51(11):5522-5531. doi: 10.1109/TCYB.2020.3022852. Epub 2021 Nov 9.
2
Meta-i6mA: an interspecies predictor for identifying DNA N6-methyladenine sites of plant genomes by exploiting informative features in an integrative machine-learning framework.Meta-i6mA:利用集成机器学习框架中的信息特征,用于识别植物基因组中 DNA N6-甲基腺嘌呤位点的种间预测因子。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa202.
3
补充酸奶益生菌液和粉剂对蛋鸡产蛋期血液学和生化血液指标的影响。
J Adv Vet Anim Res. 2024 Dec 27;11(4):936-943. doi: 10.5455/javar.2024.k843. eCollection 2024 Dec.
4
Empirical Comparison and Analysis of Artificial Intelligence-Based Methods for Identifying Phosphorylation Sites of SARS-CoV-2 Infection.基于人工智能的新冠病毒感染磷酸化位点识别方法的实证比较与分析
Int J Mol Sci. 2024 Dec 21;25(24):13674. doi: 10.3390/ijms252413674.
5
Proteome profiling reveals opportunities to investigate biomarkers of oxidative stress and immune responses in blubber biopsies from free-ranging baleen whales.蛋白质组分析揭示了在自由放养须鲸的鲸脂活检中研究氧化应激和免疫反应生物标志物的机会。
Conserv Physiol. 2024 Aug 19;12(1):coae059. doi: 10.1093/conphys/coae059. eCollection 2024.
6
Comparative analysis of human tear fluid and aqueous humor proteomes.比较人泪液和房水蛋白质组。
Ocul Surf. 2024 Jul;33:16-22. doi: 10.1016/j.jtos.2024.03.011. Epub 2024 Mar 30.
7
Imidazolium-based zwitterionic liquid-modified PEG-PLGA nanoparticles as a potential intravenous drug delivery carrier.基于咪唑鎓的两性离子液体修饰的 PEG-PLGA 纳米粒作为一种有潜力的静脉注射药物递送载体。
Nanoscale. 2024 Mar 14;16(11):5584-5600. doi: 10.1039/d3nr06349f.
8
The Role of Apolipoproteins in the Commonest Cancers: A Review.载脂蛋白在最常见癌症中的作用:综述
Cancers (Basel). 2023 Nov 24;15(23):5565. doi: 10.3390/cancers15235565.
9
Development of New Genome Editing Tools for the Treatment of Hyperlipidemia.新型基因组编辑工具治疗高血脂症的研究进展
Cells. 2023 Oct 16;12(20):2466. doi: 10.3390/cells12202466.
10
A First Computational Frame for Recognizing Heparin-Binding Protein.一种用于识别肝素结合蛋白的首个计算框架。
Diagnostics (Basel). 2023 Jul 24;13(14):2465. doi: 10.3390/diagnostics13142465.
Gut Microbiota-Associated Activation of TLR5 Induces Apolipoprotein A1 Production in the Liver.
肠道微生物群相关的 TLR5 激活诱导肝脏载脂蛋白 A1 的产生。
Circ Res. 2020 Oct 23;127(10):1236-1252. doi: 10.1161/CIRCRESAHA.120.317362. Epub 2020 Aug 21.
4
Prediction of antioxidant proteins using hybrid feature representation method and random forest.基于混合特征表示方法和随机森林的抗氧化蛋白预测。
Genomics. 2020 Nov;112(6):4666-4674. doi: 10.1016/j.ygeno.2020.08.016. Epub 2020 Aug 17.
5
gutMDisorder: a comprehensive database for dysbiosis of the gut microbiota in disorders and interventions.肠道疾病:一个关于疾病和干预中肠道微生物群失调的综合数据库。
Nucleic Acids Res. 2020 Jul 27;48(13):7603. doi: 10.1093/nar/gkaa511.
6
DeepLGP: a novel deep learning method for prioritizing lncRNA target genes.DeepLGP:一种用于优先化 lncRNA 靶基因的新型深度学习方法。
Bioinformatics. 2020 Aug 15;36(16):4466-4472. doi: 10.1093/bioinformatics/btaa428.
7
Computational identification of N6-methyladenosine sites in multiple tissues of mammals.哺乳动物多个组织中N6-甲基腺嘌呤位点的计算识别
Comput Struct Biotechnol J. 2020 Apr 30;18:1084-1091. doi: 10.1016/j.csbj.2020.04.015. eCollection 2020.
8
Evaluating the relationship between circulating lipoprotein lipids and apolipoproteins with risk of coronary heart disease: A multivariable Mendelian randomisation analysis.评估循环脂蛋白脂质与载脂蛋白与冠心病风险的关系:多变量孟德尔随机分析。
PLoS Med. 2020 Mar 23;17(3):e1003062. doi: 10.1371/journal.pmed.1003062. eCollection 2020 Mar.
9
Deep Learning to Predict Protein Backbone Structure from High-Resolution Cryo-EM Density Maps.深度学习从高分辨率冷冻电镜密度图预测蛋白质骨架结构。
Sci Rep. 2020 Mar 9;10(1):4282. doi: 10.1038/s41598-020-60598-y.
10
HLPpred-Fuse: improved and robust prediction of hemolytic peptide and its activity by fusing multiple feature representation.HLPpred-Fuse:通过融合多种特征表示提高和增强溶血肽及其活性的预测
Bioinformatics. 2020 Jun 1;36(11):3350-3356. doi: 10.1093/bioinformatics/btaa160.