• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用特征选择技术鉴定载脂蛋白。

Identification of apolipoprotein using feature selection technique.

作者信息

Tang Hua, Zou Ping, Zhang Chunmei, Chen Rong, Chen Wei, Lin Hao

机构信息

Department of Pathophysiology, Southwest Medical University, Luzhou 646000, China.

Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China.

出版信息

Sci Rep. 2016 Jul 22;6:30441. doi: 10.1038/srep30441.

DOI:10.1038/srep30441
PMID:27443605
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4957217/
Abstract

Apolipoprotein is a kind of protein which can transport the lipids through the lymphatic and circulatory systems. The abnormal expression level of apolipoprotein always causes angiocardiopathy. Thus, correct recognition of apolipoprotein from proteomic data is very crucial to the comprehension of cardiovascular system and drug design. This study is to develop a computational model to predict apolipoproteins. In the model, the apolipoproteins and non-apolipoproteins were collected to form benchmark dataset. On the basis of the dataset, we extracted the g-gap dipeptide composition information from residue sequences to formulate protein samples. To exclude redundant information or noise, the analysis of various (ANOVA)-based feature selection technique was proposed to find out the best feature subset. The support vector machine (SVM) was selected as discrimination algorithm. Results show that 96.2% of sensitivity and 99.3% of specificity were achieved in five-fold cross-validation. These findings open new perspectives to improve apolipoproteins prediction by considering the specific dipeptides. We expect that these findings will help to improve drug development in anti-angiocardiopathy disease.

摘要

载脂蛋白是一种能够通过淋巴系统和循环系统运输脂质的蛋白质。载脂蛋白的异常表达水平常常会引发心血管疾病。因此,从蛋白质组学数据中准确识别载脂蛋白对于理解心血管系统和药物设计至关重要。本研究旨在开发一种预测载脂蛋白的计算模型。在该模型中,收集载脂蛋白和非载脂蛋白以形成基准数据集。基于该数据集,我们从残基序列中提取g-gap二肽组成信息来构建蛋白质样本。为了排除冗余信息或噪声,提出了基于方差分析(ANOVA)的特征选择技术来找出最佳特征子集。选择支持向量机(SVM)作为判别算法。结果表明,在五折交叉验证中,灵敏度达到96.2%,特异性达到99.3%。这些发现为通过考虑特定二肽来改进载脂蛋白预测开辟了新的视角。我们期望这些发现将有助于改善抗心血管疾病药物的研发。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c12a/4957217/2b23a5f079f8/srep30441-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c12a/4957217/eb8c7f88d724/srep30441-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c12a/4957217/8946cac4c896/srep30441-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c12a/4957217/2b23a5f079f8/srep30441-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c12a/4957217/eb8c7f88d724/srep30441-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c12a/4957217/8946cac4c896/srep30441-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c12a/4957217/2b23a5f079f8/srep30441-f3.jpg

相似文献

1
Identification of apolipoprotein using feature selection technique.利用特征选择技术鉴定载脂蛋白。
Sci Rep. 2016 Jul 22;6:30441. doi: 10.1038/srep30441.
2
ApoPred: Identification of Apolipoproteins and Their Subfamilies With Multifarious Features.ApoPred:具有多种特征的载脂蛋白及其亚家族的鉴定
Front Cell Dev Biol. 2021 Jan 8;8:621144. doi: 10.3389/fcell.2020.621144. eCollection 2020.
3
Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition.通过将二肽组成纳入周氏广义伪氨基酸组成来预测蛋白质的亚线粒体定位
J Membr Biol. 2016 Jun;249(3):293-304. doi: 10.1007/s00232-015-9868-8. Epub 2016 Jan 8.
4
HBPred: a tool to identify growth hormone-binding proteins.HBPred:一种识别生长激素结合蛋白的工具。
Int J Biol Sci. 2018 May 22;14(8):957-964. doi: 10.7150/ijbs.24174. eCollection 2018.
5
IonchanPred 2.0: A Tool to Predict Ion Channels and Their Types.IonchanPred 2.0:一种预测离子通道及其类型的工具。
Int J Mol Sci. 2017 Aug 24;18(9):1838. doi: 10.3390/ijms18091838.
6
Identification of voltage-gated potassium channel subfamilies from sequence information using support vector machine.使用支持向量机从序列信息中鉴定电压门控钾通道亚家族。
Comput Biol Med. 2012 Apr;42(4):504-7. doi: 10.1016/j.compbiomed.2012.01.003. Epub 2012 Jan 31.
7
A machine learning based method for the prediction of secretory proteins using amino acid composition, their order and similarity-search.一种基于机器学习的方法,利用氨基酸组成、顺序和相似性搜索来预测分泌蛋白。
In Silico Biol. 2008;8(2):129-40.
8
Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods.利用两步特征选择方法鉴定噬菌体病毒蛋白。
Molecules. 2018 Aug 10;23(8):2000. doi: 10.3390/molecules23082000.
9
CTKPred: an SVM-based method for the prediction and classification of the cytokine superfamily.CTKPred:一种基于支持向量机的细胞因子超家族预测与分类方法。
Protein Eng Des Sel. 2005 Aug;18(8):365-8. doi: 10.1093/protein/gzi041. Epub 2005 Jun 24.
10
Computational identification of human long intergenic non-coding RNAs using a GA-SVM algorithm.基于 GA-SVM 算法的人类长链非编码 RNA 计算识别。
Gene. 2014 Jan 1;533(1):94-9. doi: 10.1016/j.gene.2013.09.118. Epub 2013 Oct 9.

引用本文的文献

1
The Yin and Yang of the protein corona on the delivery journey of nanoparticles.纳米颗粒递送过程中蛋白质冠层的阴阳两面
Nano Res. 2023;16(1):715-734. doi: 10.1007/s12274-022-4849-6. Epub 2022 Sep 16.
2
Large-scale comparative review and assessment of computational methods for phage virion proteins identification.噬菌体病毒粒子蛋白质鉴定计算方法的大规模比较综述与评估
EXCLI J. 2022 Jan 3;21:11-29. doi: 10.17179/excli2021-4411. eCollection 2022.
3
CASPredict: a web service for identifying Cas proteins.CASPredict:一种用于识别Cas蛋白的网络服务。

本文引用的文献

1
Recombination spot identification Based on gapped k-mers.基于缺口 k- -mer 的重组位点识别。
Sci Rep. 2016 Mar 31;6:23934. doi: 10.1038/srep23934.
2
N-Terminal Amino Acid Sequence Determination of Proteins by N-Terminal Dimethyl Labeling: Pitfalls and Advantages When Compared with Edman Degradation Sequence Analysis.通过N端二甲基标记法测定蛋白质的N端氨基酸序列:与艾德曼降解序列分析相比的陷阱与优势
J Biomol Tech. 2016 Jul;27(2):61-74. doi: 10.7171/jbt.16-2702-002. Epub 2016 Mar 7.
3
Prediction and identification of the effectors of heterotrimeric G proteins in rice (Oryza sativa L.).
PeerJ. 2021 Jul 30;9:e11887. doi: 10.7717/peerj.11887. eCollection 2021.
4
ApoPred: Identification of Apolipoproteins and Their Subfamilies With Multifarious Features.ApoPred:具有多种特征的载脂蛋白及其亚家族的鉴定
Front Cell Dev Biol. 2021 Jan 8;8:621144. doi: 10.3389/fcell.2020.621144. eCollection 2020.
5
Prediction of Recombination Spots Using Novel Hybrid Feature Extraction Method via Deep Learning Approach.通过深度学习方法使用新型混合特征提取方法预测重组位点
Front Genet. 2020 Sep 17;11:539227. doi: 10.3389/fgene.2020.539227. eCollection 2020.
6
A computational method for prediction of xylanase enzymes activity in strains of Bacillus subtilis based on pseudo amino acid composition features.基于伪氨基酸组成特征的枯草芽孢杆菌木聚糖酶活力预测的计算方法。
PLoS One. 2018 Oct 22;13(10):e0205796. doi: 10.1371/journal.pone.0205796. eCollection 2018.
7
Identification of Inhibitors of MMPS Enzymes via a Novel Computational Approach.通过一种新的计算方法鉴定 MMPS 酶抑制剂。
Int J Biol Sci. 2018 May 22;14(8):863-871. doi: 10.7150/ijbs.24588. eCollection 2018.
8
HBPred: a tool to identify growth hormone-binding proteins.HBPred:一种识别生长激素结合蛋白的工具。
Int J Biol Sci. 2018 May 22;14(8):957-964. doi: 10.7150/ijbs.24174. eCollection 2018.
9
iRSpot-Pse6NC: Identifying recombination spots in by incorporating hexamer composition into general PseKNC.iRSpot-Pse6NC:通过将六聚体组成纳入通用 PseKNC 来识别 中的重组热点。
Int J Biol Sci. 2018 May 22;14(8):883-891. doi: 10.7150/ijbs.24616. eCollection 2018.
10
Semi-Supervised Maximum Discriminative Local Margin for Gene Selection.半监督最大鉴别局部边缘的基因选择。
Sci Rep. 2018 Jun 5;8(1):8619. doi: 10.1038/s41598-018-26806-6.
预测和鉴定水稻(Oryza sativa L.)中异三聚体 G 蛋白的效应子。
Brief Bioinform. 2017 Mar 1;18(2):270-278. doi: 10.1093/bib/bbw021.
4
dPABBs: A Novel in silico Approach for Predicting and Designing Anti-biofilm Peptides.二肽基氨基肽酶B:一种用于预测和设计抗生物膜肽的新型计算机模拟方法。
Sci Rep. 2016 Feb 25;6:21839. doi: 10.1038/srep21839.
5
The UniProtKB guide to the human proteome.人类蛋白质组的UniProtKB指南。
Database (Oxford). 2016 Feb 20;2016. doi: 10.1093/database/bav120. Print 2016.
6
Identification of immunoglobulins using Chou's pseudo amino acid composition with feature selection technique.使用具有特征选择技术的周氏伪氨基酸组成鉴定免疫球蛋白。
Mol Biosyst. 2016 Apr;12(4):1269-75. doi: 10.1039/c5mb00883b. Epub 2016 Feb 17.
7
HIV coreceptor tropism determination and mutational pattern identification.HIV共受体嗜性测定及突变模式鉴定。
Sci Rep. 2016 Feb 17;6:21280. doi: 10.1038/srep21280.
8
SOHSite: incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites.SOHSite:整合进化信息和理化性质以识别蛋白质S-亚磺酰化位点。
BMC Genomics. 2016 Jan 11;17 Suppl 1(Suppl 1):9. doi: 10.1186/s12864-015-2299-1.
9
UbiSite: incorporating two-layered machine learning method with substrate motifs to predict ubiquitin-conjugation site on lysines.UbiSite:结合具有底物基序的两层机器学习方法来预测赖氨酸上的泛素结合位点。
BMC Syst Biol. 2016 Jan 11;10 Suppl 1(Suppl 1):6. doi: 10.1186/s12918-015-0246-z.
10
iMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions.iMiRNA-SSF:通过结合不同分布的负集改进微小RNA前体的识别
Sci Rep. 2016 Jan 12;6:19062. doi: 10.1038/srep19062.