
Prediction of non-classical secreted proteins using informative physicochemical properties.

Affiliation

Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 300, Taiwan.

Publication information

Interdiscip Sci. 2010 Sep;2(3):263-70. doi: 10.1007/s12539-010-0023-z. Epub 2010 Jul 25.

DOI: 10.1007/s12539-010-0023-z
PMID: 20658339

Abstract

Predicting non-classical secreted proteins is an important problem for drug discovery and the development of disease diagnostics. Non-classical secreted proteins are leaderless: they lack a signal peptide at the N-terminus. This makes them more difficult to predict than classical secreted proteins. We identify a set of informative physicochemical properties from amino acid indices and combine them with a support vector machine (SVM) to discriminate between secreted and non-secreted proteins and to predict non-classical secreted proteins. With the sequence identity of the dataset reduced to 25%, the prediction accuracy on the training dataset is 85%, much better than that of the traditional sequence similarity-based tools BLAST and PSI-BLAST. The accuracy on the independent test is 82%. The most effective features reveal fundamental differences in physicochemical properties between secreted and non-secreted proteins. This interpretable information could benefit drug discovery or the development of new blood biochemical examinations.
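
The feature step described in the abstract can be pictured as averaging an amino acid index over a protein sequence. The following is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not name the selected properties, so the Kyte-Doolittle hydropathy scale is used purely as an example, and the helper names `mean_property` and `feature_vector` are hypothetical. The resulting vectors would then be passed to an SVM classifier, which is omitted here.

```python
# Kyte-Doolittle hydropathy scale, used only as an illustrative
# physicochemical property (assumption: the abstract does not list
# which amino acid indices were actually selected).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def mean_property(sequence, index):
    """Average one amino acid index over a sequence.

    Residues not covered by the index (e.g. 'X') are skipped.
    """
    values = [index[aa] for aa in sequence.upper() if aa in index]
    if not values:
        raise ValueError("sequence contains no residues covered by the index")
    return sum(values) / len(values)


def feature_vector(sequence, indices):
    """Build one feature per amino acid index (hypothetical helper)."""
    return [mean_property(sequence, index) for index in indices]


if __name__ == "__main__":
    # A made-up peptide, for illustration only.
    print(feature_vector("MKVLAAGIX", [KYTE_DOOLITTLE]))
```

Each protein is reduced to a fixed-length vector regardless of its length, which is what allows a standard classifier such as an SVM to be trained on sequences of varying size.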


Similar articles

1. Prediction of non-classical secreted proteins using informative physicochemical properties.
Interdiscip Sci. 2010 Sep;2(3):263-70. doi: 10.1007/s12539-010-0023-z. Epub 2010 Jul 25.
2. A machine learning based method for the prediction of secretory proteins using amino acid composition, their order and similarity-search.
In Silico Biol. 2008;8(2):129-40.
3. SecretP: a new method for predicting mammalian secreted proteins.
Peptides. 2010 Apr;31(4):574-8. doi: 10.1016/j.peptides.2009.12.026. Epub 2010 Jan 4.
4. Sequence based human leukocyte antigen gene prediction using informative physicochemical properties.
Int J Data Min Bioinform. 2015;13(3):211-24. doi: 10.1504/ijdmb.2015.072072.
5. Ranking Gene Ontology terms for predicting non-classical secretory proteins in eukaryotes and prokaryotes.
J Theor Biol. 2012 Nov 7;312:105-13. doi: 10.1016/j.jtbi.2012.07.027. Epub 2012 Aug 8.
6. A new hybrid coding for protein secondary structure prediction based on primary structure similarity.
Gene. 2017 Jun 30;618:8-13. doi: 10.1016/j.gene.2017.03.011. Epub 2017 Mar 16.
7. Signal peptide discrimination and cleavage site identification using SVM and NN.
Comput Biol Med. 2014 Feb;45:98-110. doi: 10.1016/j.compbiomed.2013.11.017. Epub 2013 Dec 1.
8. SPRED: A machine learning approach for the identification of classical and non-classical secretory proteins in mammalian genomes.
Biochem Biophys Res Commun. 2010 Jan 15;391(3):1306-11. doi: 10.1016/j.bbrc.2009.12.019. Epub 2009 Dec 6.
9. Computational identification of ubiquitylation sites from protein sequences.
BMC Bioinformatics. 2008 Jul 15;9:310. doi: 10.1186/1471-2105-9-310.
10. The accurate prediction of protein family from amino acid sequence by measuring features of sequence fragments.
J Comput Biol. 2009 Dec;16(12):1671-88. doi: 10.1089/cmb.2008.0115.

Cited by

1. Functional Analysis of a CTL-X-Type Lectin CTL16 in Development and Innate Immunity of .
Int J Mol Sci. 2023 Jun 27;24(13):10700. doi: 10.3390/ijms241310700.
2. Prediction of Human Secretory Proteins in Plasma Based on Discrete Firefly Optimization and Application to Cancer Biomarkers Identification.
Front Genet. 2019 Jun 6;10:542. doi: 10.3389/fgene.2019.00542. eCollection 2019.
3. High-Throughput Identification of Mammalian Secreted Proteins Using Species-Specific Scheme and Application to Human Proteome.
Molecules. 2018 Jun 14;23(6):1448. doi: 10.3390/molecules23061448.
4. Designing novel construction for cell surface display of protein E on Escherichia coli using non-classical pathway based on Lpp-OmpA.
AMB Express. 2017 Dec;7(1):53. doi: 10.1186/s13568-017-0350-0. Epub 2017 Feb 28.