• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合CJ-SPHMM、TMHMM和PSORT的分泌蛋白预测系统。

Secreted protein prediction system combining CJ-SPHMM, TMHMM, and PSORT.

作者信息

Chen Yunjia, Yu Peng, Luo Jingchu, Jiang Ying

机构信息

College of Life Sciences, National Laboratory of Protein Engineering and Plant Genetic Engineering, and Centre of Bioinformatics, Peking University, Beijing 100871, China.

出版信息

Mamm Genome. 2003 Dec;14(12):859-65. doi: 10.1007/s00335-003-2296-6.

DOI:10.1007/s00335-003-2296-6
PMID:14724739
Abstract

To increase the coverage of secreted protein prediction, we describe a combination strategy. Instead of using a single method, we combine Hidden Markov Model (HMM)-based methods CJ-SPHMM and TMHMM with PSORT in secreted protein prediction. CJ-SPHMM is an HMM-based signal peptide prediction method, while TMHMM is an HMM-based transmembrane (TM) protein prediction algorithm. With CJ-SPHMM and TMHMM, proteins with predicted signal peptide and without predicted TM regions are taken as putative secreted proteins. This HMM-based approach predicts secreted protein with Ac (Accuracy) at 0.82 and Cc (Correlation coefficient) at 0.75, which are similar to PSORT with Ac at 0.82 and Cc at 0.76. When we further complement the HMM-based method, i.e., CJ-SPHMM + TMHMM with PSORT in secreted protein prediction, the Ac value is increased to 0.86 and the Cc value is increased to 0.81. Taking this combination strategy to search putative secreted proteins from the International Protein Index (IPI) maintained at the European Bioinformatics Institute (EBI), we constructed a putative human secretome with 5235 proteins. The prediction system described here can also be applied to predicting secreted proteins from other vertebrate proteomes.

摘要

为了提高分泌蛋白预测的覆盖率,我们描述了一种组合策略。我们在分泌蛋白预测中,不是使用单一方法,而是将基于隐马尔可夫模型(HMM)的方法CJ-SPHMM和TMHMM与PSORT相结合。CJ-SPHMM是一种基于HMM的信号肽预测方法,而TMHMM是一种基于HMM的跨膜(TM)蛋白预测算法。利用CJ-SPHMM和TMHMM,将预测有信号肽且无预测跨膜区域的蛋白质作为假定的分泌蛋白。这种基于HMM的方法预测分泌蛋白的准确率(Ac)为0.82,相关系数(Cc)为0.75,这与PSORT的准确率0.82和相关系数0.76相似。当我们在分泌蛋白预测中用PSORT进一步补充基于HMM的方法,即CJ-SPHMM + TMHMM时,Ac值提高到0.86,Cc值提高到0.81。采用这种组合策略从欧洲生物信息学研究所(EBI)维护的国际蛋白质索引(IPI)中搜索假定的分泌蛋白,我们构建了一个包含5235种蛋白质的假定人类分泌蛋白质组。这里描述的预测系统也可应用于预测其他脊椎动物蛋白质组中的分泌蛋白。

相似文献

1
Secreted protein prediction system combining CJ-SPHMM, TMHMM, and PSORT.结合CJ-SPHMM、TMHMM和PSORT的分泌蛋白预测系统。
Mamm Genome. 2003 Dec;14(12):859-65. doi: 10.1007/s00335-003-2296-6.
2
Combined prediction of transmembrane topology and signal peptide of beta-barrel proteins: using a hidden Markov model and genetic algorithms.β-桶状蛋白跨膜拓扑结构和信号肽的联合预测:使用隐马尔可夫模型和遗传算法。
Comput Biol Med. 2010 Jul;40(7):621-8. doi: 10.1016/j.compbiomed.2010.04.006. Epub 2010 May 21.
3
Evaluation of methods for predicting the topology of beta-barrel outer membrane proteins and a consensus prediction method.β-桶状外膜蛋白拓扑结构预测方法的评估及一种共识预测方法
BMC Bioinformatics. 2005 Jan 12;6:7. doi: 10.1186/1471-2105-6-7.
4
Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.用隐马尔可夫模型预测跨膜蛋白拓扑结构:应用于完整基因组。
J Mol Biol. 2001 Jan 19;305(3):567-80. doi: 10.1006/jmbi.2000.4315.
5
A combined transmembrane topology and signal peptide prediction method.一种跨膜拓扑结构与信号肽联合预测方法。
J Mol Biol. 2004 May 14;338(5):1027-36. doi: 10.1016/j.jmb.2004.03.016.
6
[Analysis of the secreted proteins encoded by genes in genome of filamental fungus (Neurospora crassa)].[丝状真菌(粗糙脉孢菌)基因组中基因编码的分泌蛋白分析]
Yi Chuan. 2006 Feb;28(2):200-7.
7
WoLF PSORT: protein localization predictor.WoLF PSORT:蛋白质定位预测工具。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W585-7. doi: 10.1093/nar/gkm259. Epub 2007 May 21.
8
An HMM posterior decoder for sequence feature prediction that includes homology information.一种用于序列特征预测的隐马尔可夫模型后验解码器,其包含同源性信息。
Bioinformatics. 2005 Jun;21 Suppl 1:i251-7. doi: 10.1093/bioinformatics/bti1014.
9
PSORT-B: Improving protein subcellular localization prediction for Gram-negative bacteria.PSORT-B:改进革兰氏阴性菌蛋白质亚细胞定位预测
Nucleic Acids Res. 2003 Jul 1;31(13):3613-7. doi: 10.1093/nar/gkg602.
10
Elucidation of the CHO Super-Ome (CHO-SO) by Proteoinformatics.通过蛋白质信息学阐明中国仓鼠卵巢细胞超级组(CHO-SO)
J Proteome Res. 2015 Nov 6;14(11):4687-703. doi: 10.1021/acs.jproteome.5b00588. Epub 2015 Oct 13.

引用本文的文献

1
Cytotoxicity to cells revealed the virulence-associated protein E gene as a potential virulence factor in methicillin-resistantStaphylococcus aureus.对细胞的细胞毒性表明,毒力相关蛋白E基因是耐甲氧西林金黄色葡萄球菌中的一种潜在毒力因子。
BMC Microbiol. 2025 Jul 10;25(1):427. doi: 10.1186/s12866-025-04148-4.
2
Anti-Inflammatory Function Analysis of CP-1 Strain Based on Whole-Genome Sequencing.基于全基因组测序的CP-1菌株抗炎功能分析
BioTech (Basel). 2025 Jun 7;14(2):47. doi: 10.3390/biotech14020047.
3
FGSE02, a Novel Secreted Protein in FG-12, Leads to Cell Death in Plant Tissues and Modulates Fungal Virulence.

本文引用的文献

1
State-of-the-art in membrane protein prediction.膜蛋白预测的最新技术。
Appl Bioinformatics. 2002;1(1):21-35.
2
A profile hidden Markov model for signal peptides generated by HMMER.由HMMER生成的信号肽的轮廓隐马尔可夫模型。
Bioinformatics. 2003 Jan 22;19(2):307-8. doi: 10.1093/bioinformatics/19.2.307.
3
Initial sequencing and comparative analysis of the mouse genome.小鼠基因组的初步测序与比较分析。
FGSE02是FG-12中的一种新型分泌蛋白,可导致植物组织细胞死亡并调节真菌毒力。
J Fungi (Basel). 2025 May 21;11(5):397. doi: 10.3390/jof11050397.
4
Bioinformatics analysis of the tomato (Solanum lycopersicum) methylesterase gene family.番茄(Solanum lycopersicum)甲酯酶基因家族的生物信息学分析
BMC Plant Biol. 2025 May 16;25(1):649. doi: 10.1186/s12870-025-06625-4.
5
Genome Identification, Expression Profile Analysis, and Abiotic Stress Response Mechanism of Longan Gene.龙眼基因的基因组鉴定、表达谱分析及非生物胁迫响应机制
Int J Mol Sci. 2025 Mar 25;26(7):3003. doi: 10.3390/ijms26073003.
6
Discovery of novel vaccine candidates based on the immunogenic epitopes derived from membrane proteins.基于源自膜蛋白的免疫原性表位发现新型候选疫苗。
Clin Exp Vaccine Res. 2025 Jan;14(1):86-100. doi: 10.7774/cevr.2025.14.e4. Epub 2025 Jan 13.
7
A novel in-silico approach to design a multiepitope peptide as a vaccine candidate for .一种用于设计多表位肽作为[具体疾病名称]疫苗候选物的新型计算机辅助方法。 (注:原文中“for”后面缺少具体疾病名称)
Heliyon. 2024 Nov 26;10(23):e40733. doi: 10.1016/j.heliyon.2024.e40733. eCollection 2024 Dec 15.
8
Characterization of the emerging recombinant infectious bronchitis virus in China.中国新出现的重组传染性支气管炎病毒的特征分析
Front Microbiol. 2024 Oct 15;15:1456415. doi: 10.3389/fmicb.2024.1456415. eCollection 2024.
9
Comparative genomic analysis of pathogenic factors of Listeria spp. using whole-genome sequencing.应用全基因组测序技术对李斯特菌属致病因子进行比较基因组分析。
BMC Genomics. 2024 Oct 7;25(1):935. doi: 10.1186/s12864-024-10849-3.
10
Genome Sequencing of Three Pathogenic Fungi Provides Insights into the Evolution and Pathogenic Mechanisms of the Cobweb Disease on Cultivated Mushrooms.三种致病真菌的基因组测序为深入了解栽培蘑菇蛛网病的进化和致病机制提供了线索。
Foods. 2024 Aug 30;13(17):2779. doi: 10.3390/foods13172779.
Nature. 2002 Dec 5;420(6915):520-62. doi: 10.1038/nature01262.
4
Human secretory signal peptide description by hidden Markov model and generation of a strong artificial signal peptide for secreted protein expression.通过隐马尔可夫模型描述人类分泌信号肽并生成用于分泌蛋白表达的强人工信号肽。
Biochem Biophys Res Commun. 2002 Jun 21;294(4):835-42. doi: 10.1016/S0006-291X(02)00566-1.
5
Protein targeting (Nobel lecture).蛋白质靶向运输(诺贝尔演讲)。
Chembiochem. 2000 Aug 18;1(2):86-102. doi: 10.1002/1439-7633(20000818)1:2<86::AID-CBIC86>3.0.CO;2-A.
6
Cytokines as new treatment targets in chronic heart failure.
Curr Control Trials Cardiovasc Med. 2001;2(6):271-277. doi: 10.1186/cvm-2-6-271.
7
Evaluation of methods for the prediction of membrane spanning regions.膜跨越区域预测方法的评估
Bioinformatics. 2001 Jul;17(7):646-53. doi: 10.1093/bioinformatics/17.7.646.
8
Bioinformatics, target discovery and the pharmaceutical/biotechnology industry.生物信息学、靶点发现与制药/生物技术产业。
Curr Opin Mol Ther. 2000 Dec;2(6):655-61.
9
Involvement of chemokine receptors in breast cancer metastasis.趋化因子受体在乳腺癌转移中的作用。
Nature. 2001 Mar 1;410(6824):50-6. doi: 10.1038/35065016.
10
Initial sequencing and analysis of the human genome.人类基因组的初步测序与分析。
Nature. 2001 Feb 15;409(6822):860-921. doi: 10.1038/35057062.