• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

详细预测蛋白质亚核定位。

Detailed prediction of protein sub-nuclear localization.

机构信息

Department of Informatics, Bioinformatics & Computational Biology - i12, TUM (Technical University of Munich), Boltzmannstr. 3, 85748, Garching/Munich, Germany.

School of Chemistry and Molecular Biosciences, UQ (University of Queensland), Cooper Rd, Brisbane City, QLD, 4072, Australia.

出版信息

BMC Bioinformatics. 2019 Apr 23;20(1):205. doi: 10.1186/s12859-019-2790-9.

DOI:10.1186/s12859-019-2790-9
PMID:31014229
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6480651/
Abstract

BACKGROUND

Sub-nuclear structures or locations are associated with various nuclear processes. Proteins localized in these substructures are important to understand the interior nuclear mechanisms. Despite advances in high-throughput methods, experimental protein annotations remain limited. Predictions of cellular compartments have become very accurate, largely at the expense of leaving out substructures inside the nucleus making a fine-grained analysis impossible.

RESULTS

Here, we present a new method (LocNuclei) that predicts nuclear substructures from sequence alone. LocNuclei used a string-based Profile Kernel with Support Vector Machines (SVMs). It distinguishes sub-nuclear localization in 13 distinct substructures and distinguishes between nuclear proteins confined to the nucleus and those that are also native to other compartments (traveler proteins). High performance was achieved by implicitly leveraging a large biological knowledge-base in creating predictions by homology-based inference through BLAST. Using this approach, the performance reached AUC = 0.70-0.74 and Q13 = 59-65%. Travelling proteins (nucleus and other) were identified at Q2 = 70-74%. A Gene Ontology (GO) analysis of the enrichment of biological processes revealed that the predicted sub-nuclear compartments matched the expected functionality. Analysis of protein-protein interactions (PPI) show that formation of compartments and functionality of proteins in these compartments highly rely on interactions between proteins. This suggested that the LocNuclei predictions carry important information about function. The source code and data sets are available through GitHub: https://github.com/Rostlab/LocNuclei .

CONCLUSIONS

LocNuclei predicts subnuclear compartments and traveler proteins accurately. These predictions carry important information about functionality and PPIs.

摘要

背景

亚核结构或位置与各种核过程相关。定位于这些亚结构中的蛋白质对于理解核内机制非常重要。尽管高通量方法取得了进展,但实验性蛋白质注释仍然有限。细胞区室的预测已经非常准确,主要是因为忽略了核内的亚结构,使得精细的分析成为不可能。

结果

在这里,我们提出了一种新的方法(LocNuclei),该方法仅通过序列预测核亚结构。LocNuclei 使用基于字符串的 Profile Kernel 与支持向量机(SVMs)。它区分了 13 种不同亚结构中的核亚定位,并区分了局限于核内的核蛋白和那些也固有存在于其他区室(旅行者蛋白)的核蛋白。通过同源推断通过 BLAST 进行基于同源性的推断,隐含地利用了大量的生物学知识库来创建预测,从而实现了高性能。使用这种方法,性能达到 AUC=0.70-0.74 和 Q13=59-65%。旅行者蛋白(核内和其他)的识别率为 Q2=70-74%。对生物过程的富集进行基因本体论(GO)分析表明,预测的亚核区室与预期的功能相匹配。对蛋白质-蛋白质相互作用(PPI)的分析表明,区室的形成和这些区室中蛋白质的功能高度依赖于蛋白质之间的相互作用。这表明 LocNuclei 的预测携带了关于功能的重要信息。源代码和数据集可通过 GitHub 获得:https://github.com/Rostlab/LocNuclei。

结论

LocNuclei 准确预测亚核区室和旅行者蛋白。这些预测携带了关于功能和 PPI 的重要信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a138/6480651/a9c507b5ebb8/12859_2019_2790_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a138/6480651/0983166b9cf2/12859_2019_2790_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a138/6480651/02f316279aea/12859_2019_2790_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a138/6480651/363bf9081ad7/12859_2019_2790_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a138/6480651/60248751b41b/12859_2019_2790_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a138/6480651/a9c507b5ebb8/12859_2019_2790_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a138/6480651/0983166b9cf2/12859_2019_2790_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a138/6480651/02f316279aea/12859_2019_2790_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a138/6480651/363bf9081ad7/12859_2019_2790_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a138/6480651/60248751b41b/12859_2019_2790_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a138/6480651/a9c507b5ebb8/12859_2019_2790_Fig5_HTML.jpg

相似文献

1
Detailed prediction of protein sub-nuclear localization.详细预测蛋白质亚核定位。
BMC Bioinformatics. 2019 Apr 23;20(1):205. doi: 10.1186/s12859-019-2790-9.
2
Protein sub-nuclear localization prediction using SVM and Pfam domain information.利用支持向量机和Pfam结构域信息进行蛋白质亚核定位预测。
PLoS One. 2014 Jun 4;9(6):e98345. doi: 10.1371/journal.pone.0098345. eCollection 2014.
3
Prediction of nuclear proteins using nuclear translocation signals proposed by probabilistic latent semantic indexing.基于概率潜在语义索引的核转位信号预测核蛋白。
BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S13. doi: 10.1186/1471-2105-13-S17-S13. Epub 2012 Dec 13.
4
ProLoc-GO: utilizing informative Gene Ontology terms for sequence-based prediction of protein subcellular localization.ProLoc-GO:利用信息丰富的基因本体术语进行基于序列的蛋白质亚细胞定位预测。
BMC Bioinformatics. 2008 Feb 1;9:80. doi: 10.1186/1471-2105-9-80.
5
ProNA2020 predicts protein-DNA, protein-RNA, and protein-protein binding proteins and residues from sequence.ProNA2020 可从序列预测蛋白质-DNA、蛋白质-RNA 和蛋白质-蛋白质结合蛋白及残基。
J Mol Biol. 2020 Mar 27;432(7):2428-2443. doi: 10.1016/j.jmb.2020.02.026. Epub 2020 Mar 4.
6
LZW-Kernel: fast kernel utilizing variable length code blocks from LZW compressors for protein sequence classification.LZW-Kernel:快速内核,利用 LZW 压缩器中的变长码块对蛋白质序列进行分类。
Bioinformatics. 2018 Oct 1;34(19):3281-3288. doi: 10.1093/bioinformatics/bty349.
7
An SVM-based system for predicting protein subnuclear localizations.一种基于支持向量机的蛋白质亚核定位预测系统。
BMC Bioinformatics. 2005 Dec 7;6:291. doi: 10.1186/1471-2105-6-291.
8
Protein subcellular multi-localization prediction using a min-max modular support vector machine.基于最小-最大模块化支持向量机的蛋白质亚细胞多定位预测
Int J Neural Syst. 2010 Feb;20(1):13-28. doi: 10.1142/S0129065710002206.
9
Evolutionary profiles improve protein-protein interaction prediction from sequence.进化特征可提高基于序列的蛋白质-蛋白质相互作用预测。
Bioinformatics. 2015 Jun 15;31(12):1945-50. doi: 10.1093/bioinformatics/btv077. Epub 2015 Feb 4.
10
SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.支持向量机折叠法:一种用于判别式多类别蛋白质折叠和超家族识别的工具。
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.

引用本文的文献

1
Deciphering metabolic pathways: A treasure map to therapeutic targets.解读代谢途径:治疗靶点的藏宝图。
Biotechnol Notes. 2024 Nov 23;6:1-9. doi: 10.1016/j.biotno.2024.11.006. eCollection 2025.
2
TSHZ2 is an EGF-regulated tumor suppressor that binds to the cytokinesis regulator PRC1 and inhibits metastasis.TSHZ2 是一种 EGF 调节的肿瘤抑制因子,它与胞质分裂调节剂 PRC1 结合并抑制转移。
Sci Signal. 2021 Jun 22;14(688):eabe6156. doi: 10.1126/scisignal.abe6156.
3
Correction to: Detailed prediction of protein sub-nuclear localization.

本文引用的文献

1
Correcting mistakes in predicting distributions.纠正分布预测中的错误。
Bioinformatics. 2018 Oct 1;34(19):3385-3386. doi: 10.1093/bioinformatics/bty346.
2
NLSdb-major update for database of nuclear localization signals and nuclear export signals.NLSdb 重大更新:核定位信号和核输出信号数据库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D503-D508. doi: 10.1093/nar/gkx1021.
3
Expansion of the Gene Ontology knowledgebase and resources.基因本体知识库及资源的扩展。
对《蛋白质亚核定位的详细预测》的修正
BMC Bioinformatics. 2019 Dec 20;20(1):727. doi: 10.1186/s12859-019-3305-4.
Nucleic Acids Res. 2017 Jan 4;45(D1):D331-D338. doi: 10.1093/nar/gkw1108. Epub 2016 Nov 29.
4
A scored human protein-protein interaction network to catalyze genomic interpretation.一个用于催化基因组解读的评分人类蛋白质-蛋白质相互作用网络。
Nat Methods. 2017 Jan;14(1):61-64. doi: 10.1038/nmeth.4083. Epub 2016 Nov 28.
5
Integrated interactions database: tissue-specific view of the human and model organism interactomes.整合相互作用数据库:人类和模式生物相互作用组的组织特异性视图。
Nucleic Acids Res. 2016 Jan 4;44(D1):D536-41. doi: 10.1093/nar/gkv1115. Epub 2015 Oct 29.
6
A proteome-scale map of the human interactome network.人类相互作用组网络的蛋白质组规模图谱。
Cell. 2014 Nov 20;159(5):1212-1226. doi: 10.1016/j.cell.2014.10.050.
7
LocTree3 prediction of localization.LocTree3 定位预测。
Nucleic Acids Res. 2014 Jul;42(Web Server issue):W350-5. doi: 10.1093/nar/gku396. Epub 2014 May 21.
8
mentha: a resource for browsing integrated protein-interaction networks.薄荷:一个用于浏览整合蛋白质相互作用网络的资源。
Nat Methods. 2013 Aug;10(8):690-1. doi: 10.1038/nmeth.2561.
9
Accelerating the Original Profile Kernel.加速原始轮廓内核。
PLoS One. 2013 Jun 18;8(6):e68459. doi: 10.1371/journal.pone.0068459. Print 2013.
10
Structure, dynamics, evolution, and function of a major scaffold component in the nuclear pore complex.核孔复合体中主要支架成分的结构、动力学、进化和功能。
Structure. 2013 Apr 2;21(4):560-71. doi: 10.1016/j.str.2013.02.005. Epub 2013 Mar 14.