Suppr超能文献

一个用于预测真核细胞中蛋白质定位位点的知识库。

A knowledge base for predicting protein localization sites in eukaryotic cells.

作者信息

Nakai K, Kanehisa M

机构信息

Institute for Chemical Research, Kyoto University, Japan.

出版信息

Genomics. 1992 Dec;14(4):897-911. doi: 10.1016/s0888-7543(05)80111-9.

Abstract

To automate examination of massive amounts of sequence data for biological function, it is important to computerize interpretation based on empirical knowledge of sequence-function relationships. For this purpose, we have been constructing a knowledge base by organizing various experimental and computational observations as a collection of if-then rules. Here we report an expert system, which utilizes this knowledge base, for predicting localization sites of proteins only from the information on the amino acid sequence and the source origin. We collected data for 401 eukaryotic proteins with known localization sites (subcellular and extracellular) and divided them into training data and testing data. Fourteen localization sites were distinguished for animal cells and 17 for plant cells. When sorting signals were not well characterized experimentally, various sequence features were computationally derived from the training data. It was found that 66% of the training data and 59% of the testing data were correctly predicted by our expert system. This artificial intelligence approach is powerful and flexible enough to be used in genome analyses.

摘要

为了实现对大量序列数据进行生物功能的自动化检测,基于序列 - 功能关系的经验知识进行计算机化解读非常重要。为此,我们通过将各种实验和计算观察结果组织成一系列“如果 - 那么”规则来构建一个知识库。在此,我们报告一个利用该知识库的专家系统,它仅根据氨基酸序列信息和来源就能预测蛋白质的定位位点。我们收集了401个具有已知定位位点(亚细胞和细胞外)的真核生物蛋白质的数据,并将它们分为训练数据和测试数据。动物细胞区分出14个定位位点,植物细胞区分出17个定位位点。当分选信号在实验上没有得到很好的表征时,从训练数据中通过计算得出各种序列特征。结果发现,我们的专家系统正确预测了66%的训练数据和59%的测试数据。这种人工智能方法强大且灵活,足以用于基因组分析。

相似文献

5
BaCelLo: a balanced subcellular localization predictor.BaCelLo:一种平衡的亚细胞定位预测器。
Bioinformatics. 2006 Jul 15;22(14):e408-16. doi: 10.1093/bioinformatics/btl222.
6
Prediction of protein subcellular localization.蛋白质亚细胞定位预测
Proteins. 2006 Aug 15;64(3):643-51. doi: 10.1002/prot.21018.
8
Computational prediction of subcellular localization.亚细胞定位的计算预测
Methods Mol Biol. 2007;390:429-66. doi: 10.1007/978-1-59745-466-7_29.

引用本文的文献

7
Fast and exact fixed-radius neighbor search based on sorting.基于排序的快速精确固定半径邻域搜索。
PeerJ Comput Sci. 2024 Mar 29;10:e1929. doi: 10.7717/peerj-cs.1929. eCollection 2024.
8
A review from biological mapping to computation-based subcellular localization.从生物图谱到基于计算的亚细胞定位的综述。
Mol Ther Nucleic Acids. 2023 Apr 20;32:507-521. doi: 10.1016/j.omtn.2023.04.015. eCollection 2023 Jun 13.

本文引用的文献

1
Three-dimensional structure of membrane and surface proteins.膜蛋白和表面蛋白的三维结构。
Annu Rev Biochem. 1984;53:595-623. doi: 10.1146/annurev.bi.53.070184.003115.
2
The detection and classification of membrane-spanning proteins.跨膜蛋白的检测与分类。
Biochim Biophys Acta. 1985 May 28;815(3):468-76. doi: 10.1016/0005-2736(85)90375-x.
3
A new method for predicting signal sequence cleavage sites.一种预测信号序列切割位点的新方法。
Nucleic Acids Res. 1986 Jun 11;14(11):4683-90. doi: 10.1093/nar/14.11.4683.
6
Prediction of in-vivo modification sites of proteins from their primary structures.
J Biochem. 1988 Nov;104(5):693-9. doi: 10.1093/oxfordjournals.jbchem.a122535.
8
Topogenic signals in integral membrane proteins.整合膜蛋白中的拓扑信号。
Eur J Biochem. 1988 Jul 1;174(4):671-8. doi: 10.1111/j.1432-1033.1988.tb14150.x.
9
Fatty acylation of proteins.蛋白质的脂肪酰化作用。
Annu Rev Cell Biol. 1988;4:611-47. doi: 10.1146/annurev.cb.04.110188.003143.
10
The biology and enzymology of eukaryotic protein acylation.真核生物蛋白质酰化的生物学与酶学
Annu Rev Biochem. 1988;57:69-99. doi: 10.1146/annurev.bi.57.070188.000441.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验