Suppr超能文献

数据库搜索中蛋白质序列相似性的统计学显著性检验。

Tests for the statistical significance of protein sequence similarities in data-bank searches.

作者信息

Mott R F, Kirkwood T B, Curnow R N

机构信息

Laboratory of Mathematical Biology, National Institute for Medical Research, UK.

出版信息

Protein Eng. 1990 Dec;4(2):149-54. doi: 10.1093/protein/4.2.149.

Abstract

A suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino acid matches. The method is extended to allow for the existence of a sequence insertion/deletion within the region of similarity. The accuracy of statistical distributions underlying the tests is validated using randomly generated sequences and real sequences selected at random from the data banks. A computer program to perform the tests is briefly described.

摘要

开发了一套用于评估蛋白质序列相似性统计显著性的测试方法,以用于数据库搜索。这些测试基于威尔伯-利普曼词搜索算法,并考虑了序列长度和组成,以及氨基酸匹配的权重(可选)。该方法得到扩展,以允许在相似区域内存在序列插入/缺失。使用随机生成的序列和从数据库中随机选择的真实序列验证了测试所依据的统计分布的准确性。简要描述了一个执行这些测试的计算机程序。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验