Suppr超能文献

搜索分子序列数据库中的问题。

Issues in searching molecular sequence databases.

作者信息

Altschul S F, Boguski M S, Gish W, Wootton J C

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894.

出版信息

Nat Genet. 1994 Feb;6(2):119-29. doi: 10.1038/ng0294-119.

Abstract

Sequence similarity search programs are versatile tools for the molecular biologist, frequently able to identify possible DNA coding regions and to provide clues to gene and protein structure and function. While much attention had been paid to the precise algorithms these programs employ and to their relative speeds, there is a constellation of associated issues that are equally important to realize the full potential of these methods. Here, we consider a number of these issues, including the choice of scoring systems, the statistical significance of alignments, the masking of uninformative or potentially confounding sequence regions, the nature and extent of sequence redundancy in the databases and network access to similarity search services.

摘要

序列相似性搜索程序是分子生物学家的多功能工具,常常能够识别可能的DNA编码区域,并为基因和蛋白质的结构与功能提供线索。虽然人们已高度关注这些程序所采用的精确算法及其相对速度,但还有一系列相关问题对于充分发挥这些方法的潜力同样重要。在此,我们将探讨其中的一些问题,包括评分系统的选择、比对的统计显著性、无信息或可能造成混淆的序列区域的屏蔽、数据库中序列冗余的性质和程度以及对相似性搜索服务的网络访问。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验