Applications and statistics for multiple high-scoring segments in molecular sequences.

Author information

Karlin S, Altschul S F

Affiliation

Department of Mathematics, Stanford University, CA 94305.

Publication information

Proc Natl Acad Sci U S A. 1993 Jun 15;90(12):5873-7. doi: 10.1073/pnas.90.12.5873.

Abstract

Score-based measures of molecular-sequence features provide versatile aids for the study of proteins and DNA. They are used by many sequence data base search programs, as well as for identifying distinctive properties of single sequences. For any such measure, it is important to know what can be expected to occur purely by chance. The statistical distribution of high-scoring segments has been described elsewhere. However, molecular sequences will frequently yield several high-scoring segments for which some combined assessment is in order. This paper describes the statistical distribution for the sum of the scores of multiple high-scoring segments and illustrates its application to the identification of possible transmembrane segments and the evaluation of sequence similarity.
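
As a rough illustration of the quantity the abstract refers to, the sketch below finds non-overlapping high-scoring segments in a list of per-residue scores and lets the caller sum their scores. The example scores, the function names (`best_segment`, `top_segments`), and the greedy "take the best segment, mask it, repeat" strategy are assumptions made for illustration and are not taken from the paper. The sketch does not compute the statistical distribution that the paper derives.

```python
from typing import List, Optional, Tuple

# (start index, end index exclusive, segment score)
Segment = Tuple[int, int, float]


def best_segment(scores: List[Optional[float]]) -> Optional[Segment]:
    """Return the highest-scoring contiguous segment, or None if no
    segment has a positive score.

    Positions set to None are treated as masked and act as boundaries.
    """
    best: Optional[Segment] = None
    run_start, run_score = 0, 0.0
    for i, s in enumerate(scores):
        if s is None or run_score + s <= 0:
            # Masked position, or the running score fell to zero or
            # below: any later segment must start after position i.
            run_start, run_score = i + 1, 0.0
            continue
        run_score += s
        if best is None or run_score > best[2]:
            best = (run_start, i + 1, run_score)
    return best


def top_segments(scores: List[float], max_segments: int) -> List[Segment]:
    """Greedily collect up to max_segments non-overlapping segments.

    Hypothetical helper: after each segment is found, its positions are
    masked so that the next search cannot overlap it.
    """
    work: List[Optional[float]] = list(scores)
    found: List[Segment] = []
    for _ in range(max_segments):
        seg = best_segment(work)
        if seg is None:
            break
        found.append(seg)
        start, end, _ = seg
        for i in range(start, end):
            work[i] = None
    return found


if __name__ == "__main__":
    # Made-up per-position scores, for illustration only.
    example = [1, -2, 3, 2, -1, 4, -6, 2, 2, -5, 1]
    segments = top_segments(example, 2)
    print(segments)
    print(sum(score for _, _, score in segments))
```

Whether a combined score of this kind is surprising under a random model is the question the paper addresses; answering it requires the distribution described there, which this sketch leaves out.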
