ESTIMATING THE GUMBEL SCALE PARAMETER FOR LOCAL ALIGNMENT OF RANDOM SEQUENCES BY IMPORTANCE SAMPLING WITH STOPPING TIMES.

Authors

Park Yonil, Sheetlin Sergey, Spouge John L

Affiliation

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, Maryland 20894, USA.

Publication

Ann Stat. 2009 Dec 1;37(6A):3697. doi: 10.1214/08-AOS663.

DOI: 10.1214/08-AOS663
PMID: 20148197
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2818155/

Abstract

The gapped local alignment score of two random sequences follows a Gumbel distribution. If computers could estimate the parameters of the Gumbel distribution within one second, the use of arbitrary alignment scoring schemes could increase the sensitivity of searching biological sequence databases over the web. Accordingly, this article gives a novel equation for the scale parameter of the relevant Gumbel distribution. We speculate that the equation is exact, although present numerical evidence is limited. The equation involves ascending ladder variates in the global alignment of random sequences. In global alignment simulations, the ladder variates yield stopping times specifying random sequence lengths. Because of the random lengths, and because our trial distribution for importance sampling occurs on a different sample space from our target distribution, our study led to a mapping theorem, which led naturally in turn to an efficient dynamic programming algorithm for the importance sampling weights. Numerical studies using several popular alignment scoring schemes then examined the efficiency and accuracy of the resulting simulations.
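As a rough illustration of what "importance sampling with stopping times" means, the sketch below uses a much simpler setting than the article's: a one-dimensional random walk with negative drift, which is the classical model behind ungapped local alignment scores. It is not the authors' equation, mapping theorem, or dynamic programming algorithm. It shows the textbook exponential-tilting scheme: solve E[exp(λX)] = 1 for the positive root λ, simulate under the tilted step distribution until the walk first reaches a level y (a stopping time), and weight each run by exp(-λ S) at that time. The step values, probabilities, and function names are assumptions chosen for the example.

```python
import math
import random


def solve_lambda(steps, probs, lo=1e-9, hi=50.0, iters=200):
    """Positive root of E[exp(lam * X)] = 1, found by bisection.

    Assumes the step X has negative mean and can take a positive value,
    so that f(lam) = E[exp(lam * X)] - 1 is negative just above 0 and
    positive for large lam.
    """
    def f(lam):
        return sum(p * math.exp(lam * x) for x, p in zip(steps, probs)) - 1.0

    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if f(mid) < 0.0:
            lo = mid  # root lies above mid
        else:
            hi = mid  # root lies at or below mid
    return 0.5 * (lo + hi)


def tail_probability_is(y, steps, probs, n_samples=10000, seed=0):
    """Estimate P(max of the walk >= y) by importance sampling.

    Trial distribution: steps drawn with probability p(x) * exp(lam * x),
    which gives the walk positive drift, so level y is reached in finite time.
    Weight: the likelihood ratio exp(-lam * S) at the stopping time.
    """
    rng = random.Random(seed)
    lam = solve_lambda(steps, probs)
    tilted = [p * math.exp(lam * x) for x, p in zip(steps, probs)]
    total = 0.0
    for _ in range(n_samples):
        s = 0
        while s < y:  # stopping time: first passage to level y or above
            s += rng.choices(steps, weights=tilted)[0]
        total += math.exp(-lam * s)
    return total / n_samples, lam


# Hypothetical toy scoring: step -1 with probability 0.8, +2 with probability 0.2.
STEPS = (-1, 2)
PROBS = (0.8, 0.2)

if __name__ == "__main__":
    estimate, lam = tail_probability_is(10, STEPS, PROBS, n_samples=5000, seed=1)
    print("lambda:", lam)
    print("estimated P(max >= 10):", estimate)
```

In this toy walk the score at the stopping time is either y or y + 1, so every weight lies between exp(-λ(y + 1)) and exp(-λ y). That is why the tail probability decays like exp(-λ y), with λ playing the role of the scale parameter in the abstract. The gapped case treated in the article is harder because the trial and target distributions live on different sample spaces, which this sketch does not attempt to reproduce.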
