Bayesian adaptive sequence alignment algorithms.

Authors

Zhu J, Liu J S, Lawrence C E

Affiliation

Wadsworth Center for Laboratories and Research, Albany, NY, USA.

Publication

Bioinformatics. 1998;14(1):25-39. doi: 10.1093/bioinformatics/14.1.25.

PMID: 9520499

Abstract

The selection of a scoring matrix and gap penalty parameters continues to be an important problem in sequence alignment. We describe here an algorithm, the 'Bayes block aligner', which bypasses this requirement. Instead of requiring a fixed set of parameter settings, this algorithm returns the Bayesian posterior probability for the number of gaps and for the scoring matrices in any series of interest. Furthermore, instead of returning the single best alignment for the chosen parameter settings, this algorithm returns the posterior distribution of all alignments considering the full range of gapping and scoring matrices selected, weighing each in proportion to its probability based on the data. We compared the Bayes aligner with the popular Smith-Waterman algorithm with parameter settings from the literature which had been optimized for the identification of structural neighbors, and found that the Bayes aligner correctly identified more structural neighbors. In a detailed examination of the alignment of a pair of kinase and a pair of GTPase sequences, we illustrate the algorithm's potential to identify subsequences that are conserved to different degrees. In addition, this example shows that the Bayes aligner returns an alignment-free assessment of the distance between a pair of sequences.
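The idea of treating the number of blocks and the scoring matrix as unknowns can be illustrated with a small toy. The sketch below is not the authors' implementation: it assumes uniform priors over the matrices and over the block count, a uniform prior over alignments given the block count, and two made-up likelihood-ratio "matrices" (`strict` and `loose`) standing in for a real PAM or BLOSUM series. The function names `block_sums` and `posterior` are hypothetical. It sums over every alignment built from exactly k ungapped blocks by dynamic programming, then normalizes to obtain a joint posterior over (matrix, k).

```python
def block_sums(x, y, max_blocks, ratio):
    """Return totals[k] for k = 0..max_blocks: the sum, over every alignment
    of x and y made of exactly k ungapped blocks, of the product of
    ratio(a, b) over the aligned residue pairs."""
    n, m = len(x), len(y)
    totals = [1.0]                     # k = 0: only the empty alignment
    prev_end = prev_prefix = None      # tables for k - 1 blocks
    for k in range(1, max_blocks + 1):
        # end[i][j]: weight of alignments whose k-th block ends at pair (i, j)
        end = [[0.0] * (m + 1) for _ in range(n + 1)]
        # prefix[i][j]: sum of end[i'][j'] over i' <= i, j' <= j
        prefix = [[0.0] * (m + 1) for _ in range(n + 1)]
        for i in range(1, n + 1):
            for j in range(1, m + 1):
                if k == 1:
                    start = 1.0        # the first block may start anywhere
                else:
                    # previous block ends strictly earlier, but not at
                    # (i-1, j-1), which would merge the two blocks
                    start = prev_prefix[i - 1][j - 1] - prev_end[i - 1][j - 1]
                end[i][j] = ratio(x[i - 1], y[j - 1]) * (end[i - 1][j - 1] + start)
                prefix[i][j] = (end[i][j] + prefix[i - 1][j]
                                + prefix[i][j - 1] - prefix[i - 1][j - 1])
        totals.append(prefix[n][m])
        prev_end, prev_prefix = end, prefix
    return totals


def posterior(x, y, matrices, max_blocks):
    """Joint posterior over (matrix name, block count) under uniform priors."""
    # Number of alignments with k blocks: the same sum with every ratio = 1.
    counts = block_sums(x, y, max_blocks, lambda a, b: 1.0)
    joint = {}
    for name, ratio in matrices.items():
        sums = block_sums(x, y, max_blocks, ratio)
        for k in range(max_blocks + 1):
            if counts[k] > 0:          # skip block counts that cannot occur
                joint[(name, k)] = sums[k] / counts[k]
    z = sum(joint.values())
    return {key: value / z for key, value in joint.items()}


# Hypothetical toy "matrices": likelihood ratios for an aligned pair.
TOY_MATRICES = {
    "strict": lambda a, b: 4.0 if a == b else 0.5,
    "loose": lambda a, b: 2.0 if a == b else 0.8,
}

if __name__ == "__main__":
    post = posterior("ACDEFGHK", "ACDWWFGHK", TOY_MATRICES, max_blocks=3)
    for (name, k), p in sorted(post.items()):
        print(f"matrix={name:6s} blocks={k}  posterior={p:.4f}")
```

In this toy, the entry for zero blocks plays the role of "the sequences are unrelated", so its posterior mass gives a rough, alignment-free indication of how distant the pair is. Marginalizing the dictionary over one key gives the posterior for the other.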
