• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从多序列比对中自动提取可靠区域。

Automatic extraction of reliable regions from multiple sequence alignments.

作者信息

Lassmann Timo, Sonnhammer Erik Ll

机构信息

Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden.

出版信息

BMC Bioinformatics. 2007 May 24;8 Suppl 5(Suppl 5):S9. doi: 10.1186/1471-2105-8-S5-S9.

DOI:10.1186/1471-2105-8-S5-S9
PMID:17570868
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1892097/
Abstract

BACKGROUND

High quality multiple alignments are crucial in the transfer of annotation from one genome to another. Multiple alignment methods strive to achieve ever increasing levels of average accuracy on benchmark sets while the accuracy of individual alignments is often overlooked.

RESULTS

We have previously developed a method to automatically assess the accuracy and overall difficulty of multiple alignments. This was achieved by a per-residue comparison between alternate alignments of the same sequences. Here we present a key extension to this method, an algorithm to extract similarly aligned regions from several alignments and merge them into a new consensus alignment.

CONCLUSION

We demonstrate that the fraction of correctly aligned residues within the resulting alignments is increased by 25-100 percent compared to the original input alignments, as only the most reliably aligned parts are considered.

摘要

背景

高质量的多序列比对在将注释从一个基因组转移到另一个基因组的过程中至关重要。多序列比对方法致力于在基准数据集上不断提高平均准确率,而单个比对的准确性常常被忽视。

结果

我们之前开发了一种自动评估多序列比对的准确性和整体难度的方法。这是通过对相同序列的交替比对进行逐个残基比较来实现的。在此,我们展示了该方法的一个关键扩展,即一种从多个比对中提取相似比对区域并将它们合并成一个新的一致比对的算法。

结论

我们证明,与原始输入比对相比,所得比对中正确比对残基的比例提高了25%至100%,因为只考虑了最可靠比对的部分。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b362/1892097/c908c131d314/1471-2105-8-S5-S9-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b362/1892097/ff0212d1e0d9/1471-2105-8-S5-S9-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b362/1892097/c908c131d314/1471-2105-8-S5-S9-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b362/1892097/ff0212d1e0d9/1471-2105-8-S5-S9-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b362/1892097/c908c131d314/1471-2105-8-S5-S9-2.jpg

相似文献

1
Automatic extraction of reliable regions from multiple sequence alignments.从多序列比对中自动提取可靠区域。
BMC Bioinformatics. 2007 May 24;8 Suppl 5(Suppl 5):S9. doi: 10.1186/1471-2105-8-S5-S9.
2
Automatic assessment of alignment quality.对齐质量的自动评估。
Nucleic Acids Res. 2005 Dec 16;33(22):7120-8. doi: 10.1093/nar/gki1020. Print 2005.
3
OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy.OXBench:一种用于评估蛋白质多序列比对准确性的基准。
BMC Bioinformatics. 2003 Oct 10;4:47. doi: 10.1186/1471-2105-4-47.
4
Evaluation of PSI-BLAST alignment accuracy in comparison to structural alignments.与结构比对相比,PSI-BLAST比对准确性的评估。
Protein Sci. 2000 Nov;9(11):2278-84. doi: 10.1110/ps.9.11.2278.
5
An adaptive and iterative algorithm for refining multiple sequence alignment.一种用于优化多序列比对的自适应迭代算法。
Comput Biol Chem. 2004 Apr;28(2):141-8. doi: 10.1016/j.compbiolchem.2004.02.001.
6
Multiple alignment by sequence annealing.通过序列退火进行多序列比对。
Bioinformatics. 2007 Jan 15;23(2):e24-9. doi: 10.1093/bioinformatics/btl311.
7
MISHIMA--a new method for high speed multiple alignment of nucleotide sequences of bacterial genome scale data.三岛法——一种用于细菌基因组规模数据的高速多重核苷酸序列比对的新方法。
BMC Bioinformatics. 2010 Mar 18;11:142. doi: 10.1186/1471-2105-11-142.
8
PSI-BLAST-ISS: an intermediate sequence search tool for estimation of the position-specific alignment reliability.PSI-BLAST-ISS:一种用于估计位置特异性比对可靠性的中间序列搜索工具。
BMC Bioinformatics. 2005 Jul 21;6:185. doi: 10.1186/1471-2105-6-185.
9
Analysis and comparison of benchmarks for multiple sequence alignment.多序列比对基准的分析与比较
In Silico Biol. 2006;6(4):321-39.
10
Kalign, Kalignvu and Mumsa: web servers for multiple sequence alignment.Kalign、Kalignvu和Mumsa:用于多序列比对的网络服务器。
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W596-9. doi: 10.1093/nar/gkl191.

引用本文的文献

1
AlignMiner: a Web-based tool for detection of divergent regions in multiple sequence alignments of conserved sequences.AlignMiner:一种基于网络的工具,用于检测保守序列多序列比对中的差异区域。
Algorithms Mol Biol. 2010 Jun 2;5:24. doi: 10.1186/1748-7188-5-24.
2
Reproducing the manual annotation of multiple sequence alignments using a SVM classifier.使用 SVM 分类器重现多重序列比对的手动注释。
Bioinformatics. 2009 Dec 1;25(23):3093-8. doi: 10.1093/bioinformatics/btp552. Epub 2009 Sep 21.
3
H2r: identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments.

本文引用的文献

1
Kalign, Kalignvu and Mumsa: web servers for multiple sequence alignment.Kalign、Kalignvu和Mumsa:用于多序列比对的网络服务器。
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W596-9. doi: 10.1093/nar/gkl191.
2
MACSIMS: multiple alignment of complete sequences information management system.MACSIMS:完整序列信息管理系统的多序列比对
BMC Bioinformatics. 2006 Jun 23;7:318. doi: 10.1186/1471-2105-7-318.
3
Multiple sequence alignment accuracy and phylogenetic inference.多序列比对准确性和系统发育推断
H2r:通过对多序列比对进行基于熵的分析来识别进化上重要的残基。
BMC Bioinformatics. 2008 Mar 18;9:151. doi: 10.1186/1471-2105-9-151.
Syst Biol. 2006 Apr;55(2):314-28. doi: 10.1080/10635150500541730.
4
M-Coffee: combining multiple sequence alignment methods with T-Coffee.M-Coffee:将多种多序列比对方法与T-Coffee相结合。
Nucleic Acids Res. 2006 Mar 23;34(6):1692-9. doi: 10.1093/nar/gkl091. Print 2006.
5
Automatic assessment of alignment quality.对齐质量的自动评估。
Nucleic Acids Res. 2005 Dec 16;33(22):7120-8. doi: 10.1093/nar/gki1020. Print 2005.
6
Kalign--an accurate and fast multiple sequence alignment algorithm.Kalign——一种准确且快速的多序列比对算法。
BMC Bioinformatics. 2005 Dec 12;6:298. doi: 10.1186/1471-2105-6-298.
7
BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark.BAliBASE 3.0:多序列比对基准测试的最新进展。
Proteins. 2005 Oct 1;61(1):127-36. doi: 10.1002/prot.20527.
8
ProbCons: Probabilistic consistency-based multiple sequence alignment.ProbCons:基于概率一致性的多序列比对。
Genome Res. 2005 Feb;15(2):330-40. doi: 10.1101/gr.2821705.
9
MAFFT version 5: improvement in accuracy of multiple sequence alignment.MAFFT 5 版本:多重序列比对准确性的提升。
Nucleic Acids Res. 2005 Jan 20;33(2):511-8. doi: 10.1093/nar/gki198. Print 2005.
10
SABmark--a benchmark for sequence alignment that covers the entire known fold space.SABmark——一种涵盖整个已知折叠空间的序列比对基准。
Bioinformatics. 2005 Apr 1;21(7):1267-8. doi: 10.1093/bioinformatics/bth493. Epub 2004 Aug 27.