LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA.

Authors

Brudno Michael, Do Chuong B, Cooper Gregory M, Kim Michael F, Davydov Eugene, Green Eric D, Sidow Arend, Batzoglou Serafim

Affiliation

Department of Computer Science, Stanford University, Stanford, California 94305-9010, USA.

Publication information

Genome Res. 2003 Apr;13(4):721-31. doi: 10.1101/gr.926603. Epub 2003 Mar 12.

DOI:10.1101/gr.926603
PMID:12654723
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC430158/

Abstract

To compare entire genomes from different species, biologists increasingly need alignment methods that are efficient enough to handle long sequences, and accurate enough to correctly align the conserved biological features between distant species. We present LAGAN, a system for rapid global alignment of two homologous genomic sequences, and Multi-LAGAN, a system for multiple global alignment of genomic sequences. We tested our systems on a data set consisting of greater than 12 Mb of high-quality sequence from 12 vertebrate species. All the sequence was derived from the genomic region orthologous to an approximately 1.5-Mb region on human chromosome 7q31.3. We found that both LAGAN and Multi-LAGAN compare favorably with other leading alignment methods in correctly aligning protein-coding exons, especially between distant homologs such as human and chicken, or human and fugu. Multi-LAGAN produced the most accurate alignments, while requiring just 75 minutes on a personal computer to obtain the multiple alignment of all 12 sequences. Multi-LAGAN is a practical method for generating multiple alignments of long genomic sequences at any evolutionary distance. Our systems are publicly available at http://lagan.stanford.edu.

Similar articles

1
LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA.
Genome Res. 2003 Apr;13(4):721-31. doi: 10.1101/gr.926603. Epub 2003 Mar 12.
2
Glocal alignment: finding rearrangements during alignment.
Bioinformatics. 2003;19 Suppl 1:i54-62. doi: 10.1093/bioinformatics/btg1005.
3
ABC: software for interactive browsing of genomic multiple sequence alignment data.
BMC Bioinformatics. 2004 Dec 8;5:192. doi: 10.1186/1471-2105-5-192.
4
Phylo-VISTA: interactive visualization of multiple DNA sequence alignments.
Bioinformatics. 2004 Mar 22;20(5):636-43. doi: 10.1093/bioinformatics/btg459. Epub 2004 Jan 22.
5
MAVID multiple alignment server.
Nucleic Acids Res. 2003 Jul 1;31(13):3525-6. doi: 10.1093/nar/gkg623.
6
Accurate anchoring alignment of divergent sequences.
Bioinformatics. 2006 Jan 1;22(1):29-34. doi: 10.1093/bioinformatics/bti772. Epub 2005 Nov 13.
7
An introduction to the Lagan alignment toolkit.
Methods Mol Biol. 2007;395:205-20. doi: 10.1007/978-1-59745-514-5_13.
8
MAVID: constrained ancestral alignment of multiple sequences.
Genome Res. 2004 Apr;14(4):693-9. doi: 10.1101/gr.1960404.
9
Benchmarking tools for the alignment of functional noncoding DNA.
BMC Bioinformatics. 2004 Jan 21;5:6. doi: 10.1186/1471-2105-5-6.
10
Genomic multiple sequence alignments: refinement using a genetic algorithm.
BMC Bioinformatics. 2005 Aug 8;6:200. doi: 10.1186/1471-2105-6-200.

Cited by

1
Plastomic evolution and genetic diversity of cultivated sweet cheery (Prunus avium (L.) L.) in China.
BMC Plant Biol. 2025 Aug 8;25(1):1043. doi: 10.1186/s12870-025-07125-1.
2
Comparative analysis of 18 chloroplast genomes reveals genomic diversity and evolutionary dynamics in subtribe Malaxidinae (Orchidaceae).
BMC Plant Biol. 2025 Aug 2;25(1):1013. doi: 10.1186/s12870-025-06772-8.
3
Genome Skimming Reveals Plastome Conservation, Phylogenetic Structure, and Novel Molecular Markers in Valuable Orchid .
Genes (Basel). 2025 Jun 20;16(7):723. doi: 10.3390/genes16070723.
4
ReAlign-P: a vertical iterative realignment method for protein multiple sequence alignment.
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf421.
5
Ultrafast and ultralarge multiple sequence alignments using TWILIGHT.
Bioinformatics. 2025 Jul 1;41(Supplement_1):i332-i341. doi: 10.1093/bioinformatics/btaf212.
6
Establishment, conservation, and innovation of dorsal determination mechanisms during the evolution of vertebrate paired appendages.
bioRxiv. 2025 Jul 5:2025.07.02.662459. doi: 10.1101/2025.07.02.662459.
7
Core gene set of the species .
bioRxiv. 2025 Apr 30:2023.09.07.545205. doi: 10.1101/2023.09.07.545205.
8
A Y-linked duplication of anti-Mullerian hormone is the sex determination gene in threespine stickleback.
bioRxiv. 2025 Apr 29:2025.04.28.650899. doi: 10.1101/2025.04.28.650899.
9
Convergent Evolutionary Dead-End and Breakdown of Hard Chorion in Parental-Egg-Care Fish Reproductive Strategies.
Mol Ecol. 2025 Jul;34(13):e17816. doi: 10.1111/mec.17816. Epub 2025 Jun 2.
10
Historical biogeography and plastome evolution of Commelinaceae Mirb. (Commelinales) corroborate the East Gondwanan origins.
BMC Plant Biol. 2025 Apr 25;25(1):533. doi: 10.1186/s12870-025-06504-y.
