• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于图的贪婪算法,用于对齐多个同源基因列表。

A greedy, graph-based algorithm for the alignment of multiple homologous gene lists.

机构信息

Department of Information Technology, Ghent University, Ghent, Belgium.

出版信息

Bioinformatics. 2011 Mar 15;27(6):749-56. doi: 10.1093/bioinformatics/btr008. Epub 2011 Jan 6.

DOI:10.1093/bioinformatics/btr008
PMID:21216775
Abstract

MOTIVATION

Many comparative genomics studies rely on the correct identification of homologous genomic regions using accurate alignment tools. In such case, the alphabet of the input sequences consists of complete genes, rather than nucleotides or amino acids. As optimal multiple sequence alignment is computationally impractical, a progressive alignment strategy is often employed. However, such an approach is susceptible to the propagation of alignment errors in early pairwise alignment steps, especially when dealing with strongly diverged genomic regions. In this article, we present a novel accurate and efficient greedy, graph-based algorithm for the alignment of multiple homologous genomic segments, represented as ordered gene lists.

RESULTS

Based on provable properties of the graph structure, several heuristics are developed to resolve local alignment conflicts that occur due to gene duplication and/or rearrangement events on the different genomic segments. The performance of the algorithm is assessed by comparing the alignment results of homologous genomic segments in Arabidopsis thaliana to those obtained by using both a progressive alignment method and an earlier graph-based implementation. Especially for datasets that contain strongly diverged segments, the proposed method achieves a substantially higher alignment accuracy, and proves to be sufficiently fast for large datasets including a few dozens of eukaryotic genomes.

AVAILABILITY

http://bioinformatics.psb.ugent.be/software. The algorithm is implemented as a part of the i-ADHoRe 3.0 package.

摘要

动机

许多比较基因组学研究依赖于使用准确的对齐工具正确识别同源基因组区域。在这种情况下,输入序列的字母表由完整的基因组成,而不是核苷酸或氨基酸。由于最优的多重序列比对在计算上是不切实际的,因此通常采用渐进对齐策略。然而,这种方法容易在早期的两两比对步骤中传播比对错误,尤其是在处理高度分化的基因组区域时。在本文中,我们提出了一种新的准确而高效的贪婪、基于图的算法,用于对齐多个同源基因片段,这些片段表示为有序的基因列表。

结果

基于图结构的可证明性质,开发了几种启发式方法来解决由于不同基因组片段上的基因复制和/或重排事件而导致的局部比对冲突。通过将拟南芥同源基因片段的比对结果与使用渐进比对方法和早期基于图的实现方法得到的结果进行比较,评估了算法的性能。特别是对于包含高度分化片段的数据集,所提出的方法实现了更高的比对准确性,并且对于包含几十个真核基因组的大型数据集来说,速度足够快。

可用性

http://bioinformatics.psb.ugent.be/software。该算法作为 i-ADHoRe 3.0 包的一部分实现。

相似文献

1
A greedy, graph-based algorithm for the alignment of multiple homologous gene lists.一种基于图的贪婪算法,用于对齐多个同源基因列表。
Bioinformatics. 2011 Mar 15;27(6):749-56. doi: 10.1093/bioinformatics/btr008. Epub 2011 Jan 6.
2
Accurate identification of orthologous segments among multiple genomes.准确识别多个基因组之间的直系同源片段。
Bioinformatics. 2009 Apr 1;25(7):853-60. doi: 10.1093/bioinformatics/btp070. Epub 2009 Feb 2.
3
i-ADHoRe 2.0: an improved tool to detect degenerated genomic homology using genomic profiles.i-ADHoRe 2.0:一种利用基因组图谱检测退化基因组同源性的改进工具。
Bioinformatics. 2008 Jan 1;24(1):127-8. doi: 10.1093/bioinformatics/btm449. Epub 2007 Oct 17.
4
Segment-based multiple sequence alignment.基于片段的多序列比对。
Bioinformatics. 2008 Aug 15;24(16):i187-92. doi: 10.1093/bioinformatics/btn281.
5
PanTools: representation, storage and exploration of pan-genomic data.泛基因组工具:泛基因组数据的表示、存储与探索
Bioinformatics. 2016 Sep 1;32(17):i487-i493. doi: 10.1093/bioinformatics/btw455.
6
GMAP and GSNAP for Genomic Sequence Alignment: Enhancements to Speed, Accuracy, and Functionality.用于基因组序列比对的GMAP和GSNAP:速度、准确性及功能的提升
Methods Mol Biol. 2016;1418:283-334. doi: 10.1007/978-1-4939-3578-9_15.
7
Genomic multiple sequence alignments: refinement using a genetic algorithm.基因组多序列比对:使用遗传算法进行优化
BMC Bioinformatics. 2005 Aug 8;6:200. doi: 10.1186/1471-2105-6-200.
8
Accurate anchoring alignment of divergent sequences.发散序列的精确锚定比对。
Bioinformatics. 2006 Jan 1;22(1):29-34. doi: 10.1093/bioinformatics/bti772. Epub 2005 Nov 13.
9
Gene structure prediction by spliced alignment of genomic DNA with protein sequences: increased accuracy by differential splice site scoring.通过基因组DNA与蛋白质序列的剪接比对进行基因结构预测:通过差异剪接位点评分提高准确性。
J Mol Biol. 2000 Apr 14;297(5):1075-85. doi: 10.1006/jmbi.2000.3641.
10
ADEPT: a domain independent sequence alignment strategy for gpu architectures.ADEPT:一种适用于 GPU 架构的与领域无关的序列比对策略。
BMC Bioinformatics. 2020 Sep 15;21(1):406. doi: 10.1186/s12859-020-03720-1.

引用本文的文献

1
Identification of clade-wide putative -regulatory elements from conserved non-coding sequences in Cucurbitaceae genomes.从葫芦科基因组中保守非编码序列鉴定全进化枝推定调控元件
Hortic Res. 2023 Feb 28;10(4):uhad038. doi: 10.1093/hr/uhad038. eCollection 2023 Apr.
2
Diploid and tetraploid genomes of Acorus and the evolution of monocots.菖蒲属的二倍体和四倍体基因组和单子叶植物的演化。
Nat Commun. 2023 Jun 20;14(1):3661. doi: 10.1038/s41467-023-38829-3.
3
The genome of the king protea, Protea cynaroides.帝王花基因组,普洛蒂亚属。
Plant J. 2023 Jan;113(2):262-276. doi: 10.1111/tpj.16044. Epub 2022 Dec 10.
4
Genomes of leafy and leafless Platanthera orchids illuminate the evolution of mycoheterotrophy.有叶和无叶角盘兰属植物的基因组阐明了菌根异养的进化。
Nat Plants. 2022 Apr;8(4):373-388. doi: 10.1038/s41477-022-01127-9. Epub 2022 Apr 21.
5
The Welwitschia genome reveals a unique biology underpinning extreme longevity in deserts.威氏特有基因组揭示了其在沙漠中极端长寿的独特生物学基础。
Nat Commun. 2021 Jul 12;12(1):4247. doi: 10.1038/s41467-021-24528-4.
6
An Overview of Duplicated Gene Detection Methods: Why the Duplication Mechanism Has to Be Accounted for in Their Choice.重复基因检测方法概述:选择重复基因检测方法时为何必须考虑重复机制。
Genes (Basel). 2020 Sep 4;11(9):1046. doi: 10.3390/genes11091046.
7
The water lily genome and the early evolution of flowering plants.睡莲花基因组与早期开花植物进化。
Nature. 2020 Jan;577(7788):79-84. doi: 10.1038/s41586-019-1852-5. Epub 2019 Dec 18.
8
Rapid functional divergence after small-scale gene duplication in grasses.禾本科中小规模基因复制后的快速功能分化。
BMC Evol Biol. 2019 May 2;19(1):97. doi: 10.1186/s12862-019-1415-2.
9
Reciprocally Retained Genes in the Angiosperm Lineage Show the Hallmarks of Dosage Balance Sensitivity.被子植物谱系中相互保留的基因具有剂量平衡敏感性的特征。
Plant Cell. 2017 Nov;29(11):2766-2785. doi: 10.1105/tpc.17.00313. Epub 2017 Oct 23.
10
The Apostasia genome and the evolution of orchids.麻兰属植物基因组与兰花的进化
Nature. 2017 Sep 21;549(7672):379-383. doi: 10.1038/nature23897. Epub 2017 Sep 13.