Suppr超能文献

i-ADHoRe 3.0——在超大数据集中快速且灵敏地检测基因组同源性。

i-ADHoRe 3.0--fast and sensitive detection of genomic homology in extremely large data sets.

机构信息

Department of Plant Systems Biology, VIB, B-9050 Ghent, Belgium.

出版信息

Nucleic Acids Res. 2012 Jan;40(2):e11. doi: 10.1093/nar/gkr955. Epub 2011 Nov 18.

Abstract

Comparative genomics is a powerful means to gain insight into the evolutionary processes that shape the genomes of related species. As the number of sequenced genomes increases, the development of software to perform accurate cross-species analyses becomes indispensable. However, many implementations that have the ability to compare multiple genomes exhibit unfavorable computational and memory requirements, limiting the number of genomes that can be analyzed in one run. Here, we present a software package to unveil genomic homology based on the identification of conservation of gene content and gene order (collinearity), i-ADHoRe 3.0, and its application to eukaryotic genomes. The use of efficient algorithms and support for parallel computing enable the analysis of large-scale data sets. Unlike other tools, i-ADHoRe can process the Ensembl data set, containing 49 species, in 1 h. Furthermore, the profile search is more sensitive to detect degenerate genomic homology than chaining pairwise collinearity information based on transitive homology. From ultra-conserved collinear regions between mammals and birds, by integrating coexpression information and protein-protein interactions, we identified more than 400 regions in the human genome showing significant functional coherence. The different algorithmical improvements ensure that i-ADHoRe 3.0 will remain a powerful tool to study genome evolution.

摘要

比较基因组学是深入了解塑造相关物种基因组的进化过程的有力手段。随着测序基因组数量的增加,开发能够执行精确跨物种分析的软件变得不可或缺。然而,许多具有比较多个基因组能力的实现方法表现出不利的计算和内存要求,限制了一次运行中可以分析的基因组数量。在这里,我们介绍了一个软件包,该软件包基于基因内容和基因顺序(共线性)的保守性识别来揭示基因组同源性,即 i-ADHoRe 3.0,并将其应用于真核生物基因组。高效算法的使用和对并行计算的支持使大规模数据集的分析成为可能。与其他工具不同,i-ADHoRe 可以在 1 小时内处理包含 49 个物种的 Ensembl 数据集。此外,与基于传递同源性的连锁成对共线性信息相比,谱搜索对检测退化基因组同源性更敏感。通过整合共表达信息和蛋白质-蛋白质相互作用,我们从哺乳动物和鸟类之间的超保守共线性区域中鉴定出了人类基因组中 400 多个具有显著功能一致性的区域。不同的算法改进确保了 i-ADHoRe 3.0 将继续成为研究基因组进化的有力工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c2d/3258164/e2694118c260/gkr955f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验