• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于保守基因邻接关系的新基因组相似性度量方法。

New Genome Similarity Measures based on Conserved Gene Adjacencies.

作者信息

Doerr Daniel, Kowada Luis Antonio B, Araujo Eloi, Deshpande Shachi, Dantas Simone, Moret Bernard M E, Stoye Jens

机构信息

1 École Polytechnique Fédérale de Lausanne , Lausanne, Switzerland .

2 Universidade Federal Fluminense , Niterói, Brazil .

出版信息

J Comput Biol. 2017 Jun;24(6):616-634. doi: 10.1089/cmb.2017.0065.

DOI:10.1089/cmb.2017.0065
PMID:28590847
Abstract

Many important questions in molecular biology, evolution, and biomedicine can be addressed by comparative genomic approaches. One of the basic tasks when comparing genomes is the definition of measures of similarity (or dissimilarity) between two genomes, for example, to elucidate the phylogenetic relationships between species. The power of different genome comparison methods varies with the underlying formal model of a genome. The simplest models impose the strong restriction that each genome under study must contain the same genes, each in exactly one copy. More realistic models allow several copies of a gene in a genome. One speaks of gene families, and comparative genomic methods that allow this kind of input are called gene family-based. The most powerful-but also most complex-models avoid this preprocessing of the input data and instead integrate the family assignment within the comparative analysis. Such methods are called gene family-free. In this article, we study an intermediate approach between family-based and family-free genomic similarity measures. Introducing this simpler model, called gene connections, we focus on the combinatorial aspects of gene family-free genome comparison. While in most cases, the computational costs to the general family-free case are the same, we also find an instance where the gene connections model has lower complexity. Within the gene connections model, we define three variants of genomic similarity measures that have different expression powers. We give polynomial-time algorithms for two of them, while we show NP-hardness for the third, most powerful one. We also generalize the measures and algorithms to make them more robust against recent local disruptions in gene order. Our theoretical findings are supported by experimental results, proving the applicability and performance of our newly defined similarity measures.

摘要

分子生物学、进化和生物医学中的许多重要问题都可以通过比较基因组方法来解决。比较基因组时的一项基本任务是定义两个基因组之间相似性(或不相似性)的度量,例如,以阐明物种之间的系统发育关系。不同基因组比较方法的能力因基因组的底层形式模型而异。最简单的模型施加了严格的限制,即所研究的每个基因组必须包含相同的基因,且每个基因只有一个拷贝。更现实的模型允许基因组中有一个基因的多个拷贝。人们称之为基因家族,允许这种输入的比较基因组方法称为基于基因家族的方法。最强大但也最复杂的模型避免了对输入数据的这种预处理,而是在比较分析中整合家族分配。这种方法称为无基因家族的方法。在本文中,我们研究了一种介于基于家族和无家族的基因组相似性度量之间的中间方法。引入这个更简单的模型,称为基因连接,我们专注于无基因家族的基因组比较的组合方面。虽然在大多数情况下,与一般的无家族情况相比计算成本相同,但我们也发现了一个实例,其中基因连接模型具有更低的复杂度。在基因连接模型中,我们定义了三种具有不同表达能力的基因组相似性度量变体。我们为其中两种给出了多项式时间算法,而对于第三种最强大的变体,我们证明了它是NP难的。我们还对这些度量和算法进行了推广,使其对最近基因顺序的局部破坏更具鲁棒性。我们的理论发现得到了实验结果的支持,证明了我们新定义的相似性度量的适用性和性能。

相似文献

1
New Genome Similarity Measures based on Conserved Gene Adjacencies.基于保守基因邻接关系的新基因组相似性度量方法。
J Comput Biol. 2017 Jun;24(6):616-634. doi: 10.1089/cmb.2017.0065.
2
Efficient tools for computing the number of breakpoints and the number of adjacencies between two genomes with duplicate genes.用于计算具有重复基因的两个基因组之间断点数量和邻接数量的高效工具。
J Comput Biol. 2008 Oct;15(8):1093-115. doi: 10.1089/cmb.2008.0061.
3
The SCJ Small Parsimony Problem for Weighted Gene Adjacencies.加权基因邻接的 SCJ 简约性问题。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1364-1373. doi: 10.1109/TCBB.2017.2661761. Epub 2017 Jan 31.
4
Reconstruction of ancestral gene orders using intermediate genomes.利用中间基因组重建祖先基因顺序
BMC Bioinformatics. 2015;16 Suppl 14(Suppl 14):S3. doi: 10.1186/1471-2105-16-S14-S3. Epub 2015 Oct 2.
5
On the similarity of sets of permutations and its applications to genome comparison.关于排列集合的相似性及其在基因组比较中的应用
J Comput Biol. 2006 Sep;13(7):1340-54. doi: 10.1089/cmb.2006.13.1340.
6
Comparing genomes with duplications: a computational complexity point of view.从计算复杂性角度比较带有重复序列的基因组。
IEEE/ACM Trans Comput Biol Bioinform. 2007 Oct-Dec;4(4):523-34. doi: 10.1109/TCBB.2007.1069.
7
Family-Free Genome Comparison.无家族基因组比较
Methods Mol Biol. 2018;1704:331-342. doi: 10.1007/978-1-4939-7463-4_12.
8
Fast ancestral gene order reconstruction of genomes with unequal gene content.具有不等基因含量的基因组的快速祖先基因顺序重建
BMC Bioinformatics. 2016 Nov 11;17(Suppl 14):413. doi: 10.1186/s12859-016-1261-9.
9
Family-Free Genome Comparison.无家族基因组比较。
Methods Mol Biol. 2024;2802:57-72. doi: 10.1007/978-1-0716-3838-5_3.
10
Computing the family-free DCJ similarity.计算无亲缘关系的 DCJ 相似度。
BMC Bioinformatics. 2018 May 8;19(Suppl 6):152. doi: 10.1186/s12859-018-2130-5.

引用本文的文献

1
The gene family-free median of three.三个无基因家族的中位数
Algorithms Mol Biol. 2017 May 26;12:14. doi: 10.1186/s13015-017-0106-z. eCollection 2017.