• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

拷贝数杂合变异对基因家族进化的影响。

The Effect of Copy Number Hemiplasy on Gene Family Evolution.

机构信息

School of Mathematics and Statistics/Melbourne Integrative Genomics, The University of Melbourne, Melbourne 3010, Australia.

Alibaba Cloud, Hangzhou, China.

出版信息

Syst Biol. 2024 Jul 27;73(2):355-374. doi: 10.1093/sysbio/syae007.

DOI:10.1093/sysbio/syae007
PMID:38330161
Abstract

The evolution of gene families is complex, involving gene-level evolutionary events such as gene duplication, horizontal gene transfer, and gene loss, and other processes such as incomplete lineage sorting (ILS). Because of this, topological differences often exist between gene trees and species trees. A number of models have been recently developed to explain these discrepancies, the most realistic of which attempts to consider both gene-level events and ILS. When unified in a single model, the interaction between ILS and gene-level events can cause polymorphism in gene copy number, which we refer to as copy number hemiplasy (CNH). In this paper, we extend the Wright-Fisher process to include duplications and losses over several species, and show that the probability of CNH for this process can be significant. We study how well two unified models-multilocus multispecies coalescent (MLMSC), which models CNH, and duplication, loss, and coalescence (DLCoal), which does not-approximate the Wright-Fisher process with duplication and loss. We then study the effect of CNH on gene family evolution by comparing MLMSC and DLCoal. We generate comparable gene trees under both models, showing significant differences in various summary statistics; most importantly, CNH reduces the number of gene copies greatly. If this is not taken into account, the traditional method of estimating duplication rates (by counting the number of gene copies) becomes inaccurate. The simulated gene trees are also used for species tree inference with the summary methods ASTRAL and ASTRAL-Pro, demonstrating that their accuracy, based on CNH-unaware simulations calibrated on real data, may have been overestimated.

摘要

基因家族的进化是复杂的,涉及基因水平的进化事件,如基因复制、水平基因转移和基因丢失,以及不完全谱系分选(ILS)等其他过程。因此,基因树和种系发生树之间经常存在拓扑差异。最近已经开发了许多模型来解释这些差异,其中最现实的模型试图同时考虑基因水平事件和 ILS。当统一在一个单一的模型中时,ILS 和基因水平事件之间的相互作用会导致基因拷贝数的多态性,我们称之为拷贝数半合子(CNH)。在本文中,我们将 Wright-Fisher 过程扩展到包括几个物种的复制和丢失,并表明该过程的 CNH 概率可能很大。我们研究了两种统一模型——多基因多位点种系发生(MLMSC),它模拟 CNH,以及复制、丢失和合并(DLCoal),后者不模拟 CNH——与复制和丢失的 Wright-Fisher 过程的拟合程度。然后,我们通过比较 MLMSC 和 DLCoal 来研究 CNH 对基因家族进化的影响。我们在这两种模型下生成可比的基因树,显示出各种汇总统计数据的显著差异;最重要的是,CNH 大大减少了基因拷贝数。如果不考虑这一点,传统的估计复制率的方法(通过计算基因拷贝数)就变得不准确。模拟的基因树也用于基于汇总方法 ASTRAL 和 ASTRAL-Pro 的种系发生树推断,表明它们的准确性,基于在真实数据上校准的无 CNH 模拟,可能被高估了。

相似文献

1
The Effect of Copy Number Hemiplasy on Gene Family Evolution.拷贝数杂合变异对基因家族进化的影响。
Syst Biol. 2024 Jul 27;73(2):355-374. doi: 10.1093/sysbio/syae007.
2
The Multilocus Multispecies Coalescent: A Flexible New Model of Gene Family Evolution.多位点多物种合并模型:一种基因家族进化的灵活新模型。
Syst Biol. 2021 Jun 16;70(4):822-837. doi: 10.1093/sysbio/syaa084.
3
Counting and sampling gene family evolutionary histories in the duplication-loss and duplication-loss-transfer models.在重复-缺失和重复-缺失-转移模型中计算和采样基因家族进化历史。
J Math Biol. 2020 Apr;80(5):1353-1388. doi: 10.1007/s00285-019-01465-x. Epub 2020 Feb 15.
4
Quartet-based inference is statistically consistent under the unified duplication-loss-coalescence model.基于四重体的推断在统一的复制-丢失-合并模型下是统计一致的。
Bioinformatics. 2021 Nov 18;37(22):4064-4074. doi: 10.1093/bioinformatics/btab414.
5
Unified modeling of gene duplication, loss, and coalescence using a locus tree.利用基因座树对基因复制、丢失和合并进行统一建模。
Genome Res. 2012 Apr;22(4):755-65. doi: 10.1101/gr.123901.111. Epub 2012 Jan 23.
6
Most parsimonious reconciliation in the presence of gene duplication, loss, and deep coalescence using labeled coalescent trees.使用标记合并树在存在基因重复、丢失和深度合并的情况下进行最简约的协调。
Genome Res. 2014 Mar;24(3):475-86. doi: 10.1101/gr.161968.113. Epub 2013 Dec 5.
7
Species Tree Estimation and the Impact of Gene Loss Following Whole-Genome Duplication.种系树估计及全基因组复制后基因丢失的影响。
Syst Biol. 2022 Oct 12;71(6):1348-1361. doi: 10.1093/sysbio/syac040.
8
Effect of Incomplete Lineage Sorting On Tree-Reconciliation-Based Inference of Gene Duplication.不完全谱系分选对基于树-和解的基因复制推断的影响。
IEEE/ACM Trans Comput Biol Bioinform. 2014 May-Jun;11(3):477-85. doi: 10.1109/TCBB.2013.2297913.
9
Evaluating Summary Methods for Multilocus Species Tree Estimation in the Presence of Incomplete Lineage Sorting.在存在不完全谱系分选的情况下评估多位点物种树估计的总结方法。
Syst Biol. 2016 May;65(3):366-80. doi: 10.1093/sysbio/syu063. Epub 2014 Aug 26.
10
Maximum likelihood estimates of species trees: how accuracy of phylogenetic inference depends upon the divergence history and sampling design.最大似然估计物种树:系统发育推断的准确性如何取决于分歧历史和采样设计。
Syst Biol. 2009 Oct;58(5):501-8. doi: 10.1093/sysbio/syp045. Epub 2009 Aug 20.