Centre for Genomic Regulation (CRG), Barcelona, Spain.
Bioinformatics. 2011 Jan 1;27(1):38-45. doi: 10.1093/bioinformatics/btq609. Epub 2010 Nov 11.
In genome-wide analyses, the relative age of gene duplications is often estimated by measuring the rate of synonymous substitutions (dS) between paralogous sequences. On the other hand, recent studies have shown the feasibility of inferring, at genomic scales, the relative age of duplication events from the topology of gene family trees. This represents a promising alternative for large surveys requiring an automatic methodology to establish a timeline of duplication events and that are usually limited to the use of dS, which presents known limitations such as a fast saturation of the signal. However, both measures have never been compared in a common framework.
Topology-based placement of duplications on a relative time scale corresponding to periods between speciation events were found to be highly consistent, providing the same placement for 67-84% of a reliable set of gene pairs duplicated in a single event. For recent evolutionary periods, dS and topological measures showed a strong correlation. We conclude that the topology-based approach is more appropriate for assigning duplications to temporal scales when analyses need to include ancient events, and that the study of recent duplications may benefit from a combination of dS and topology information.
在全基因组分析中,基因复制的相对年龄通常通过测量同源序列之间的同义替换率(dS)来估计。另一方面,最近的研究表明,从基因家族树的拓扑结构推断基因组尺度上复制事件的相对年龄是可行的。这对于需要自动方法来建立复制事件时间表的大型调查来说是一种很有前途的替代方法,而这种方法通常仅限于使用 dS,因为 dS 存在已知的局限性,例如信号快速饱和。然而,这两种方法从未在共同的框架中进行过比较。
在与物种形成事件之间的时间段相对应的相对时间尺度上,基于拓扑的复制放置被发现高度一致,对于在单次事件中复制的一组可靠的基因对中的 67-84%,提供了相同的放置。对于最近的进化时期,dS 和拓扑测量具有很强的相关性。我们得出结论,当分析需要包括古老事件时,基于拓扑的方法更适合将复制分配到时间尺度上,而最近的复制研究可能受益于 dS 和拓扑信息的结合。