Department of Biology, University of Rochester, Rochester, NY, 14620.
Integr Comp Biol. 2024 Sep 17;64(2):556-564. doi: 10.1093/icb/icae009.
Gene duplicates, or paralogs, serve as a major source of new genetic material and comprise seeds for evolutionary innovation. While originally thought to be quickly lost or nonfunctionalized following duplication, now a vast number of paralogs are known to be retained in a functional state. Daughter paralogs can provide robustness through redundancy, specialize via sub-functionalization, or neo-functionalize to play new roles. Indeed, the duplication and divergence of developmental genes have played a monumental role in the evolution of animal forms (e.g., Hox genes). Still, despite their prevalence and evolutionary importance, the precise detection of gene duplicates in newly sequenced genomes remains technically challenging and often overlooked. This presents an especially pertinent problem for evolutionary developmental biology, where hypothesis testing requires accurate detection of changes in gene expression and function, often in nontraditional model species. Frequently, these analyses rely on molecular reagents designed within coding sequences that may be highly similar in recently duplicated paralogs, leading to cross-reactivity and spurious results. Thus, care is needed to avoid erroneously assigning diverged functions of paralogs to a single gene, and potentially misinterpreting evolutionary history. This perspective aims to overview the prevalence and importance of paralogs and to shed light on the difficulty of their detection and analysis while offering potential solutions.
基因副本,或称为“旁系同源基因”,是新遗传物质的主要来源,也是进化创新的种子。虽然最初认为在复制后会迅速丢失或失去功能,但现在已知大量的旁系同源基因保持在功能状态。子旁系同源基因可以通过冗余提供鲁棒性,可以通过亚功能化进行专业化,也可以新功能化来发挥新作用。事实上,发育基因的复制和分化在动物形态的进化中发挥了重要作用(例如,Hox 基因)。尽管它们普遍存在且具有进化意义,但在新测序的基因组中精确检测基因副本仍然具有技术挑战性,而且经常被忽视。对于进化发育生物学来说,这是一个特别相关的问题,因为假设检验需要准确检测基因表达和功能的变化,而这些变化通常发生在非传统的模式物种中。这些分析经常依赖于在编码序列内设计的分子试剂,这些试剂在最近复制的旁系同源基因中可能非常相似,导致交叉反应和虚假结果。因此,需要小心避免错误地将分化的旁系同源基因功能分配给单个基因,并可能错误地解释进化历史。本观点旨在概述旁系同源基因的普遍性和重要性,并阐明检测和分析它们的困难,同时提供潜在的解决方案。