ICAR-National Institute for Plant Biotechnology, New Delhi, 110012, India; PG School, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India; Division of Genomic Resources, ICAR-National Bureau of Plant Genetic Resources, New Delhi, 110012, India.
ICAR-National Institute for Plant Biotechnology, New Delhi, 110012, India; PG School, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India.
Plant Physiol Biochem. 2023 Oct;203:108062. doi: 10.1016/j.plaphy.2023.108062. Epub 2023 Sep 27.
The complexities of a genome are underpinned to the vast expanses of the intergenic region, which constitutes ∼97-98% of the genome. This region is essentially composed of what is colloquially referred to as the "junk DNA" and is composed of various elements like transposons, repeats, pseudogenes, etc. The latter have long been considered as dead elements merely contributing to transcriptional noise in the genome. Many studies now describe the previously unknown regulatory functions of these genes. Recent advances in the Next-generation sequencing (NGS) technologies have allowed unprecedented access to these regions. With the availability of whole genome sequences of more than 788 different plant species in past 20 years, genome annotation has become feasible like never before. Different bioinformatic pipelines are available for the identification of pseudogenes. However, still little is known about their biological functions. The functional validation of these genes remains challenging and research in this area is still in infancy, particularly in plants. CRISPR/Cas-based genome editing could provide solutions to understand the biological roles of these genes by allowing creation of precise edits within these genes. The possibility of pseudogene reactivation or resurrection as has been demonstrated in a few studies might open new avenues of genetic manipulation to yield a desirable phenotype. This review aims at comprehensively summarizing the progress made with regards to the identification of pseudogenes and understanding their biological functions in plants.
基因组的复杂性是由基因间区的广阔区域支撑的,该区域构成了基因组的 97-98%左右。这个区域主要由通俗上称为“垃圾 DNA”的各种元素组成,包括转座子、重复序列、假基因等。这些假基因长期以来被认为是仅仅为基因组中的转录噪声做出贡献的死基因。现在许多研究描述了这些以前未知的基因的调节功能。新一代测序(NGS)技术的最新进展使得对这些区域进行前所未有的研究成为可能。在过去的 20 年中,已经获得了超过 788 种不同植物物种的全基因组序列,因此基因组注释变得前所未有地可行。有多种生物信息学管道可用于识别假基因。然而,我们对它们的生物学功能仍然知之甚少。这些基因的功能验证仍然具有挑战性,该领域的研究仍处于起步阶段,特别是在植物中。基于 CRISPR/Cas 的基因组编辑可以通过在这些基因内进行精确编辑来提供理解这些基因生物学作用的解决方案。一些研究已经证明假基因的重新激活或复活的可能性,这可能为遗传操作开辟新途径,以产生所需的表型。这篇综述旨在全面总结在鉴定植物中的假基因及其生物学功能方面取得的进展。