Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA.
Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel.
Philos Trans R Soc Lond B Biol Sci. 2020 Mar 30;375(1795):20190345. doi: 10.1098/rstb.2019.0345. Epub 2020 Feb 10.
Next-generation sequencing approaches have fundamentally changed the types of questions that can be asked about gene function and regulation. With the goal of approaching truly genome-wide quantifications of all the interaction partners and downstream effects of particular genes, these quantitative assays have allowed for an unprecedented level of detail in exploring biological interactions. However, many challenges remain in our ability to accurately describe and quantify the interactions that take place in those hard to reach and extremely repetitive regions of our genome comprised mostly of transposable elements (TEs). Tools dedicated to TE-derived sequences have lagged behind, making the inclusion of these sequences in genome-wide analyses difficult. Recent improvements, both computational and experimental, allow for the better inclusion of TE sequences in genomic assays and a renewed appreciation for the importance of TE biology. This review will discuss the recent improvements that have been made in the computational analysis of TE-derived sequences as well as the areas where such analysis still proves difficult. This article is part of a discussion meeting issue 'Crossroads between transposons and gene regulation'.
下一代测序方法从根本上改变了可以提出的关于基因功能和调控的问题类型。这些定量检测方法以实现对特定基因的所有相互作用伙伴和下游效应进行真正的全基因组定量为目标,使得在探索生物相互作用方面达到了前所未有的详细程度。然而,我们在准确描述和量化发生在我们基因组中那些难以触及且极其重复的区域(主要由转座元件 (TE) 组成)方面仍然存在许多挑战。专门针对 TE 衍生序列的工具已经落后,使得这些序列难以包含在全基因组分析中。最近在计算和实验方面的改进,使得 TE 序列在基因组分析中更好地被包含,并重新认识到 TE 生物学的重要性。这篇综述将讨论在 TE 衍生序列的计算分析方面取得的最新进展,以及在这些分析中仍然存在困难的领域。本文是关于“转座子和基因调控之间的交叉点”的讨论会议议题的一部分。