Genetics and Genomics of Plants, Center for Biotechnology (CeBiTec), Bielefeld University, Sequenz 1, 33615, Bielefeld, Germany.
Evolution and Diversity, Department of Plant Sciences, University of Cambridge, Cambridge, UK.
BMC Genomics. 2021 Aug 6;22(1):599. doi: 10.1186/s12864-021-07877-8.
Experimental proof of gene function assignments in plants is based on mutant analyses. T-DNA insertion lines provided an invaluable resource of mutants and enabled systematic reverse genetics-based investigation of the functions of Arabidopsis thaliana genes during the last decades.
We sequenced the genomes of 14 A. thaliana GABI-Kat T-DNA insertion lines, which eluded flanking sequence tag-based attempts to characterize their insertion loci, with Oxford Nanopore Technologies (ONT) long reads. Complex T-DNA insertions were resolved and 11 previously unknown T-DNA loci identified, resulting in about 2 T-DNA insertions per line and suggesting that this number was previously underestimated. T-DNA mutagenesis caused fusions of chromosomes along with compensating translocations to keep the gene set complete throughout meiosis. Also, an inverted duplication of 800 kbp was detected. About 10 % of GABI-Kat lines might be affected by chromosomal rearrangements, some of which do not involve T-DNA. Local assembly of selected reads was shown to be a computationally effective method to resolve the structure of T-DNA insertion loci. We developed an automated workflow to support investigation of long read data from T-DNA insertion lines. All steps from DNA extraction to assembly of T-DNA loci can be completed within days.
Long read sequencing was demonstrated to be an effective way to resolve complex T-DNA insertions and chromosome fusions. Many T-DNA insertions comprise not just a single T-DNA, but complex arrays of multiple T-DNAs. It is becoming obvious that T-DNA insertion alleles must be characterized by exact identification of both T-DNA::genome junctions to generate clear genotype-to-phenotype relations.
植物基因功能分配的实验证明基于突变体分析。T-DNA 插入系提供了宝贵的突变体资源,并使基于反向遗传学的拟南芥基因功能的系统研究成为可能。
我们使用 Oxford Nanopore Technologies(ONT)长读长对 14 个 A. thaliana GABI-Kat T-DNA 插入系进行了测序,这些插入系逃脱了侧翼序列标签对其插入位点的特征描述的尝试,从而确定了复杂的 T-DNA 插入,并鉴定出 11 个先前未知的 T-DNA 位点,导致每条线约有 2 个 T-DNA 插入,这表明此前对 T-DNA 插入的估计不足。T-DNA 诱变导致染色体融合,同时发生补偿性易位,以保持减数分裂过程中基因组的完整性。此外,还检测到 800 kbp 的倒置重复。约 10%的 GABI-Kat 系可能受到染色体重排的影响,其中一些重排不涉及 T-DNA。选择读取的局部组装被证明是一种有效的方法,可以解析 T-DNA 插入位点的结构。我们开发了一种自动化工作流程,以支持对 T-DNA 插入系的长读数据进行研究。从 DNA 提取到 T-DNA 位点组装的所有步骤都可以在几天内完成。
长读测序被证明是解析复杂 T-DNA 插入和染色体融合的有效方法。许多 T-DNA 插入不仅包含单个 T-DNA,还包含多个 T-DNA 的复杂阵列。很明显,T-DNA 插入等位基因必须通过精确识别 T-DNA::基因组接头来进行特征描述,以生成明确的基因型-表型关系。