利用长读测序技术鉴定和表征隐匿性人类特异性 LINE-1 插入。

Identification and characterization of occult human-specific LINE-1 insertions using long-read sequencing technology.

机构信息

Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA.

Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA.

出版信息

Nucleic Acids Res. 2020 Feb 20;48(3):1146-1163. doi: 10.1093/nar/gkz1173.

DOI:10.1093/nar/gkz1173

PMID:31853540

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7026601/

Abstract

Long Interspersed Element-1 (LINE-1) retrotransposition contributes to inter- and intra-individual genetic variation and occasionally can lead to human genetic disorders. Various strategies have been developed to identify human-specific LINE-1 (L1Hs) insertions from short-read whole genome sequencing (WGS) data; however, they have limitations in detecting insertions in complex repetitive genomic regions. Here, we developed a computational tool (PALMER) and used it to identify 203 non-reference L1Hs insertions in the NA12878 benchmark genome. Using PacBio long-read sequencing data, we identified L1Hs insertions that were absent in previous short-read studies (90/203). Approximately 81% (73/90) of the L1Hs insertions reside within endogenous LINE-1 sequences in the reference assembly and the analysis of unique breakpoint junction sequences revealed 63% (57/90) of these L1Hs insertions could be genotyped in 1000 Genomes Project sequences. Moreover, we observed that amplification biases encountered in single-cell WGS experiments led to a wide variation in L1Hs insertion detection rates between four individual NA12878 cells; under-amplification limited detection to 32% (65/203) of insertions, whereas over-amplification increased false positive calls. In sum, these data indicate that L1Hs insertions are often missed using standard short-read sequencing approaches and long-read sequencing approaches can significantly improve the detection of L1Hs insertions present in individual genomes.

摘要

长散布元件-1（LINE-1）反转录转座导致个体间和个体内遗传变异，偶尔会导致人类遗传疾病。已经开发了各种策略来从短读长全基因组测序（WGS）数据中鉴定人类特异性 LINE-1（L1Hs）插入；然而，它们在检测复杂重复基因组区域中的插入方面存在局限性。在这里，我们开发了一种计算工具（PALMER），并使用它在 NA12878 基准基因组中鉴定了 203 个非参考 L1Hs 插入。使用 PacBio 长读测序数据，我们鉴定了先前短读研究中缺失的 L1Hs 插入（90/203）。大约 81%（73/90）的 L1Hs 插入位于参考组装中的内源性 LINE-1 序列内，对独特的断点连接序列的分析表明，这些 L1Hs 插入中的 63%（57/90）可以在 1000 基因组计划序列中进行基因分型。此外，我们观察到，单细胞 WGS 实验中遇到的扩增偏差导致四个个体的 NA12878 细胞之间 L1Hs 插入检测率存在广泛差异；低扩增将检测限制在 203 个插入中的 32%（65/203），而过度扩增会增加假阳性调用。总之，这些数据表明，标准的短读测序方法经常会错过 L1Hs 插入，而长读测序方法可以显著提高对个体基因组中存在的 L1Hs 插入的检测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/badd/7026601/5ec3e1ff0de9/gkz1173fig1.jpg

相似文献

Identification and characterization of occult human-specific LINE-1 insertions using long-read sequencing technology.利用长读测序技术鉴定和表征隐匿性人类特异性 LINE-1 插入。

Nucleic Acids Res. 2020 Feb 20;48(3):1146-1163. doi: 10.1093/nar/gkz1173.

Restriction Enzyme Based Enriched L1Hs Sequencing (REBELseq): A Scalable Technique for Detection of Ta Subfamily L1Hs in the Human Genome.基于限制性内切酶的长散布核元件（LINE-1 高变区）富集测序（REBELseq）：一种用于检测人类基因组中 Ta 亚家族长散布核元件的高通量技术。

G3 (Bethesda). 2020 May 4;10(5):1647-1655. doi: 10.1534/g3.119.400613.

A Method for Detection of Somatic LINE-1 Insertions at the Single-Cell Level from Postmortem Human Brain.一种从人死后脑组织中单细胞水平检测体细胞 LINE-1 插入的方法。

Methods Mol Biol. 2023;2577:147-159. doi: 10.1007/978-1-0716-2724-2_10.

Deep sequencing reveals low incidence of endogenous LINE-1 retrotransposition in human induced pluripotent stem cells.深度测序揭示人诱导多能干细胞中内源性LINE-1逆转录转座的低发生率。

PLoS One. 2014 Oct 7;9(10):e108682. doi: 10.1371/journal.pone.0108682. eCollection 2014.

Cas9 targeted enrichment of mobile elements using nanopore sequencing.利用纳米孔测序进行 Cas9 靶向富集移动元件。

Nat Commun. 2021 Jun 11;12(1):3586. doi: 10.1038/s41467-021-23918-y.

Mobile element insertions are frequent in oesophageal adenocarcinomas and can mislead paired-end sequencing analysis.移动元件插入在食管腺癌中很常见，并且可能会误导双末端测序分析。

BMC Genomics. 2015 Jul 10;16(1):473. doi: 10.1186/s12864-015-1685-z.

Novel Discovery of LINE-1 in a Korean Individual by a Target Enrichment Method.通过靶向富集方法在一名韩国个体中发现 LINE-1。

Mol Cells. 2019 Jan 31;42(1):87-95. doi: 10.14348/molcells.2018.0351. Epub 2018 Dec 6.

Somatic LINE-1 retrotransposition in cortical neurons and non-brain tissues of Rett patients and healthy individuals.雷特综合征患者和健康个体的皮质神经元和非脑组织中的体 LINE-1 反转录转座。

PLoS Genet. 2019 Apr 11;15(4):e1008043. doi: 10.1371/journal.pgen.1008043. eCollection 2019 Apr.

High-throughput sequencing reveals extensive variation in human-specific L1 content in individual human genomes.高通量测序揭示了个体人类基因组中人类特异性 L1 含量的广泛变异。

Genome Res. 2010 Sep;20(9):1262-70. doi: 10.1101/gr.106419.110. Epub 2010 May 20.

RISCI--Repeat Induced Sequence Changes Identifier: a comprehensive, comparative genomics-based, in silico subtractive hybridization pipeline to identify repeat induced sequence changes in closely related genomes.重复诱导序列变化标识符（RISCI）：一种全面的、基于比较基因组学的、计算机辅助的消减杂交技术，用于鉴定密切相关基因组中的重复诱导序列变化。

BMC Bioinformatics. 2010 Dec 26;11:609. doi: 10.1186/1471-2105-11-609.

引用本文的文献

Segmental duplication-mediated rearrangements alter the landscape of mouse genomes.节段性重复介导的重排改变了小鼠基因组的格局。

bioRxiv. 2025 Jul 22:2025.07.18.665526. doi: 10.1101/2025.07.18.665526.

Structural variation in 1,019 diverse humans based on long-read sequencing.基于长读长测序的1019名不同个体的结构变异

Nature. 2025 Jul 23. doi: 10.1038/s41586-025-09290-7.

The Somatic Mosaicism across Human Tissues Network.人类组织网络中的体细胞嵌合现象

Nature. 2025 Jul;643(8070):47-59. doi: 10.1038/s41586-025-09096-7. Epub 2025 Jul 2.

Long-read sequencing for diagnosis of genetic myopathies.用于诊断遗传性肌病的长读长测序

BMJ Neurol Open. 2025 May 11;7(1):e000990. doi: 10.1136/bmjno-2024-000990. eCollection 2025.

Transposable elements as genome regulators in normal and malignant haematopoiesis.转座元件作为正常和恶性造血过程中的基因组调节因子。

Blood Cancer J. 2025 May 6;15(1):87. doi: 10.1038/s41408-025-01295-9.

Structural features of somatic and germline retrotransposition events in humans.人类体细胞和生殖系逆转录转座事件的结构特征。

Mob DNA. 2025 Apr 22;16(1):20. doi: 10.1186/s13100-025-00357-w.

Characterisation of a LINE-1 Insertion in the Gene by Targeted Adaptive Nanopore Sequencing in a Family with Retinitis Pigmentosa.通过靶向适应性纳米孔测序对一个患有视网膜色素变性的家族中的基因中LINE-1插入进行特征分析。

Hum Mutat. 2024 Feb 9;2024:6580561. doi: 10.1155/2024/6580561. eCollection 2024.

A personalized multi-platform assessment of somatic mosaicism in the human frontal cortex.人类额叶皮质体细胞镶嵌现象的个性化多平台评估

bioRxiv. 2024 Dec 21:2024.12.18.629274. doi: 10.1101/2024.12.18.629274.

Multiple Displacement Amplification Facilitates SMRT Sequencing of Microscopic Animals and the Genome of the Gastrotrich Lepidodermella squamata (Dujardin 1841).多重置换扩增促进了微观动物的单分子实时测序以及腹毛动物鳞皮棘尾虫（杜雅尔丹，1841年）基因组的测序。

Genome Biol Evol. 2024 Dec 4;16(12). doi: 10.1093/gbe/evae254.

Repeat-Rich Regions Cause False-Positive Detection of NUMTs: A Case Study in Amphibians Using an Improved Cane Toad Reference Genome.富含重复序列区域导致 NUMTs 的假阳性检测：以改良蟾蜍参考基因组为例的两栖动物研究

Genome Biol Evol. 2024 Nov 1;16(11). doi: 10.1093/gbe/evae246.

本文引用的文献

RNA ligation precedes the retrotransposition of U6/LINE-1 chimeric RNA.RNA 连接发生在 U6/LINE-1 嵌合 RNA 逆转录转座之前。

Proc Natl Acad Sci U S A. 2019 Oct 8;116(41):20612-20622. doi: 10.1073/pnas.1805404116. Epub 2019 Sep 23.

Multi-platform discovery of haplotype-resolved structural variation in human genomes.多平台发现人类基因组中单体型分辨率结构变异。

Nat Commun. 2019 Apr 16;10(1):1784. doi: 10.1038/s41467-018-08148-z.

The Landscape of L1 Retrotransposons in the Human Genome Is Shaped by Pre-insertion Sequence Biases and Post-insertion Selection.人类基因组中 L1 反转录转座子的景观由插入前序列偏好和插入后选择形成。

Mol Cell. 2019 May 2;74(3):555-570.e7. doi: 10.1016/j.molcel.2019.02.036. Epub 2019 Apr 4.

Genome-wide de novo L1 Retrotransposition Connects Endonuclease Activity with Replication.全基因组从头 L1 反转录转座将内切酶活性与复制联系起来。

Cell. 2019 May 2;177(4):837-851.e28. doi: 10.1016/j.cell.2019.02.050. Epub 2019 Apr 4.

Genomic Analysis in the Age of Human Genome Sequencing.人类基因组测序时代的基因组分析。

Cell. 2019 Mar 21;177(1):70-84. doi: 10.1016/j.cell.2019.02.032.

Characterizing the Major Structural Variant Alleles of the Human Genome.人类基因组主要结构变异等位基因的特征。

Cell. 2019 Jan 24;176(3):663-675.e19. doi: 10.1016/j.cell.2018.12.019. Epub 2019 Jan 17.

Comparison of village dog and wolf genomes highlights the role of the neural crest in dog domestication.比较村庄狗和狼的基因组，突出了神经嵴在狗驯化中的作用。

BMC Biol. 2018 Jun 28;16(1):64. doi: 10.1186/s12915-018-0535-2.

The case for not masking away repetitive DNA.不掩盖重复DNA的理由。

Mob DNA. 2018 May 1;9:15. doi: 10.1186/s13100-018-0120-9. eCollection 2018.

Accurate detection of complex structural variations using single-molecule sequencing.利用单分子测序技术准确检测复杂结构变异。

Nat Methods. 2018 Jun;15(6):461-468. doi: 10.1038/s41592-018-0001-7. Epub 2018 Apr 30.

Spliced integrated retrotransposed element (SpIRE) formation in the human genome.人类基因组中拼接整合的反转录转座子（SpIRE）形成。

PLoS Biol. 2018 Mar 5;16(3):e2003067. doi: 10.1371/journal.pbio.2003067. eCollection 2018 Mar.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用长读测序技术鉴定和表征隐匿性人类特异性 LINE-1 插入。

Identification and characterization of occult human-specific LINE-1 insertions using long-read sequencing technology.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献