• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

串联基因型:从长 DNA 读取中稳健检测串联重复扩展。

Tandem-genotypes: robust detection of tandem repeat expansions from long DNA reads.

机构信息

Department of Human Genetics, Yokohama City University Graduate School of Medicine, Fukuura 3-9, Kanazawa-ku, Yokohama, 236-0004, Japan.

Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-3-26 Aomi, Koto-ku, Tokyo, 135-0064, Japan.

出版信息

Genome Biol. 2019 Mar 19;20(1):58. doi: 10.1186/s13059-019-1667-6.

DOI:10.1186/s13059-019-1667-6
PMID:30890163
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6425644/
Abstract

Tandemly repeated DNA is highly mutable and causes at least 31 diseases, but it is hard to detect pathogenic repeat expansions genome-wide. Here, we report robust detection of human repeat expansions from careful alignments of long but error-prone (PacBio and nanopore) reads to a reference genome. Our method is robust to systematic sequencing errors, inexact repeats with fuzzy boundaries, and low sequencing coverage. By comparing to healthy controls, we prioritize pathogenic expansions within the top 10 out of 700,000 tandem repeats in whole genome sequencing data. This may help to elucidate the many genetic diseases whose causes remain unknown.

摘要

串联重复 DNA 高度易变,可导致至少 31 种疾病,但很难在全基因组范围内检测到致病性重复扩展。在这里,我们报告了一种稳健的方法,通过对长但易错的(PacBio 和纳米孔)reads 与参考基因组进行仔细比对,来检测人类重复扩展。我们的方法对系统测序错误、边界不精确的模糊重复和低测序覆盖率具有稳健性。通过与健康对照进行比较,我们在全基因组测序数据中 700000 个串联重复的前 10 个中优先考虑致病性扩展。这可能有助于阐明许多病因不明的遗传疾病。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/15ec088edd11/13059_2019_1667_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/38af290076a7/13059_2019_1667_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/2014b30f6d53/13059_2019_1667_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/025e6f3140d9/13059_2019_1667_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/938ce86f51a2/13059_2019_1667_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/569d10d30b58/13059_2019_1667_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/a67d8760ad02/13059_2019_1667_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/da9647129b47/13059_2019_1667_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/15ec088edd11/13059_2019_1667_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/38af290076a7/13059_2019_1667_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/2014b30f6d53/13059_2019_1667_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/025e6f3140d9/13059_2019_1667_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/938ce86f51a2/13059_2019_1667_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/569d10d30b58/13059_2019_1667_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/a67d8760ad02/13059_2019_1667_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/da9647129b47/13059_2019_1667_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f61/6425644/15ec088edd11/13059_2019_1667_Fig8_HTML.jpg

相似文献

1
Tandem-genotypes: robust detection of tandem repeat expansions from long DNA reads.串联基因型:从长 DNA 读取中稳健检测串联重复扩展。
Genome Biol. 2019 Mar 19;20(1):58. doi: 10.1186/s13059-019-1667-6.
2
LongTR: genome-wide profiling of genetic variation at tandem repeats from long reads.LongTR:从长读段中进行串联重复的全基因组遗传变异分析。
Genome Biol. 2024 Jul 4;25(1):176. doi: 10.1186/s13059-024-03319-2.
3
Long-read sequencing identified intronic repeat expansions in from Chinese pedigrees affected with familial cortical myoclonic tremor with epilepsy.长读测序鉴定了来自中国家族性皮质肌阵挛性震颤伴癫痫家系的突变基因中的内含子重复扩展。
J Med Genet. 2019 Apr;56(4):265-270. doi: 10.1136/jmedgenet-2018-105484. Epub 2018 Sep 7.
4
Analysis of Tandem Repeat Expansions Using Long DNA Reads.利用长 DNA 读取分析串联重复扩展。
Methods Mol Biol. 2023;2632:147-159. doi: 10.1007/978-1-0716-2996-3_11.
5
Long-read sequencing across the C9orf72 'GGGGCC' repeat expansion: implications for clinical use and genetic discovery efforts in human disease.长读测序技术在 C9orf72“GGGGCC”重复扩增中的应用:对人类疾病临床应用和遗传发现的影响。
Mol Neurodegener. 2018 Aug 21;13(1):46. doi: 10.1186/s13024-018-0274-4.
6
REViewer: haplotype-resolved visualization of read alignments in and around tandem repeats.REViewer:串联重复序列及其附近读取比对的单倍型解析可视化。
Genome Med. 2022 Aug 11;14(1):84. doi: 10.1186/s13073-022-01085-z.
7
RF: a method for filtering short reads with tandem repeats for genome mapping.RF:一种用于基因组图谱构建的带有串联重复的短读过滤方法。
Genomics. 2013 Jul;102(1):35-7. doi: 10.1016/j.ygeno.2013.03.002. Epub 2013 Mar 29.
8
TRiCoLOR: tandem repeat profiling using whole-genome long-read sequencing data.TRiCoLOR:使用全基因组长读测序数据进行串联重复分析。
Gigascience. 2020 Oct 7;9(10). doi: 10.1093/gigascience/giaa101.
9
Detecting a long insertion variant in SAMD12 by SMRT sequencing: implications of long-read whole-genome sequencing for repeat expansion diseases.通过 SMRT 测序检测 SAMD12 中的长插入变异:长读长全基因组测序对重复扩展疾病的影响。
J Hum Genet. 2019 Mar;64(3):191-197. doi: 10.1038/s10038-018-0551-7. Epub 2018 Dec 17.
10
Complete sequencing of expanded SAMD12 repeats by long-read sequencing and Cas9-mediated enrichment.通过长读测序和 Cas9 介导的富集完成 SAMD12 重复序列的完整测序。
Brain. 2021 May 7;144(4):1103-1117. doi: 10.1093/brain/awab021.

引用本文的文献

1
A practical guide to identifying associations between tandem repeats and complex human traits using consensus genotypes from multiple tools.利用多种工具的一致性基因型识别串联重复序列与复杂人类性状之间关联的实用指南。
Nat Protoc. 2025 Sep 1. doi: 10.1038/s41596-025-01231-y.
2
TRsv: simultaneous detection of tandem repeat variations, structural variations, and short indels using long read sequencing data.TRsv:利用长读长测序数据同时检测串联重复变异、结构变异和短插入缺失变异
Genome Biol. 2025 Aug 20;26(1):246. doi: 10.1186/s13059-025-03718-z.
3
Topsicle: a method for estimating telomere length from whole genome long-read sequencing data.

本文引用的文献

1
Long-read sequencing identifies GGC repeat expansions in NOTCH2NLC associated with neuronal intranuclear inclusion disease.长读测序鉴定出 NOTCH2NLC 中的 GGC 重复扩展与神经元核内包涵体病有关。
Nat Genet. 2019 Aug;51(8):1215-1221. doi: 10.1038/s41588-019-0459-y. Epub 2019 Jul 22.
2
Structural variants identified by Oxford Nanopore PromethION sequencing of the human genome.牛津纳米孔 PromethION 测序人类基因组鉴定的结构变异。
Genome Res. 2019 Jul;29(7):1178-1187. doi: 10.1101/gr.244939.118. Epub 2019 Jun 11.
3
Detecting a long insertion variant in SAMD12 by SMRT sequencing: implications of long-read whole-genome sequencing for repeat expansion diseases.
Topsicle:一种从全基因组长读测序数据估计端粒长度的方法。
bioRxiv. 2025 Jul 15:2025.07.10.664126. doi: 10.1101/2025.07.10.664126.
4
Intronic hexanucleotide repeat expansion in in monozygotic twins with congenital progressive universal melanosis.先天性进行性全身性黑素沉着症单卵双胞胎中的内含子六核苷酸重复序列扩增。
Biomed Rep. 2025 Jun 12;23(2):138. doi: 10.3892/br.2025.2016. eCollection 2025 Aug.
5
Comprehensive Characterisation of the RFC1 Repeat in an Australian Cohort.澳大利亚队列中 RFC1 重复序列的综合特征分析
Cerebellum. 2025 Jun 7;24(4):111. doi: 10.1007/s12311-025-01867-2.
6
Association of human-specific expanded short tandem repeats with neuron-specific regulatory features.人类特异性扩增短串联重复序列与神经元特异性调控特征的关联。
Sci Adv. 2025 May 30;11(22):eadp9707. doi: 10.1126/sciadv.adp9707.
7
Modelling Population Genetic Screening in Rare Neurodegenerative Diseases.罕见神经退行性疾病的群体基因筛查建模
Biomedicines. 2025 Apr 23;13(5):1018. doi: 10.3390/biomedicines13051018.
8
Validation of a comprehensive long-read sequencing platform for broad clinical genetic diagnosis.用于广泛临床基因诊断的综合长读长测序平台的验证
Front Genet. 2025 May 2;16:1499456. doi: 10.3389/fgene.2025.1499456. eCollection 2025.
9
TRACKing tandem repeats: a customizable pipeline for identification and cross-species comparison.追踪串联重复序列:用于识别和跨物种比较的可定制流程
Bioinform Adv. 2025 Apr 8;5(1):vbaf066. doi: 10.1093/bioadv/vbaf066. eCollection 2025.
10
Aberrant Short Tandem Repeats: Pathogenicity, Mechanisms, Detection, and Roles in Neuropsychiatric Disorders.异常短串联重复序列:致病性、机制、检测及其在神经精神疾病中的作用
Genes (Basel). 2025 Mar 30;16(4):406. doi: 10.3390/genes16040406.
通过 SMRT 测序检测 SAMD12 中的长插入变异:长读长全基因组测序对重复扩展疾病的影响。
J Hum Genet. 2019 Mar;64(3):191-197. doi: 10.1038/s10038-018-0551-7. Epub 2018 Dec 17.
4
Single-Molecule Sequencing: Towards Clinical Applications.单分子测序:迈向临床应用。
Trends Biotechnol. 2019 Jan;37(1):72-85. doi: 10.1016/j.tibtech.2018.07.013. Epub 2018 Aug 13.
5
Detailed analysis of HTT repeat elements in human blood using targeted amplification-free long-read sequencing.利用靶向扩增免费长读测序技术对人血中的 HTT 重复元件进行详细分析。
Hum Mutat. 2018 Sep;39(9):1262-1272. doi: 10.1002/humu.23580. Epub 2018 Jul 12.
6
Accurate detection of complex structural variations using single-molecule sequencing.利用单分子测序技术准确检测复杂结构变异。
Nat Methods. 2018 Jun;15(6):461-468. doi: 10.1038/s41592-018-0001-7. Epub 2018 Apr 30.
7
Expansions of intronic TTTCA and TTTTA repeats in benign adult familial myoclonic epilepsy.良性成人家族性肌阵挛性癫痫中内含子 TTTCA 和 TTTTA 重复的扩展。
Nat Genet. 2018 Apr;50(4):581-590. doi: 10.1038/s41588-018-0067-2. Epub 2018 Mar 5.
8
Nanopore sequencing and assembly of a human genome with ultra-long reads.纳米孔测序和超长读长组装人类基因组。
Nat Biotechnol. 2018 Apr;36(4):338-345. doi: 10.1038/nbt.4060. Epub 2018 Jan 29.
9
A survey of localized sequence rearrangements in human DNA.人类 DNA 局部序列重排的调查。
Nucleic Acids Res. 2018 Feb 28;46(4):1661-1673. doi: 10.1093/nar/gkx1266.
10
Mapping and phasing of structural variation in patient genomes using nanopore sequencing.使用纳米孔测序对患者基因组中的结构变异进行图谱绘制和相位分析。
Nat Commun. 2017 Nov 6;8(1):1326. doi: 10.1038/s41467-017-01343-4.