• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

发现并分析一整套结构变异和短串联重复序列。

Discovery and quality analysis of a comprehensive set of structural variants and short tandem repeats.

机构信息

Biomedical Sciences Graduate Program, University of California San Diego, La Jolla, CA, 92093-0419, USA.

Department of Biomedical Informatics, University of California San Diego, La Jolla, CA, 92093-0419, USA.

出版信息

Nat Commun. 2020 Jun 10;11(1):2928. doi: 10.1038/s41467-020-16481-5.

DOI:10.1038/s41467-020-16481-5
PMID:32522985
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7287045/
Abstract

Structural variants (SVs) and short tandem repeats (STRs) are important sources of genetic diversity but are not routinely analyzed in genetic studies because they are difficult to accurately identify and genotype. Because SVs and STRs range in size and type, it is necessary to apply multiple algorithms that incorporate different types of evidence from sequencing data and employ complex filtering strategies to discover a comprehensive set of high-quality and reproducible variants. Here we assemble a set of 719 deep whole genome sequencing (WGS) samples (mean 42×) from 477 distinct individuals which we use to discover and genotype a wide spectrum of SV and STR variants using five algorithms. We use 177 unique pairs of genetic replicates to identify factors that affect variant call reproducibility and develop a systematic filtering strategy to create of one of the most complete and well characterized maps of SVs and STRs to date.

摘要

结构变异(SV)和短串联重复(STR)是遗传多样性的重要来源,但由于难以准确识别和基因分型,通常不在遗传研究中进行分析。由于 SV 和 STR 的大小和类型不同,因此需要应用多种算法,这些算法结合了测序数据中不同类型的证据,并采用复杂的过滤策略来发现一套全面的高质量且可重复的变体。在这里,我们组装了一组来自 477 个不同个体的 719 个深度全基因组测序(WGS)样本(平均 42×),我们使用这组样本,通过五种算法来发现和基因分型各种 SV 和 STR 变体。我们使用 177 对独特的遗传重复样本,来确定影响变体调用可重复性的因素,并开发了一种系统的过滤策略,以创建迄今为止最完整和特征描述最好的 SV 和 STR 图谱之一。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d43/7287045/50958d325acb/41467_2020_16481_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d43/7287045/f58872b89873/41467_2020_16481_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d43/7287045/95d1a99d859c/41467_2020_16481_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d43/7287045/42a7041cf5d6/41467_2020_16481_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d43/7287045/a28bce781ec7/41467_2020_16481_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d43/7287045/50958d325acb/41467_2020_16481_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d43/7287045/f58872b89873/41467_2020_16481_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d43/7287045/95d1a99d859c/41467_2020_16481_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d43/7287045/42a7041cf5d6/41467_2020_16481_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d43/7287045/a28bce781ec7/41467_2020_16481_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d43/7287045/50958d325acb/41467_2020_16481_Fig5_HTML.jpg

相似文献

1
Discovery and quality analysis of a comprehensive set of structural variants and short tandem repeats.发现并分析一整套结构变异和短串联重复序列。
Nat Commun. 2020 Jun 10;11(1):2928. doi: 10.1038/s41467-020-16481-5.
2
STRAS:a snakemake pipeline for genome-wide short tandem repeats annotation and score.STRAS:一个用于全基因组短串联重复序列注释和评分的 SnakeMake 管道。
Hum Genet. 2024 Jun;143(6):735-738. doi: 10.1007/s00439-024-02662-5. Epub 2024 Mar 20.
3
From cytogenetics to cytogenomics: whole-genome sequencing as a first-line test comprehensively captures the diverse spectrum of disease-causing genetic variation underlying intellectual disability.从细胞遗传学到细胞基因组学:全基因组测序作为一线检测手段,全面捕捉到导致智力障碍的遗传变异的多样谱。
Genome Med. 2019 Nov 7;11(1):68. doi: 10.1186/s13073-019-0675-1.
4
Genome-wide sequencing as a first-tier screening test for short tandem repeat expansions.全基因组测序作为短串联重复扩展的一线筛查试验。
Genome Med. 2021 Aug 9;13(1):126. doi: 10.1186/s13073-021-00932-9.
5
Efficient acquisition of tens of thousands of short tandem repeats in single-cell whole-genome-amplified DNA.单细胞全基因组扩增 DNA 中数万短串联重复序列的高效获取。
STAR Protoc. 2021 Sep 16;2(4):100828. doi: 10.1016/j.xpro.2021.100828. eCollection 2021 Dec 17.
6
STRScan: targeted profiling of short tandem repeats in whole-genome sequencing data.STRScan:全基因组测序数据中短串联重复序列的靶向分析
BMC Bioinformatics. 2017 Oct 3;18(Suppl 11):398. doi: 10.1186/s12859-017-1800-z.
7
Multi-platform discovery of haplotype-resolved structural variation in human genomes.多平台发现人类基因组中单体型分辨率结构变异。
Nat Commun. 2019 Apr 16;10(1):1784. doi: 10.1038/s41467-018-08148-z.
8
Population-Scale Sequencing Data Enable Precise Estimates of Y-STR Mutation Rates.群体规模测序数据有助于精确估计Y染色体短串联重复序列(Y-STR)突变率。
Am J Hum Genet. 2016 May 5;98(5):919-933. doi: 10.1016/j.ajhg.2016.04.001. Epub 2016 Apr 25.
9
Sequencing and characterizing short tandem repeats in the human genome.对人类基因组中的短串联重复序列进行测序和特征分析。
Nat Rev Genet. 2024 Jul;25(7):460-475. doi: 10.1038/s41576-024-00692-3. Epub 2024 Feb 16.
10
A game of hide and seq: Identification of parallel Y-STR evolution in deep-rooting pedigrees.躲猫猫游戏:在溯源深远的家系中鉴定平行 Y-STR 进化。
Eur J Hum Genet. 2019 Apr;27(4):637-646. doi: 10.1038/s41431-018-0312-2. Epub 2018 Dec 20.

引用本文的文献

1
Accurate, Scalable Structural Variant Genotyping in Complex Genomes at Population Scales.群体规模下复杂基因组中准确、可扩展的结构变异基因分型
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf180.
2
Genome graphs reveal the importance of structural variation in evolution and drug resistance.基因组图谱揭示了结构变异在进化和耐药性中的重要性。
bioRxiv. 2025 May 7:2025.05.07.652570. doi: 10.1101/2025.05.07.652570.
3
Multiomic QTL mapping reveals phenotypic complexity of GWAS loci and prioritizes putative causal variants.多组学QTL定位揭示了全基因组关联研究(GWAS)位点的表型复杂性,并对潜在的因果变异进行了优先级排序。

本文引用的文献

1
Properties of structural variants and short tandem repeats associated with gene expression and complex traits.与基因表达和复杂性状相关的结构变异和短串联重复序列的特性。
Nat Commun. 2020 Jun 10;11(1):2927. doi: 10.1038/s41467-020-16482-4.
2
Comprehensive evaluation of structural variation detection algorithms for whole genome sequencing.全基因组测序结构变异检测算法的综合评估。
Genome Biol. 2019 Jun 3;20(1):117. doi: 10.1186/s13059-019-1720-5.
3
Multi-platform discovery of haplotype-resolved structural variation in human genomes.
Cell Genom. 2025 Mar 12;5(3):100775. doi: 10.1016/j.xgen.2025.100775. Epub 2025 Feb 21.
4
Diversity and consequences of structural variation in the human genome.人类基因组结构变异的多样性及其影响
Nat Rev Genet. 2025 Jan 21. doi: 10.1038/s41576-024-00808-9.
5
GREGoR: Accelerating Genomics for Rare Diseases.GREGoR:加速罕见病基因组学研究
ArXiv. 2024 Dec 18:arXiv:2412.14338v1.
6
Mapping crossover events of mouse meiotic recombination by restriction fragment ligation-based Refresh-seq.通过基于限制性片段连接的Refresh-seq技术对小鼠减数分裂重组的交叉事件进行定位。
Cell Discov. 2024 Mar 5;10(1):26. doi: 10.1038/s41421-023-00638-9.
7
Sequencing and characterizing short tandem repeats in the human genome.对人类基因组中的短串联重复序列进行测序和特征分析。
Nat Rev Genet. 2024 Jul;25(7):460-475. doi: 10.1038/s41576-024-00692-3. Epub 2024 Feb 16.
8
Detection of trait-associated structural variations using short-read sequencing.利用短读长测序检测性状相关的结构变异
Cell Genom. 2023 May 18;3(6):100328. doi: 10.1016/j.xgen.2023.100328. eCollection 2023 Jun 14.
9
Characterization of genome-wide STR variation in 6487 human genomes.6487 个人类基因组中全基因组 STR 变异的特征。
Nat Commun. 2023 Apr 12;14(1):2092. doi: 10.1038/s41467-023-37690-8.
10
Genomic structural variation: A complex but important driver of human evolution.基因组结构变异:人类进化的复杂但重要的驱动因素。
Am J Biol Anthropol. 2023 Aug;181 Suppl 76(Suppl 76):118-144. doi: 10.1002/ajpa.24713. Epub 2023 Feb 16.
多平台发现人类基因组中单体型分辨率结构变异。
Nat Commun. 2019 Apr 16;10(1):1784. doi: 10.1038/s41467-018-08148-z.
4
Genome maps across 26 human populations reveal population-specific patterns of structural variation.26 个人类群体的基因组图谱揭示了结构变异的群体特异性模式。
Nat Commun. 2019 Mar 4;10(1):1025. doi: 10.1038/s41467-019-08992-7.
5
Characterizing the Major Structural Variant Alleles of the Human Genome.人类基因组主要结构变异等位基因的特征。
Cell. 2019 Jan 24;176(3):663-675.e19. doi: 10.1016/j.cell.2018.12.019. Epub 2019 Jan 17.
6
Insights into the Mutational Burden of Human Induced Pluripotent Stem Cells from an Integrative Multi-Omics Approach.从综合多组学方法看人类诱导多能干细胞的突变负担。
Cell Rep. 2018 Jul 24;24(4):883-894. doi: 10.1016/j.celrep.2018.06.091.
7
Paternally inherited cis-regulatory structural variants are associated with autism.父系遗传的顺式调控结构变异与自闭症有关。
Science. 2018 Apr 20;360(6386):327-331. doi: 10.1126/science.aan2261.
8
FusorSV: an algorithm for optimally combining data from multiple structural variation detection methods. FusorSV:一种用于最优组合来自多种结构变异检测方法的数据的算法。
Genome Biol. 2018 Mar 20;19(1):38. doi: 10.1186/s13059-018-1404-6.
9
Trinucleotide repeat disorders.三核苷酸重复序列疾病
Handb Clin Neurol. 2017;145:383-391. doi: 10.1016/B978-0-12-802395-2.00027-4.
10
The Mobile Element Locator Tool (MELT): population-scale mobile element discovery and biology.移动元件定位工具(MELT):大规模移动元件发现与生物学。
Genome Res. 2017 Nov;27(11):1916-1929. doi: 10.1101/gr.218032.116. Epub 2017 Aug 30.