• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GATK-gCNV 可从外显子测序数据中发现罕见的拷贝数变异。

GATK-gCNV enables the discovery of rare copy number variants from exome sequencing data.

机构信息

Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

出版信息

Nat Genet. 2023 Sep;55(9):1589-1597. doi: 10.1038/s41588-023-01449-0. Epub 2023 Aug 21.

DOI:10.1038/s41588-023-01449-0
PMID:37604963
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10904014/
Abstract

Copy number variants (CNVs) are major contributors to genetic diversity and disease. While standardized methods, such as the genome analysis toolkit (GATK), exist for detecting short variants, technical challenges have confounded uniform large-scale CNV analyses from whole-exome sequencing (WES) data. Given the profound impact of rare and de novo coding CNVs on genome organization and human disease, we developed GATK-gCNV, a flexible algorithm to discover rare CNVs from sequencing read-depth information, complete with open-source distribution via GATK. We benchmarked GATK-gCNV in 7,962 exomes from individuals in quartet families with matched genome sequencing and microarray data, finding up to 95% recall of rare coding CNVs at a resolution of more than two exons. We used GATK-gCNV to generate a reference catalog of rare coding CNVs in WES data from 197,306 individuals in the UK Biobank, and observed strong correlations between per-gene CNV rates and measures of mutational constraint, as well as rare CNV associations with multiple traits. In summary, GATK-gCNV is a tunable approach for sensitive and specific CNV discovery in WES data, with broad applications.

摘要

拷贝数变异 (CNVs) 是遗传多样性和疾病的主要贡献者。虽然存在用于检测短变体的标准化方法(如基因组分析工具包 (GATK)),但技术挑战使得从全外显子组测序 (WES) 数据进行统一的大规模 CNV 分析变得复杂。鉴于罕见和新生编码 CNVs 对基因组结构和人类疾病的深远影响,我们开发了 GATK-gCNV,这是一种从测序读深度信息中发现罕见 CNVs 的灵活算法,通过 GATK 提供开源分布。我们在具有匹配基因组测序和微阵列数据的四元家庭个体的 7962 个外显子中对 GATK-gCNV 进行了基准测试,在分辨率超过两个外显子的情况下,罕见编码 CNVs 的召回率高达 95%。我们使用 GATK-gCNV 生成了来自 UK Biobank 的 197,306 个个体的 WES 数据中罕见编码 CNVs 的参考目录,并观察到基因间 CNV 率与突变约束测量值之间存在很强的相关性,以及罕见 CNV 与多个特征之间的关联。总之,GATK-gCNV 是一种用于 WES 数据中敏感和特异性 CNV 发现的可调方法,具有广泛的应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f49a/10904014/2c4f37b9db62/nihms-1959615-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f49a/10904014/bff09bdc5f51/nihms-1959615-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f49a/10904014/c92dabb5011b/nihms-1959615-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f49a/10904014/2c4f37b9db62/nihms-1959615-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f49a/10904014/bff09bdc5f51/nihms-1959615-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f49a/10904014/c92dabb5011b/nihms-1959615-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f49a/10904014/2c4f37b9db62/nihms-1959615-f0003.jpg

相似文献

1
GATK-gCNV enables the discovery of rare copy number variants from exome sequencing data.GATK-gCNV 可从外显子测序数据中发现罕见的拷贝数变异。
Nat Genet. 2023 Sep;55(9):1589-1597. doi: 10.1038/s41588-023-01449-0. Epub 2023 Aug 21.
2
Exome copy number variant detection, analysis, and classification in a large cohort of families with undiagnosed rare genetic disease.在一大群未确诊罕见遗传病的家庭中进行外显子组拷贝数变异检测、分析和分类。
Am J Hum Genet. 2024 May 2;111(5):863-876. doi: 10.1016/j.ajhg.2024.03.008. Epub 2024 Apr 1.
3
Exome copy number variant detection, analysis and classification in a large cohort of families with undiagnosed rare genetic disease.在一大群未确诊罕见遗传病的家庭中进行外显子组拷贝数变异检测、分析和分类。
medRxiv. 2023 Oct 5:2023.10.05.23296595. doi: 10.1101/2023.10.05.23296595.
4
Exome sequence read depth methods for identifying copy number changes.用于识别拷贝数变化的外显子序列读取深度方法。
Brief Bioinform. 2015 May;16(3):380-92. doi: 10.1093/bib/bbu027. Epub 2014 Aug 28.
5
A Comparison of Tools for Copy-Number Variation Detection in Germline Whole Exome and Whole Genome Sequencing Data.种系全外显子组和全基因组测序数据中拷贝数变异检测工具的比较
Cancers (Basel). 2021 Dec 14;13(24):6283. doi: 10.3390/cancers13246283.
6
Diagnostic yield of exome sequencing-based copy number variation analysis in Mendelian disorders: a clinical application.基于外显子组测序的拷贝数变异分析在孟德尔疾病中的诊断效能:一项临床应用。
BMC Med Genomics. 2024 Sep 30;17(1):239. doi: 10.1186/s12920-024-02015-1.
7
Evaluation of somatic copy number estimation tools for whole-exome sequencing data.全外显子组测序数据的体细胞拷贝数估计工具评估
Brief Bioinform. 2016 Mar;17(2):185-92. doi: 10.1093/bib/bbv055. Epub 2015 Jul 25.
8
Accurate in silico confirmation of rare copy number variant calls from exome sequencing data using transfer learning.利用迁移学习准确地从外显子组测序数据中确认罕见拷贝数变异的调用。
Nucleic Acids Res. 2022 Nov 28;50(21):e123. doi: 10.1093/nar/gkac788.
9
Germline CNV Detection through Whole-Exome Sequencing (WES) Data Analysis Enhances Resolution of Rare Genetic Diseases.通过全外显子组测序(WES)数据分析进行胚系 CNV 检测可提高罕见遗传病的分辨率。
Genes (Basel). 2023 Jul 21;14(7):1490. doi: 10.3390/genes14071490.
10
Detection of clinically relevant copy number variants with whole-exome sequencing.全外显子测序检测临床相关拷贝数变异。
Hum Mutat. 2013 Oct;34(10):1439-48. doi: 10.1002/humu.22387. Epub 2013 Aug 30.

引用本文的文献

1
Multifocal Genomic Reconstruction Leading to Germline Structural Variants.导致种系结构变异的多灶基因组重建
Methods Mol Biol. 2025;2968:509-520. doi: 10.1007/978-1-0716-4750-9_30.
2
Genome-wide identification of the H3K27ac signals reveals key thermogenic cis-regulatory elements of brown adipose tissues in Oryctolagus cuniculus.全基因组范围内H3K27ac信号的鉴定揭示了穴兔棕色脂肪组织关键的产热顺式调控元件。
BMC Genomics. 2025 Jul 25;26(1):691. doi: 10.1186/s12864-025-11868-4.
3
Integrating Artificial Intelligence in Next-Generation Sequencing: Advances, Challenges, and Future Directions.

本文引用的文献

1
CNest: A novel copy number association discovery method uncovers 862 new associations from 200,629 whole-exome sequence datasets in the UK Biobank.CNest:一种新型的拷贝数关联发现方法,从英国生物银行的200,629个全外显子序列数据集中发现了862个新的关联。
Cell Genom. 2022 Aug 10;2(8):100167. doi: 10.1016/j.xgen.2022.100167.
2
High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios.对扩展的 1000 基因组项目队列进行高覆盖率全基因组测序,包括 602 个三核苷酸重复序列。
Cell. 2022 Sep 1;185(18):3426-3440.e19. doi: 10.1016/j.cell.2022.08.004.
3
Rare coding variation provides insight into the genetic architecture and phenotypic context of autism.
将人工智能整合到下一代测序中:进展、挑战与未来方向。
Curr Issues Mol Biol. 2025 Jun 19;47(6):470. doi: 10.3390/cimb47060470.
4
Scalable automated reanalysis of genomic data in research and clinical rare disease cohorts.研究和临床罕见病队列中基因组数据的可扩展自动化再分析
medRxiv. 2025 May 21:2025.05.19.25327921. doi: 10.1101/2025.05.19.25327921.
5
Tumor aneuploidy as a prognostic and predictive biomarker in immune checkpoint blockade.肿瘤非整倍体作为免疫检查点阻断治疗中的一种预后和预测生物标志物。
Nat Genet. 2025 Jun 30. doi: 10.1038/s41588-025-02226-x.
6
MOLGENIS VIP: an end-to-end DNA variant interpretation pipeline for research and diagnostics configurable to support rapid implementation of new methods.MOLGENIS VIP:一个用于研究和诊断的端到端DNA变异解读流程,可配置以支持新方法的快速实施。
NAR Genom Bioinform. 2025 Jun 23;7(2):lqaf087. doi: 10.1093/nargab/lqaf087. eCollection 2025 Jun.
7
Detection of Selection Signatures and Genome-Wide Association Analysis of Body Weight Traits in Xianan Cattle.湘西黄牛选择信号检测及体重性状全基因组关联分析
Genes (Basel). 2025 May 30;16(6):682. doi: 10.3390/genes16060682.
8
Combining chromosome conformation capture and exome sequencing for simultaneous detection of structural and single-nucleotide variants.结合染色体构象捕获技术和外显子组测序技术以同时检测结构变异和单核苷酸变异。
Genome Med. 2025 May 7;17(1):47. doi: 10.1186/s13073-025-01471-3.
9
Benchmarking of germline copy number variant callers from whole genome sequencing data for clinical applications.用于临床应用的全基因组测序数据中种系拷贝数变异检测工具的基准测试
Bioinform Adv. 2025 Apr 10;5(1):vbaf071. doi: 10.1093/bioadv/vbaf071. eCollection 2025.
10
Genome-wide chromatin accessibility and selective signals of meat rabbits reveal key Cis-regulatory elements and variants during postnatal development of skeletal muscles in rabbits.肉兔全基因组染色质可及性和选择信号揭示了家兔骨骼肌出生后发育过程中的关键顺式调控元件和变异。
BMC Genomics. 2025 Mar 25;26(1):296. doi: 10.1186/s12864-025-11496-y.
罕见编码变异为自闭症的遗传结构和表型背景提供了深入了解。
Nat Genet. 2022 Sep;54(9):1320-1331. doi: 10.1038/s41588-022-01104-0. Epub 2022 Aug 18.
4
A cross-disorder dosage sensitivity map of the human genome.人类基因组的跨疾病剂量敏感性图谱。
Cell. 2022 Aug 4;185(16):3041-3055.e25. doi: 10.1016/j.cell.2022.06.036. Epub 2022 Aug 1.
5
Rare coding variants in ten genes confer substantial risk for schizophrenia.十个基因中的罕见编码变异赋予精神分裂症的显著风险。
Nature. 2022 Apr;604(7906):509-516. doi: 10.1038/s41586-022-04556-w. Epub 2022 Apr 8.
6
SavvyCNV: Genome-wide CNV calling from off-target reads.SavvyCNV:从脱靶reads 进行全基因组 CNV 调用。
PLoS Comput Biol. 2022 Mar 16;18(3):e1009940. doi: 10.1371/journal.pcbi.1009940. eCollection 2022 Mar.
7
The individual and global impact of copy-number variants on complex human traits.拷贝数变异对复杂人类特征的个体和全球影响。
Am J Hum Genet. 2022 Apr 7;109(4):647-668. doi: 10.1016/j.ajhg.2022.02.010. Epub 2022 Mar 2.
8
Exome sequencing and analysis of 454,787 UK Biobank participants.外显子组测序和分析 454787 名英国生物银行参与者。
Nature. 2021 Nov;599(7886):628-634. doi: 10.1038/s41586-021-04103-z. Epub 2021 Oct 18.
9
De novo structural mutation rates and gamete-of-origin biases revealed through genome sequencing of 2,396 families.通过对 2396 个家族的基因组测序揭示新的结构突变率和配子来源偏倚。
Am J Hum Genet. 2021 Apr 1;108(4):597-607. doi: 10.1016/j.ajhg.2021.02.012. Epub 2021 Mar 5.
10
Haplotype-resolved diverse human genomes and integrated analysis of structural variation.单体型解析的多样化人类基因组和结构变异的综合分析。
Science. 2021 Apr 2;372(6537). doi: 10.1126/science.abf7117. Epub 2021 Feb 25.