• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于组学信息的拷贝数变异(CNV)检测可降低假阳性率并提高检测CNV与性状关联的效能。

Omics-informed CNV calls reduce false-positive rates and improve power for CNV-trait associations.

作者信息

Lepamets Maarja, Auwerx Chiara, Nõukas Margit, Claringbould Annique, Porcu Eleonora, Kals Mart, Jürgenson Tuuli, Morris Andrew Paul, Võsa Urmo, Bochud Murielle, Stringhini Silvia, Wijmenga Cisca, Franke Lude, Peterson Hedi, Vilo Jaak, Lepik Kaido, Mägi Reedik, Kutalik Zoltán

机构信息

Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu 51010, Estonia.

Institute of Molecular and Cell Biology, University of Tartu, Tartu 51010, Estonia.

出版信息

HGG Adv. 2022 Aug 1;3(4):100133. doi: 10.1016/j.xhgg.2022.100133. eCollection 2022 Oct 13.

DOI:10.1016/j.xhgg.2022.100133
PMID:36035246
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9399386/
Abstract

Copy-number variations (CNV) are believed to play an important role in a wide range of complex traits, but discovering such associations remains challenging. While whole-genome sequencing (WGS) is the gold-standard approach for CNV detection, there are several orders of magnitude more samples with available genotyping microarray data. Such array data can be exploited for CNV detection using dedicated software (e.g., PennCNV); however, these calls suffer from elevated false-positive and -negative rates. In this study, we developed a CNV quality score that weights PennCNV calls (pCNVs) based on their likelihood of being true positive. First, we established a measure of pCNV reliability by leveraging evidence from multiple omics data (WGS, transcriptomics, and methylomics) obtained from the same samples. Next, we built a predictor of omics-confirmed pCNVs, termed omics-informed quality score (OQS), using only PennCNV software output parameters. Promisingly, OQS assigned to pCNVs detected in close family members was up to 35% higher than the OQS of pCNVs not carried by other relatives (p < 3.0 × 10), outperforming other scores. Finally, in an association study of four anthropometric traits in 89,516 Estonian Biobank samples, the use of OQS led to a relative increase in the trait variance explained by CNVs of up to 56% compared with published quality filtering methods or scores. Overall, we put forward a flexible framework to improve any CNV detection method leveraging multi-omics evidence, applied it to improve PennCNV calls, and demonstrated its utility by improving the statistical power for downstream association analyses.

摘要

拷贝数变异(CNV)被认为在广泛的复杂性状中起重要作用,但发现此类关联仍然具有挑战性。虽然全基因组测序(WGS)是CNV检测的金标准方法,但有可用基因分型微阵列数据的样本数量要多几个数量级。此类阵列数据可使用专用软件(如PennCNV)用于CNV检测;然而,这些调用存在较高的假阳性和假阴性率。在本研究中,我们开发了一种CNV质量评分,根据其为真阳性的可能性对PennCNV调用(pCNV)进行加权。首先,我们通过利用从相同样本获得的多种组学数据(WGS、转录组学和甲基组学)的证据,建立了一种pCNV可靠性的度量。接下来,我们仅使用PennCNV软件输出参数构建了一个经组学确认的pCNV预测器,称为组学知情质量评分(OQS)。很有前景的是,分配给在近亲中检测到的pCNV的OQS比其他亲属未携带的pCNV的OQS高出多达35%(p < 3.0×10),优于其他评分。最后,在对89516个爱沙尼亚生物银行样本的四项人体测量性状的关联研究中,与已发表的质量过滤方法或评分相比,使用OQS导致CNV解释的性状方差相对增加高达56%。总体而言,我们提出了一个灵活的框架,以利用多组学证据改进任何CNV检测方法,将其应用于改进PennCNV调用,并通过提高下游关联分析的统计效力证明了其效用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/063e/9399386/a9783f0f2803/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/063e/9399386/7e34aaaf0117/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/063e/9399386/0ed095107a7e/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/063e/9399386/4b4518500c09/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/063e/9399386/a9783f0f2803/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/063e/9399386/7e34aaaf0117/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/063e/9399386/0ed095107a7e/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/063e/9399386/4b4518500c09/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/063e/9399386/a9783f0f2803/gr4.jpg

相似文献

1
Omics-informed CNV calls reduce false-positive rates and improve power for CNV-trait associations.基于组学信息的拷贝数变异(CNV)检测可降低假阳性率并提高检测CNV与性状关联的效能。
HGG Adv. 2022 Aug 1;3(4):100133. doi: 10.1016/j.xhgg.2022.100133. eCollection 2022 Oct 13.
2
A novel scatterplot-based method to detect copy number variation (CNV).一种基于散点图的新型拷贝数变异(CNV)检测方法。
Front Genet. 2023 Jul 6;14:1166972. doi: 10.3389/fgene.2023.1166972. eCollection 2023.
3
Genome-wide algorithm for detecting CNV associations with diseases.全基因组算法检测与疾病相关的 CNV 关联。
BMC Bioinformatics. 2011 Aug 9;12:331. doi: 10.1186/1471-2105-12-331.
4
Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform.用于评估 Affymetrix 6.0 SNP 阵列平台的基因组拷贝数变异的软件比较。
BMC Bioinformatics. 2011 May 31;12:220. doi: 10.1186/1471-2105-12-220.
5
New quality measure for SNP array based CNV detection.基于 SNP 芯片的 CNV 检测的新质量度量。
Bioinformatics. 2016 Nov 1;32(21):3298-3305. doi: 10.1093/bioinformatics/btw477. Epub 2016 Jul 10.
6
Identification of Copy Number Variants from SNP Arrays Using PennCNV.使用PennCNV从SNP阵列中鉴定拷贝数变异
Methods Mol Biol. 2018;1833:1-28. doi: 10.1007/978-1-4939-8666-8_1.
7
Using family data as a verification standard to evaluate copy number variation calling strategies for genetic association studies.利用家系数据作为验证标准,评估遗传关联研究中拷贝数变异 calling 策略。
Genet Epidemiol. 2012 Apr;36(3):253-62. doi: 10.1002/gepi.21618.
8
Copy number variant and runs of homozygosity detection by microarrays enabled more precise molecular diagnoses in 11,020 clinical exome cases.微阵列技术可检测拷贝数变异和纯合性运行,从而使 11020 例临床外显子组病例的分子诊断更为精确。
Genome Med. 2019 May 17;11(1):30. doi: 10.1186/s13073-019-0639-5.
9
Copy number variation genotyping using family information.基于家系信息的拷贝数变异基因分型。
BMC Bioinformatics. 2013 May 9;14:157. doi: 10.1186/1471-2105-14-157.
10
Accuracy of CNV Detection from GWAS Data.从 GWAS 数据中检测 CNV 的准确性。
PLoS One. 2011 Jan 13;6(1):e14511. doi: 10.1371/journal.pone.0014511.

引用本文的文献

1
Plasma metabolomic signatures for copy number variants and COVID-19 risk loci in Northern Finland populations.芬兰北部人群中拷贝数变异和新冠病毒疾病风险位点的血浆代谢组学特征
Sci Rep. 2025 Apr 16;15(1):13172. doi: 10.1038/s41598-025-94839-9.
2
The Estonian Biobank's journey from biobanking to personalized medicine.爱沙尼亚生物银行从生物样本库到个性化医疗的历程。
Nat Commun. 2025 Apr 5;16(1):3270. doi: 10.1038/s41467-025-58465-3.
3
Genome-Wide Scan for Copy Number Variations in Chinese Merino Sheep Based on Ovine High-Density 600K SNP Arrays.

本文引用的文献

1
Influences of rare copy-number variation on human complex traits.稀有拷贝数变异对人类复杂特征的影响。
Cell. 2022 Oct 27;185(22):4233-4248.e27. doi: 10.1016/j.cell.2022.09.028.
2
The individual and global impact of copy-number variants on complex human traits.拷贝数变异对复杂人类特征的个体和全球影响。
Am J Hum Genet. 2022 Apr 7;109(4):647-668. doi: 10.1016/j.ajhg.2022.02.010. Epub 2022 Mar 2.
3
Gene regulation contributes to explain the impact of early life socioeconomic disadvantage on adult inflammatory levels in two cohort studies.
基于绵羊高密度600K SNP芯片的中国美利奴羊拷贝数变异全基因组扫描
Animals (Basel). 2024 Oct 8;14(19):2897. doi: 10.3390/ani14192897.
4
Genome-wide association testing beyond SNPs.超越单核苷酸多态性的全基因组关联测试。
Nat Rev Genet. 2025 Mar;26(3):156-170. doi: 10.1038/s41576-024-00778-y. Epub 2024 Oct 7.
5
Genetic determinants of plasma protein levels in the Estonian population.爱沙尼亚人群血浆蛋白水平的遗传决定因素。
Sci Rep. 2024 Apr 2;14(1):7694. doi: 10.1038/s41598-024-57966-3.
6
Identification of copy number variations in the genome of Dairy Gir cattle.鉴定奶牛基因组中的拷贝数变异。
PLoS One. 2023 Apr 10;18(4):e0284085. doi: 10.1371/journal.pone.0284085. eCollection 2023.
基因调控有助于解释两个队列研究中,早期生活社会经济劣势对成年人炎症水平的影响。
Sci Rep. 2021 Feb 4;11(1):3100. doi: 10.1038/s41598-021-82714-2.
4
Genetics of 35 blood and urine biomarkers in the UK Biobank.英国生物库中 35 项血液和尿液生物标志物的遗传学研究
Nat Genet. 2021 Feb;53(2):185-194. doi: 10.1038/s41588-020-00757-z. Epub 2021 Jan 18.
5
The contribution of CNVs to the most common aging-related neurodegenerative diseases.CNVs 对最常见的与年龄相关的神经退行性疾病的贡献。
Aging Clin Exp Res. 2021 May;33(5):1187-1195. doi: 10.1007/s40520-020-01485-4. Epub 2020 Feb 6.
6
Rare copy number variants in over 100,000 European ancestry subjects reveal multiple disease associations.在超过 10 万欧洲血统个体中罕见的拷贝数变异揭示了多种疾病的关联。
Nat Commun. 2020 Jan 14;11(1):255. doi: 10.1038/s41467-019-13624-1.
7
Phenome-wide Burden of Copy-Number Variation in the UK Biobank.英国生物库中拷贝数变异的表型全基因组负担
Am J Hum Genet. 2019 Aug 1;105(2):373-383. doi: 10.1016/j.ajhg.2019.07.001. Epub 2019 Jul 25.
8
Medical consequences of pathogenic CNVs in adults: analysis of the UK Biobank.成人致病性 CNV 的医学后果:英国生物银行分析。
J Med Genet. 2019 Mar;56(3):131-138. doi: 10.1136/jmedgenet-2018-105477. Epub 2018 Oct 20.
9
The UK Biobank resource with deep phenotyping and genomic data.英国生物银行资源库,具有深度表型和基因组数据。
Nature. 2018 Oct;562(7726):203-209. doi: 10.1038/s41586-018-0579-z. Epub 2018 Oct 10.
10
Efficient analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnpr.使用两个 R 包:bigstatsr 和 bigsnpr,高效分析大规模全基因组数据。
Bioinformatics. 2018 Aug 15;34(16):2781-2787. doi: 10.1093/bioinformatics/bty185.