• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯拷贝数检测及其在大规模研究中的关联分析。

Bayesian copy number detection and association in large-scale studies.

机构信息

Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.

Department of Oncology The Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD, USA.

出版信息

BMC Cancer. 2020 Sep 7;20(1):856. doi: 10.1186/s12885-020-07304-3.

DOI:10.1186/s12885-020-07304-3
PMID:32894098
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7487704/
Abstract

BACKGROUND

Germline copy number variants (CNVs) increase risk for many diseases, yet detection of CNVs and quantifying their contribution to disease risk in large-scale studies is challenging due to biological and technical sources of heterogeneity that vary across the genome within and between samples.

METHODS

We developed an approach called CNPBayes to identify latent batch effects in genome-wide association studies involving copy number, to provide probabilistic estimates of integer copy number across the estimated batches, and to fully integrate the copy number uncertainty in the association model for disease.

RESULTS

Applying a hidden Markov model (HMM) to identify CNVs in a large multi-site Pancreatic Cancer Case Control study (PanC4) of 7598 participants, we found CNV inference was highly sensitive to technical noise that varied appreciably among participants. Applying CNPBayes to this dataset, we found that the major sources of technical variation were linked to sample processing by the centralized laboratory and not the individual study sites. Modeling the latent batch effects at each CNV region hierarchically, we developed probabilistic estimates of copy number that were directly incorporated in a Bayesian regression model for pancreatic cancer risk. Candidate associations aided by this approach include deletions of 8q24 near regulatory elements of the tumor oncogene MYC and of Tumor Suppressor Candidate 3 (TUSC3).

CONCLUSIONS

Laboratory effects may not account for the major sources of technical variation in genome-wide association studies. This study provides a robust Bayesian inferential framework for identifying latent batch effects, estimating copy number, and evaluating the role of copy number in heritable diseases.

摘要

背景

种系拷贝数变异 (CNVs) 会增加许多疾病的风险,但由于基因组内和样本间存在生物学和技术异质性,因此在大规模研究中检测 CNVs 并量化其对疾病风险的贡献具有挑战性。

方法

我们开发了一种称为 CNPBayes 的方法,用于识别涉及拷贝数的全基因组关联研究中的潜在批次效应,提供估计批次中整数拷贝数的概率估计,并在关联模型中充分整合拷贝数不确定性以研究疾病。

结果

在一个涉及 7598 名参与者的大型多站点胰腺癌病例对照研究(PanC4)中,我们应用隐马尔可夫模型 (HMM) 识别 CNV,发现 CNV 推断对技术噪声高度敏感,而技术噪声在参与者之间存在明显差异。我们将 CNPBayes 应用于该数据集,发现主要的技术变异源与集中实验室的样本处理有关,而与个别研究地点无关。对每个 CNV 区域的潜在批次效应进行分层建模,我们开发了拷贝数的概率估计值,这些值直接纳入了用于胰腺癌风险的贝叶斯回归模型。通过这种方法辅助的候选关联包括肿瘤致癌基因 MYC 和肿瘤抑制候选 3 (TUSC3) 附近的 8q24 缺失。

结论

实验室效应可能无法解释全基因组关联研究中主要的技术变异源。本研究为识别潜在批次效应、估计拷贝数以及评估拷贝数在遗传性疾病中的作用提供了稳健的贝叶斯推断框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7487704/bf79da261d27/12885_2020_7304_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7487704/871b37e2bac7/12885_2020_7304_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7487704/74663a1c4a1c/12885_2020_7304_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7487704/68ba0aaf22d6/12885_2020_7304_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7487704/bf79da261d27/12885_2020_7304_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7487704/871b37e2bac7/12885_2020_7304_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7487704/74663a1c4a1c/12885_2020_7304_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7487704/68ba0aaf22d6/12885_2020_7304_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c2/7487704/bf79da261d27/12885_2020_7304_Fig4_HTML.jpg

相似文献

1
Bayesian copy number detection and association in large-scale studies.贝叶斯拷贝数检测及其在大规模研究中的关联分析。
BMC Cancer. 2020 Sep 7;20(1):856. doi: 10.1186/s12885-020-07304-3.
2
The role of copy number variation in susceptibility to amyotrophic lateral sclerosis: genome-wide association study and comparison with published loci.拷贝数变异在肌萎缩侧索硬化易感性中的作用:全基因组关联研究及与已发表基因座的比较。
PLoS One. 2009 Dec 4;4(12):e8175. doi: 10.1371/journal.pone.0008175.
3
Genome-wide association and targeted analysis of copy number variants with psoriatic arthritis in German patients.德国患者银屑病关节炎的全基因组关联研究及拷贝数变异的靶向分析
BMC Med Genet. 2017 Aug 23;18(1):92. doi: 10.1186/s12881-017-0447-y.
4
Association test using Copy Number Profile Curves (CONCUR) enhances power in rare copy number variant analysis.使用拷贝数谱曲线(CONCUR)的关联分析增强了稀有拷贝数变异分析的效能。
PLoS Comput Biol. 2020 May 4;16(5):e1007797. doi: 10.1371/journal.pcbi.1007797. eCollection 2020 May.
5
Genome-wide algorithm for detecting CNV associations with diseases.全基因组算法检测与疾病相关的 CNV 关联。
BMC Bioinformatics. 2011 Aug 9;12:331. doi: 10.1186/1471-2105-12-331.
6
Family-Based Benchmarking of Copy Number Variation Detection Software.基于家族的拷贝数变异检测软件基准测试
PLoS One. 2015 Jul 21;10(7):e0133465. doi: 10.1371/journal.pone.0133465. eCollection 2015.
7
Detection of copy number variation from array intensity and sequencing read depth using a stepwise Bayesian model.基于逐步贝叶斯模型,利用阵列强度和测序读取深度检测拷贝数变异。
BMC Bioinformatics. 2010 Oct 31;11:539. doi: 10.1186/1471-2105-11-539.
8
Implications of germline copy-number variations in psychiatric disorders: review of large-scale genetic studies.精神障碍中胚系拷贝数变异的意义:大规模遗传学研究综述。
J Hum Genet. 2021 Jan;66(1):25-37. doi: 10.1038/s10038-020-00838-1. Epub 2020 Sep 21.
9
Genome-wide association study of copy number variation with lung function identifies a novel signal of association near BANP for forced vital capacity.拷贝数变异与肺功能的全基因组关联研究确定了一个靠近BANP的与用力肺活量相关的新关联信号。
BMC Genet. 2016 Aug 11;17(1):116. doi: 10.1186/s12863-016-0423-0.
10
Copy number variation associates with mortality in long-lived individuals: a genome-wide assessment.拷贝数变异与长寿个体的死亡率相关:一项全基因组评估。
Aging Cell. 2016 Feb;15(1):49-55. doi: 10.1111/acel.12407. Epub 2015 Oct 8.

本文引用的文献

1
Rare copy number variations affecting the synaptic gene DMXL2 in neurodevelopmental disorders.影响神经发育障碍突触基因 DMXL2 的罕见拷贝数变异。
J Neurodev Disord. 2019 Feb 7;11(1):3. doi: 10.1186/s11689-019-9263-3.
2
Neurodevelopmental disease genes implicated by de novo mutation and copy number variation morbidity.由新生突变和拷贝数变异发病率所提示的神经发育疾病基因。
Nat Genet. 2019 Jan;51(1):106-116. doi: 10.1038/s41588-018-0288-4. Epub 2018 Dec 17.
3
Comparative Analyses of Copy-Number Variation in Autism Spectrum Disorder and Schizophrenia Reveal Etiological Overlap and Biological Insights.
自闭症谱系障碍和精神分裂症的拷贝数变异比较分析揭示了病因重叠和生物学见解。
Cell Rep. 2018 Sep 11;24(11):2838-2856. doi: 10.1016/j.celrep.2018.08.022.
4
Pathogenic Germline Variants in 10,389 Adult Cancers.10389 例成年癌症中的致病变异体种系变异。
Cell. 2018 Apr 5;173(2):355-370.e14. doi: 10.1016/j.cell.2018.03.039.
5
TUSC3: functional duality of a cancer gene.TUSC3:一个癌症基因的功能双重性。
Cell Mol Life Sci. 2018 Mar;75(5):849-857. doi: 10.1007/s00018-017-2660-4. Epub 2017 Sep 19.
6
Oncogenic function of TUSC3 in non-small cell lung cancer is associated with Hedgehog signalling pathway.TUSC3 在非小细胞肺癌中的致癌功能与 Hedgehog 信号通路有关。
Biochim Biophys Acta Mol Basis Dis. 2017 Jul;1863(7):1749-1760. doi: 10.1016/j.bbadis.2017.05.005. Epub 2017 May 6.
7
Integrated Proteogenomic Characterization of Human High-Grade Serous Ovarian Cancer.人类高级别浆液性卵巢癌的综合蛋白质基因组特征分析
Cell. 2016 Jul 28;166(3):755-765. doi: 10.1016/j.cell.2016.05.069. Epub 2016 Jun 29.
8
TUSC3 promotes colorectal cancer progression and epithelial-mesenchymal transition (EMT) through WNT/β-catenin and MAPK signalling.TUSC3通过WNT/β-连环蛋白和丝裂原活化蛋白激酶(MAPK)信号传导促进结直肠癌进展和上皮-间质转化(EMT)。
J Pathol. 2016 May;239(1):60-71. doi: 10.1002/path.4697. Epub 2016 Mar 28.
9
Extensive Hidden Genomic Mosaicism Revealed in Normal Tissue.正常组织中发现广泛的隐匿性基因组镶嵌现象。
Am J Hum Genet. 2016 Mar 3;98(3):571-578. doi: 10.1016/j.ajhg.2016.02.003.
10
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.