• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

cn.MOPS:一种用于在下一代测序数据中发现拷贝数变异的泊松混合模型,具有较低的假发现率。

cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate.

机构信息

Institute of Bioinformatics, Johannes Kepler University, A-4040 Linz, Austria.

出版信息

Nucleic Acids Res. 2012 May;40(9):e69. doi: 10.1093/nar/gks003. Epub 2012 Feb 1.

DOI:10.1093/nar/gks003
PMID:22302147
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3351174/
Abstract

Quantitative analyses of next-generation sequencing (NGS) data, such as the detection of copy number variations (CNVs), remain challenging. Current methods detect CNVs as changes in the depth of coverage along chromosomes. Technological or genomic variations in the depth of coverage thus lead to a high false discovery rate (FDR), even upon correction for GC content. In the context of association studies between CNVs and disease, a high FDR means many false CNVs, thereby decreasing the discovery power of the study after correction for multiple testing. We propose 'Copy Number estimation by a Mixture Of PoissonS' (cn.MOPS), a data processing pipeline for CNV detection in NGS data. In contrast to previous approaches, cn.MOPS incorporates modeling of depths of coverage across samples at each genomic position. Therefore, cn.MOPS is not affected by read count variations along chromosomes. Using a Bayesian approach, cn.MOPS decomposes variations in the depth of coverage across samples into integer copy numbers and noise by means of its mixture components and Poisson distributions, respectively. The noise estimate allows for reducing the FDR by filtering out detections having high noise that are likely to be false detections. We compared cn.MOPS with the five most popular methods for CNV detection in NGS data using four benchmark datasets: (i) simulated data, (ii) NGS data from a male HapMap individual with implanted CNVs from the X chromosome, (iii) data from HapMap individuals with known CNVs, (iv) high coverage data from the 1000 Genomes Project. cn.MOPS outperformed its five competitors in terms of precision (1-FDR) and recall for both gains and losses in all benchmark data sets. The software cn.MOPS is publicly available as an R package at http://www.bioinf.jku.at/software/cnmops/ and at Bioconductor.

摘要

下一代测序(NGS)数据的定量分析,例如拷贝数变异(CNV)的检测,仍然具有挑战性。当前的方法通过检测染色体上覆盖深度的变化来检测 CNV。因此,即使对 GC 含量进行校正,覆盖深度的技术或基因组变异也会导致高假发现率(FDR)。在 CNV 与疾病之间的关联研究中,高 FDR 意味着许多假 CNV,从而在对多重检验进行校正后降低了研究的发现能力。我们提出了“通过泊松混合进行拷贝数估计(cn.MOPS)”,这是一种用于 NGS 数据中 CNV 检测的数据处理管道。与以前的方法不同,cn.MOPS 在每个基因组位置都整合了样本之间覆盖深度的建模。因此,cn.MOPS 不受染色体上读长变化的影响。通过使用贝叶斯方法,cn.MOPS 通过其混合成分和泊松分布将样本之间覆盖深度的变化分别分解为整数拷贝数和噪声。噪声估计通过过滤掉具有高噪声的检测来降低 FDR,这些检测很可能是假检测。我们使用四个基准数据集将 cn.MOPS 与用于 NGS 数据中 CNV 检测的五种最流行的方法进行了比较:(i)模拟数据,(ii)来自具有植入 X 染色体 CNV 的男性 HapMap 个体的 NGS 数据,(iii)具有已知 CNV 的 HapMap 个体的数据,(iv)来自 1000 基因组计划的高覆盖数据。cn.MOPS 在所有基准数据集的增益和损失方面均在精度(1-FDR)和召回率方面优于其五个竞争对手。cn.MOPS 软件作为 R 包在 http://www.bioinf.jku.at/software/cnmops/ 和 Bioconductor 上公开提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8f/3351174/d2b84c79ab13/gks003f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8f/3351174/5340232ad2fd/gks003f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8f/3351174/c725e8e26ff1/gks003f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8f/3351174/b1a309956eff/gks003f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8f/3351174/77a8a95c1c1a/gks003f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8f/3351174/d2b84c79ab13/gks003f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8f/3351174/5340232ad2fd/gks003f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8f/3351174/c725e8e26ff1/gks003f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8f/3351174/b1a309956eff/gks003f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8f/3351174/77a8a95c1c1a/gks003f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb8f/3351174/d2b84c79ab13/gks003f5.jpg

相似文献

1
cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate.cn.MOPS:一种用于在下一代测序数据中发现拷贝数变异的泊松混合模型,具有较低的假发现率。
Nucleic Acids Res. 2012 May;40(9):e69. doi: 10.1093/nar/gks003. Epub 2012 Feb 1.
2
cn.FARMS: a latent variable model to detect copy number variations in microarray data with a low false discovery rate.cn.FARMS:一种用于检测微阵列数据中拷贝数变异的潜在变量模型,具有较低的假发现率。
Nucleic Acids Res. 2011 Jul;39(12):e79. doi: 10.1093/nar/gkr197. Epub 2011 Apr 12.
3
Study on detection of CNVs using human whole genome bisulfite sequencing data.利用人类全基因组亚硫酸氢盐测序数据进行 CNV 检测的研究。
Yi Chuan. 2023 Apr 20;45(4):324-340. doi: 10.16288/j.yczz.22-385.
4
panelcn.MOPS: Copy-number detection in targeted NGS panel data for clinical diagnostics.panelcn.MOPS:用于临床诊断的靶向二代测序面板数据中的拷贝数检测。
Hum Mutat. 2017 Jul;38(7):889-897. doi: 10.1002/humu.23237. Epub 2017 May 16.
5
A remark on copy number variation detection methods.关于拷贝数变异检测方法的评论。
PLoS One. 2018 Apr 27;13(4):e0196226. doi: 10.1371/journal.pone.0196226. eCollection 2018.
6
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
7
MSeq-CNV: accurate detection of Copy Number Variation from Sequencing of Multiple samples.MSeq-CNV:从多个样本测序中准确检测拷贝数变异
Sci Rep. 2018 Mar 5;8(1):4009. doi: 10.1038/s41598-018-22323-8.
8
Genome-Wide Copy Number Variation Detection Using NGS: Data Analysis and Interpretation.使用二代测序技术进行全基因组拷贝数变异检测:数据分析与解读
Methods Mol Biol. 2019;1908:113-124. doi: 10.1007/978-1-4939-9004-7_8.
9
iCopyDAV: Integrated platform for copy number variations-Detection, annotation and visualization.iCopyDAV:用于拷贝数变异检测、注释和可视化的集成平台。
PLoS One. 2018 Apr 5;13(4):e0195334. doi: 10.1371/journal.pone.0195334. eCollection 2018.
10
An evaluation of copy number variation detection tools for cancer using whole exome sequencing data.使用全外显子组测序数据对癌症拷贝数变异检测工具的评估
BMC Bioinformatics. 2017 May 31;18(1):286. doi: 10.1186/s12859-017-1705-x.

引用本文的文献

1
Structural variation in nebulin and its impact on phenotype and inheritance: establishing a dominant distal phenotype caused by large deletions.伴肌动蛋白的结构变异及其对表型和遗传的影响:确定由大片段缺失引起的显性远端表型。
Eur J Hum Genet. 2025 Jun 14. doi: 10.1038/s41431-025-01891-0.
2
Y chromosome variation and prostate cancer ancestral disparities.Y染色体变异与前列腺癌的种族差异。
iScience. 2025 Apr 15;28(5):112437. doi: 10.1016/j.isci.2025.112437. eCollection 2025 May 16.
3
Auricular malformations are driven by copy number variations in a hierarchical enhancer cluster and a dominant enhancer recapitulates human pathogenesis.

本文引用的文献

1
Exome sequencing-based copy-number variation and loss of heterozygosity detection: ExomeCNV.基于外显子组测序的拷贝数变异和杂合性丢失检测:ExomeCNV。
Bioinformatics. 2011 Oct 1;27(19):2648-54. doi: 10.1093/bioinformatics/btr462. Epub 2011 Aug 9.
2
Characterization and improvement of RNA-Seq precision in quantitative transcript expression profiling.RNA-Seq 定量转录表达谱中精度的特征描述和改进。
Bioinformatics. 2011 Jul 1;27(13):i383-91. doi: 10.1093/bioinformatics/btr247.
3
cn.FARMS: a latent variable model to detect copy number variations in microarray data with a low false discovery rate.
耳廓畸形由一个分层增强子簇中的拷贝数变异驱动,且一个显性增强子概括了人类发病机制。
Nat Commun. 2025 May 17;16(1):4598. doi: 10.1038/s41467-025-59735-w.
4
A Systematic Review of the Advances and New Insights into Copy Number Variations in Plant Genomes.植物基因组拷贝数变异研究进展与新见解的系统综述
Plants (Basel). 2025 May 6;14(9):1399. doi: 10.3390/plants14091399.
5
Human de novo mutation rates from a four-generation pedigree reference.基于一个四代家系参考得出的人类新生突变率。
Nature. 2025 Apr 23. doi: 10.1038/s41586-025-08922-2.
6
Genome Sequencing Unveils the Role of Copy Number Variants in Hearing Loss and Identifies Novel Deletions With Founder Effect in the DFNB1 Locus.基因组测序揭示拷贝数变异在听力损失中的作用,并在DFNB1位点鉴定出具有奠基者效应的新型缺失。
Hum Mutat. 2024 Aug 6;2024:9517114. doi: 10.1155/2024/9517114. eCollection 2024.
7
EMcnv: enhancing CNV detection performance through ensemble strategies with heterogeneous meta-graph neural networks.EMcnv:通过使用异构元图神经网络的集成策略提高拷贝数变异(CNV)检测性能。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf135.
8
A deficient CP24 allele defines variation for dynamic nonphotochemical quenching and photosystem II efficiency in maize.一个缺失的CP24等位基因决定了玉米中动态非光化学猝灭和光系统II效率的变异。
Plant Cell. 2025 Apr 2;37(4). doi: 10.1093/plcell/koaf063.
9
Benchmarking strategies for CNV calling from whole genome bisulfite data in humans.人类全基因组亚硫酸氢盐数据中拷贝数变异(CNV)检测的基准测试策略
Comput Struct Biotechnol J. 2025 Mar 6;27:912-919. doi: 10.1016/j.csbj.2025.02.040. eCollection 2025.
10
Truncated Variants in FAM20A and WDR72 Genes Underlie Autosomal Recessive Amelogenesis Imperfecta in Four Pakistani Families.FAM20A和WDR72基因的截短变异是四个巴基斯坦家庭常染色体隐性遗传性牙釉质发育不全的病因。
Biochem Genet. 2025 Mar 19. doi: 10.1007/s10528-025-11087-2.
cn.FARMS:一种用于检测微阵列数据中拷贝数变异的潜在变量模型,具有较低的假发现率。
Nucleic Acids Res. 2011 Jul;39(12):e79. doi: 10.1093/nar/gkr197. Epub 2011 Apr 12.
4
A framework for variation discovery and genotyping using next-generation DNA sequencing data.利用下一代 DNA 测序数据进行变异发现和基因分型的框架。
Nat Genet. 2011 May;43(5):491-8. doi: 10.1038/ng.806. Epub 2011 Apr 10.
5
Detecting common copy number variants in high-throughput sequencing data by using JointSLM algorithm.利用 JointSLM 算法检测高通量测序数据中的常见拷贝数变异。
Nucleic Acids Res. 2011 May;39(10):e65. doi: 10.1093/nar/gkr068. Epub 2011 Feb 14.
6
Initial impact of the sequencing of the human genome.人类基因组测序的初步影响。
Nature. 2011 Feb 10;470(7333):187-97. doi: 10.1038/nature09792.
7
Control-free calling of copy number alterations in deep-sequencing data using GC-content normalization.使用 GC 含量归一化控制深度测序数据中拷贝数改变的无偏调用。
Bioinformatics. 2011 Jan 15;27(2):268-9. doi: 10.1093/bioinformatics/btq635. Epub 2010 Nov 15.
8
Filtering data from high-throughput experiments based on measurement reliability.基于测量可靠性从高通量实验中筛选数据。
Proc Natl Acad Sci U S A. 2010 Nov 16;107(46):E173-4; author reply E175. doi: 10.1073/pnas.1010604107. Epub 2010 Nov 8.
9
A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。
Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.
10
SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples.从多个二倍体样本的低覆盖测序数据中进行 SNP 检测和基因分型。
Genome Res. 2011 Jun;21(6):952-60. doi: 10.1101/gr.113084.110. Epub 2010 Oct 27.