cn.FARMS: a latent variable model to detect copy number variations in microarray data with a low false discovery rate.

Affiliation

Institute of Bioinformatics, Johannes Kepler University Linz, Linz, Austria.

Publication information

Nucleic Acids Res. 2011 Jul;39(12):e79. doi: 10.1093/nar/gkr197. Epub 2011 Apr 12.

DOI:10.1093/nar/gkr197

PMID:21486749

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3130288/

Abstract

Cost-effective oligonucleotide genotyping arrays like the Affymetrix SNP 6.0 are still the predominant technique to measure DNA copy number variations (CNVs). However, CNV detection methods for microarrays overestimate both the number and the size of CNV regions and, consequently, suffer from a high false discovery rate (FDR). A high FDR means that many CNVs are wrongly detected and therefore not associated with a disease in a clinical study, though correction for multiple testing takes them into account and thereby decreases the study's discovery power. For controlling the FDR, we propose a probabilistic latent variable model, 'cn.FARMS', which is optimized by a Bayesian maximum a posteriori approach. cn.FARMS controls the FDR through the information gain of the posterior over the prior. The prior represents the null hypothesis of copy number 2 for all samples from which the posterior can only deviate by strong and consistent signals in the data. On HapMap data, cn.FARMS clearly outperformed the two most prevalent methods with respect to sensitivity and FDR. The software cn.FARMS is publicly available as an R package at http://www.bioinf.jku.at/software/cnfarms/cnfarms.html.
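
The idea of an "information gain of the posterior over the prior" can be illustrated with a generic single-factor model. The sketch below is not the cn.FARMS implementation, and it is an assumption that a Kullback-Leibler divergence between a Gaussian posterior and a standard normal prior is a fair stand-in for the paper's measure. The function names, loadings, noise variances, and probe values are all hypothetical. The model assumed here is x = loadings * z + noise with z ~ N(0, 1), where x holds centered probe intensities for one sample.

```python
import math


def posterior_of_latent(x, loadings, noise_vars):
    """Posterior N(mean, var) of z for x = loadings * z + noise, z ~ N(0, 1).

    Generic factor-analysis result with independent Gaussian noise per probe;
    illustrative only, not taken from the cn.FARMS source code.
    """
    precision = 1.0 + sum(l * l / p for l, p in zip(loadings, noise_vars))
    var = 1.0 / precision
    mean = var * sum(l * xi / p for l, xi, p in zip(loadings, x, noise_vars))
    return mean, var


def information_gain(mean, var):
    """KL divergence (in nats) of N(mean, var) from the N(0, 1) prior."""
    return 0.5 * (var + mean * mean - 1.0 - math.log(var))


# Hypothetical values: four probes in one region, equal noise variance.
noise = [0.5, 0.5, 0.5, 0.5]

# Weak, inconsistent signal: small loadings, probes disagree in sign.
weak = posterior_of_latent([0.1, -0.2, 0.1, 0.0], [0.05] * 4, noise)

# Strong, consistent signal: large loadings, probes agree.
strong = posterior_of_latent([1.1, 0.9, 1.0, 1.2], [0.9] * 4, noise)

print("weak gain:  ", information_gain(*weak))
print("strong gain:", information_gain(*strong))
```

In this toy setting the posterior stays close to the prior unless the probes carry a large, mutually consistent signal, which mirrors the abstract's statement that only strong and consistent signals move the posterior away from the null hypothesis.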

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc49/3130288/d4ec1eea8c57/gkr197f1.jpg
