Sequential Monte Carlo multiple testing.

Author information

Department of Informatics, University of Oslo, Oslo, Norway.

Publication information

Bioinformatics. 2011 Dec 1;27(23):3235-41. doi: 10.1093/bioinformatics/btr568. Epub 2011 Oct 13.

DOI:10.1093/bioinformatics/btr568

PMID:21998154

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC3223366/

Abstract

MOTIVATION

In molecular biology, as in many other scientific fields, the scale of analyses is ever increasing. Often, complex Monte Carlo simulation is required, sometimes within a large-scale multiple testing setting. The resulting computational costs may be prohibitively high.

RESULTS

We here present MCFDR, a simple, novel algorithm for false discovery rate (FDR) modulated sequential Monte Carlo (MC) multiple hypothesis testing. The algorithm iterates between adding MC samples across tests and calculating intermediate FDR values for the collection of tests. MC sampling is stopped either by sequential MC or based on a threshold on FDR. An essential property of the algorithm is that it limits the total number of MC samples whatever the number of true null hypotheses. We show on both real and simulated data that the proposed algorithm provides large gains in computational efficiency.
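The iteration described above can be illustrated with a minimal Python sketch. It is not the published implementation: the abstract does not give the details, so the names and defaults here (`mcfdr_sketch`, `draw_null`, `h`, `batch`, `max_samples`) are hypothetical, plain Benjamini-Hochberg adjustment stands in for the FDR calculation, and the p-value estimator follows the usual sequential MC convention of stopping a test once `h` null samples are at least as extreme as the observed statistic.

```python
import random


def bh_adjust(pvals):
    """Benjamini-Hochberg adjusted p-values, returned in input order."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def mcfdr_sketch(observed, draw_null, h=20, fdr_threshold=0.1,
                 batch=50, max_samples=10000, seed=0):
    """Alternate between adding MC samples and recomputing FDR values.

    observed  : list of observed test statistics (larger = more extreme)
    draw_null : hypothetical callback (test_index, rng) -> null statistic
    """
    rng = random.Random(seed)
    m = len(observed)
    exceed = [0] * m          # null samples >= observed, per test
    n = [0] * m               # MC samples drawn so far, per test
    active = set(range(m))    # tests still being sampled
    pvals, qvals = [], []
    while active:
        # Step 1: add a batch of MC samples to every active test.
        for i in sorted(active):
            for _ in range(batch):
                n[i] += 1
                if draw_null(i, rng) >= observed[i]:
                    exceed[i] += 1
                if exceed[i] >= h or n[i] >= max_samples:
                    break
            # Sequential MC stop (h exceedances), or the assumed hard cap.
            if exceed[i] >= h or n[i] >= max_samples:
                active.discard(i)
        # Step 2: intermediate p-values and FDR values for all tests.
        pvals = [exceed[i] / n[i] if exceed[i] >= h
                 else (exceed[i] + 1) / (n[i] + 1) for i in range(m)]
        qvals = bh_adjust(pvals)
        # FDR-based stop: every test still active is already below threshold.
        if all(qvals[i] <= fdr_threshold for i in active):
            break
    return pvals, qvals, n
```

In this sketch, a test that looks null stops quickly because it soon collects `h` exceedances, while a test that looks significant stops once its adjusted p-value falls below the threshold. Under these assumptions, that combination is what keeps the total sample count bounded whether most hypotheses are null or not.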

AVAILABILITY

MCFDR is implemented in the Genomic HyperBrowser (http://hyperbrowser.uio.no/mcfdr), a web-based system for genome analysis. All input data and results are available and can be reproduced through a Galaxy Pages document at: http://hyperbrowser.uio.no/mcfdr/u/sandve/p/mcfdr.

CONTACT

geirksa@ifi.uio.no.
