• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

检验对树结构的依赖。

Testing for dependence on tree structures.

机构信息

Department of Statistics, University of California, Berkeley, CA 94720.

Department of Statistics, University of Oxford, Oxford OX1 3LB, United Kingdom.

出版信息

Proc Natl Acad Sci U S A. 2020 May 5;117(18):9787-9792. doi: 10.1073/pnas.1912957117. Epub 2020 Apr 22.

DOI:10.1073/pnas.1912957117
PMID:32321827
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7211961/
Abstract

Tree structures, showing hierarchical relationships and the latent structures between samples, are ubiquitous in genomic and biomedical sciences. A common question in many studies is whether there is an association between a response variable measured on each sample and the latent group structure represented by some given tree. Currently, this is addressed on an ad hoc basis, usually requiring the user to decide on an appropriate number of clusters to prune out of the tree to be tested against the response variable. Here, we present a statistical method with statistical guarantees that tests for association between the response variable and a fixed tree structure across all levels of the tree hierarchy with high power while accounting for the overall false positive error rate. This enhances the robustness and reproducibility of such findings.

摘要

树状结构普遍存在于基因组学和生物医学科学中,能够显示出样本之间的层次关系和潜在结构。在许多研究中,一个常见的问题是,每个样本上测量的响应变量与给定树表示的潜在分组结构之间是否存在关联。目前,这是在特定的基础上解决的,通常需要用户决定从要测试的树中修剪出适当数量的聚类,以与响应变量进行比较。在这里,我们提出了一种具有统计保证的统计方法,可以在树层次结构的所有级别上对响应变量和固定树结构之间的关联进行测试,同时保持高功效,同时考虑到总体假阳性错误率。这增强了此类发现的稳健性和可重复性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23bf/7211961/46b21204197c/pnas.1912957117fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23bf/7211961/456350679ddb/pnas.1912957117fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23bf/7211961/46b21204197c/pnas.1912957117fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23bf/7211961/456350679ddb/pnas.1912957117fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23bf/7211961/46b21204197c/pnas.1912957117fig02.jpg

相似文献

1
Testing for dependence on tree structures.检验对树结构的依赖。
Proc Natl Acad Sci U S A. 2020 May 5;117(18):9787-9792. doi: 10.1073/pnas.1912957117. Epub 2020 Apr 22.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].[氯化乙酰甲胆碱支气管激发试验标准技术规范(2023年)]
Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247.
4
Hypotheses on a tree: new error rates and testing strategies.树上的假设:新的错误率和检验策略。
Biometrika. 2021 Sep;108(3):575-590. doi: 10.1093/biomet/asaa086. Epub 2020 Oct 14.
5
Semi-supervised adaptive-height snipping of the hierarchical clustering tree.层次聚类树的半监督自适应高度剪枝
BMC Bioinformatics. 2015 Jan 16;16(1):15. doi: 10.1186/s12859-014-0448-1.
6
A two-stage microbial association mapping framework with advanced FDR control.一种具有先进 FDR 控制的两阶段微生物关联映射框架。
Microbiome. 2018 Jul 25;6(1):131. doi: 10.1186/s40168-018-0517-1.
7
Subgroup analyses in randomised controlled trials: quantifying the risks of false-positives and false-negatives.随机对照试验中的亚组分析:量化假阳性和假阴性风险
Health Technol Assess. 2001;5(33):1-56. doi: 10.3310/hta5330.
8
Tree testing of hierarchical menu structures for health applications.针对健康应用的分层菜单结构进行树形测试。
J Biomed Inform. 2014 Jun;49:198-205. doi: 10.1016/j.jbi.2014.02.011. Epub 2014 Feb 26.
9
ProtoNet 6.0: organizing 10 million protein sequences in a compact hierarchical family tree.ProtoNet 6.0:在紧凑的层次家族树中组织 1000 万蛋白质序列。
Nucleic Acids Res. 2012 Jan;40(Database issue):D313-20. doi: 10.1093/nar/gkr1027. Epub 2011 Nov 25.
10
Erratum: High-Throughput Identification of Resistance to Pseudomonas syringae pv. Tomato in Tomato using Seedling Flood Assay.勘误:利用幼苗浸没法高通量鉴定番茄对丁香假单胞菌 pv.番茄的抗性。
J Vis Exp. 2023 Oct 18(200). doi: 10.3791/6576.

引用本文的文献

1
Kernel machine tests of association using extrinsic and intrinsic cluster evaluation metrics.基于外在和内在聚类评估指标的核机器关联检验。
PLoS Comput Biol. 2024 Nov 11;20(11):e1012524. doi: 10.1371/journal.pcbi.1012524. eCollection 2024 Nov.
2
TreeKernel: interpretable kernel machine tests for interactions between -omics and clinical predictors with applications to metabolomics and COPD phenotypes.TreeKernel:用于解释 -omics 和临床预测因子之间相互作用的可解释核机器测试,应用于代谢组学和 COPD 表型。
BMC Bioinformatics. 2023 Oct 25;24(1):398. doi: 10.1186/s12859-023-05459-x.
3
AN OMNIBUS TEST FOR DETECTION OF SUBGROUP TREATMENT EFFECTS VIA DATA PARTITIONING.

本文引用的文献

1
Multidomain analyses of a longitudinal human microbiome intestinal cleanout perturbation experiment.一项纵向人体微生物组肠道清理扰动实验的多领域分析
PLoS Comput Biol. 2017 Aug 18;13(8):e1005706. doi: 10.1371/journal.pcbi.1005706. eCollection 2017 Aug.
2
Genome-to-genome analysis highlights the effect of the human innate and adaptive immune systems on the hepatitis C virus.全基因组分析凸显了人类固有免疫和适应性免疫系统对丙型肝炎病毒的影响。
Nat Genet. 2017 May;49(5):666-673. doi: 10.1038/ng.3835. Epub 2017 Apr 10.
3
Identifying lineage effects when controlling for population structure improves power in bacterial association studies.
一种通过数据划分检测亚组治疗效果的综合检验。
Ann Appl Stat. 2022 Dec;16(4):2266-2278. doi: 10.1214/21-AOAS1589. Epub 2022 Sep 26.
4
Deconvoluting complex correlates of COVID-19 severity with a multi-omic pandemic tracking strategy.利用多组学大流行追踪策略解析 COVID-19 严重程度的复杂关联。
Nat Commun. 2022 Aug 30;13(1):5107. doi: 10.1038/s41467-022-32397-8.
5
Statistical Challenges in Tracking the Evolution of SARS-CoV-2.追踪新冠病毒进化过程中的统计挑战
Stat Sci. 2022 May;37(2):162-182. doi: 10.1214/22-sts853. Epub 2022 May 16.
6
A scalable analytical approach from bacterial genomes to epidemiology.从细菌基因组到流行病学的可扩展分析方法。
Philos Trans R Soc Lond B Biol Sci. 2022 Oct 10;377(1861):20210246. doi: 10.1098/rstb.2021.0246. Epub 2022 Aug 22.
7
Association testing for binary trees-A Markov branching process approach.基于马尔可夫分支过程的二叉树关联检验方法
Stat Med. 2022 Jun 30;41(14):2557-2573. doi: 10.1002/sim.9370. Epub 2022 Mar 9.
8
Inferring the Allelic Series at QTL in Multiparental Populations.在多亲本群体中推断 QTL 的等位基因系列。
Genetics. 2020 Dec;216(4):957-983. doi: 10.1534/genetics.120.303393. Epub 2020 Oct 20.
9
A cautionary note on the use of unsupervised machine learning algorithms to characterise malaria parasite population structure from genetic distance matrices.关于使用无监督机器学习算法从遗传距离矩阵来描述疟原虫种群结构的警示说明。
PLoS Genet. 2020 Oct 9;16(10):e1009037. doi: 10.1371/journal.pgen.1009037. eCollection 2020 Oct.
在控制群体结构时识别谱系效应可提高细菌关联研究的效能。
Nat Microbiol. 2016 Apr 4;1:16041. doi: 10.1038/nmicrobiol.2016.41.
4
Bayesian Inference of the Evolution of a Phenotype Distribution on a Phylogenetic Tree.系统发育树上表型分布演化的贝叶斯推断
Genetics. 2016 Sep;204(1):89-98. doi: 10.1534/genetics.116.190496. Epub 2016 Jul 13.
5
Stepwise Signal Extraction via Marginal Likelihood.通过边际似然进行逐步信号提取
J Am Stat Assoc. 2016;111(513):314-330. doi: 10.1080/01621459.2015.1006365. Epub 2015 Feb 6.
6
HIV epidemiology. The early spread and epidemic ignition of HIV-1 in human populations.艾滋病毒流行病学。HIV-1 在人类群体中的早期传播和流行引发。
Science. 2014 Oct 3;346(6205):56-61. doi: 10.1126/science.1256739. Epub 2014 Oct 2.
7
A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data.一种修正的贝叶斯信息准则及其在比较基因组杂交数据分析中的应用。
Biometrics. 2007 Mar;63(1):22-32. doi: 10.1111/j.1541-0420.2006.00662.x.
8
MicroRNA expression profiles classify human cancers.微小RNA表达谱可对人类癌症进行分类。
Nature. 2005 Jun 9;435(7043):834-8. doi: 10.1038/nature03702.
9
Diversity of the human intestinal microbial flora.人类肠道微生物群的多样性。
Science. 2005 Jun 10;308(5728):1635-8. doi: 10.1126/science.1110591. Epub 2005 Apr 14.
10
Language-tree divergence times support the Anatolian theory of Indo-European origin.语言树的分化时间支持印欧语系起源的安纳托利亚理论。
Nature. 2003 Nov 27;426(6965):435-9. doi: 10.1038/nature02029.