• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

处理组学分析中的混杂因素。

Dealing with Confounders in Omics Analysis.

机构信息

School of Biological Sciences, Nanyang Technological University, 637551, Singapore.

Department of Computer Science, National University of Singapore, 117417, Singapore; Department of Pathology, National University of Singapore, 119074, Singapore.

出版信息

Trends Biotechnol. 2018 May;36(5):488-498. doi: 10.1016/j.tibtech.2018.01.013. Epub 2018 Feb 20.

DOI:10.1016/j.tibtech.2018.01.013
PMID:29475622
Abstract

The Anna Karenina effect is a manifestation of the theory-practice gap that exists when theoretical statistics are applied on real-world data. In the course of analyzing biological data for differential features such as genes or proteins, it derives from the situation where the null hypothesis is rejected for extraneous reasons (or confounders), rather than because the alternative hypothesis is relevant to the disease phenotype. The mechanics of applying statistical tests therefore must address and resolve confounders. It is inadequate to simply rely on manipulating the P-value. We discuss three mechanistic elements (hypothesis statement construction, null distribution appropriateness, and test-statistic construction) and suggest how they can be designed to foil the Anna Karenina effect to select phenotypically relevant biological features.

摘要

安娜·卡列尼娜效应是理论统计应用于实际数据时存在的理论-实践差距的表现。在分析生物学数据的差异特征(如基因或蛋白质)时,它源于由于无关原因(或混杂因素)而拒绝零假设的情况,而不是因为替代假设与疾病表型相关。因此,应用统计检验的机制必须解决和解决混杂因素。仅仅依靠操纵 P 值是不够的。我们讨论了三个机械元素(假设陈述构建、零分布适当性和检验统计构建),并提出了如何设计它们以挫败安娜·卡列尼娜效应以选择表型相关的生物特征。

相似文献

1
Dealing with Confounders in Omics Analysis.处理组学分析中的混杂因素。
Trends Biotechnol. 2018 May;36(5):488-498. doi: 10.1016/j.tibtech.2018.01.013. Epub 2018 Feb 20.
2
SPSS and SAS programs for generalizability theory analyses.用于概化理论分析的SPSS和SAS程序。
Behav Res Methods. 2006 Aug;38(3):542-7. doi: 10.3758/bf03192810.
3
Predictive biomarkers for treatment selection: statistical considerations.用于治疗选择的预测性生物标志物:统计学考量
Biomark Med. 2015;9(11):1121-35. doi: 10.2217/bmm.15.84. Epub 2015 Oct 28.
4
Many-to-one comparison after sample size reestimation for trials with multiple treatment arms and treatment selection.对具有多个治疗组和治疗选择的试验进行样本量重新估计后的多对一比较。
J Biopharm Stat. 2010 Sep;20(5):927-40. doi: 10.1080/10543401003618959.
5
Hypothesis testing for differentially correlated features.差异相关特征的假设检验。
Biostatistics. 2016 Oct;17(4):677-91. doi: 10.1093/biostatistics/kxw013. Epub 2016 Apr 4.
6
Biostatistics Series Module 2: Overview of Hypothesis Testing.生物统计学系列模块2:假设检验概述。
Indian J Dermatol. 2016 Mar-Apr;61(2):137-45. doi: 10.4103/0019-5154.177775.
7
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
8
A maximally selected test of symmetry about zero.关于零点的最大对称性检验。
Stat Med. 2012 Nov 20;31(26):3178-91. doi: 10.1002/sim.5384. Epub 2012 Jun 22.
9
Score tests for exploring complex models: application to HIV dynamics models.用于探索复杂模型的计分检验:在HIV动力学模型中的应用
Biom J. 2010 Feb;52(1):10-21. doi: 10.1002/bimj.200900030.
10
What role should null-hypothesis significance tests have in statistical education and hypothesis falsification?零假设显著性检验在统计教育和假设证伪中应扮演什么角色?
Trends Ecol Evol. 2007 Sep;22(9):445-6; author reply 446. doi: 10.1016/j.tree.2007.06.006. Epub 2007 Jul 20.

引用本文的文献

1
Bioinformatics approaches for studying molecular sex differences in complex diseases.生物信息学方法研究复杂疾病中的分子性别差异。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae499.
2
Cell-free RNA Liquid Biopsy to Monitor Hematopoietic Stem Cell Transplantation.无细胞RNA液体活检用于监测造血干细胞移植
medRxiv. 2024 May 16:2024.05.15.24307448. doi: 10.1101/2024.05.15.24307448.
3
SurvConvMixer: robust and interpretable cancer survival prediction based on ConvMixer using pathway-level gene expression images.SurvConvMixer:基于 ConvMixer 使用通路水平基因表达图像的稳健且可解释的癌症生存预测。
BMC Bioinformatics. 2024 Mar 27;25(1):133. doi: 10.1186/s12859-024-05745-2.
4
Artificial Intelligence in Renal Cell Carcinoma Histopathology: Current Applications and Future Perspectives.人工智能在肾细胞癌组织病理学中的应用:现状与未来展望
Diagnostics (Basel). 2023 Jul 6;13(13):2294. doi: 10.3390/diagnostics13132294.
5
Robust SNP-based prediction of rheumatoid arthritis through machine-learning-optimized polygenic risk score.通过机器学习优化的多基因风险评分实现类风湿关节炎的稳健 SNP 预测。
J Transl Med. 2023 Feb 7;21(1):92. doi: 10.1186/s12967-023-03939-5.
6
Perspectives for better batch effect correction in mass-spectrometry-based proteomics.基于质谱的蛋白质组学中更好的批次效应校正前景
Comput Struct Biotechnol J. 2022 Aug 12;20:4369-4375. doi: 10.1016/j.csbj.2022.08.022. eCollection 2022.
7
Functional coding haplotypes and machine-learning feature elimination identifies predictors of Methotrexate Response in Rheumatoid Arthritis patients.功能性编码单倍型和机器学习特征消除鉴定类风湿关节炎患者甲氨蝶呤反应的预测因子。
EBioMedicine. 2022 Jan;75:103800. doi: 10.1016/j.ebiom.2021.103800. Epub 2022 Jan 10.
8
Mathematical-based microbiome analytics for clinical translation.用于临床转化的基于数学的微生物组分析
Comput Struct Biotechnol J. 2021 Nov 22;19:6272-6281. doi: 10.1016/j.csbj.2021.11.029. eCollection 2021.
9
Extensions of the External Validation for Checking Learned Model Interpretability and Generalizability.用于检查学习模型可解释性和泛化性的外部验证扩展
Patterns (N Y). 2020 Nov 13;1(8):100129. doi: 10.1016/j.patter.2020.100129.
10
Avoid Oversimplifications in Machine Learning: Going beyond the Class-Prediction Accuracy.避免机器学习中的过度简化:超越类别预测准确率
Patterns (N Y). 2020 May 8;1(2):100025. doi: 10.1016/j.patter.2020.100025.