• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对 2x2 列联表进行因式分解。

Factoring a 2 x 2 contingency table.

机构信息

Science, Technology and Research Institute of Delaware, Wilmington, DE, United States of America.

出版信息

PLoS One. 2019 Oct 25;14(10):e0224460. doi: 10.1371/journal.pone.0224460. eCollection 2019.

DOI:10.1371/journal.pone.0224460
PMID:31652283
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6814214/
Abstract

We show that a two-component proportional representation provides the necessary framework to account for the properties of a 2 × 2 contingency table. This corresponds to the factorization of the table as a product of proportion and diagonal row or column sum matrices. The row and column sum invariant measures for proportional variation are obtained. Geometrically, these correspond to displacements of two point vectors in the standard one-simplex, which are reduced to a center-of-mass coordinate representation, [Formula: see text]. Then, effect size measures, such as the odds ratio and relative risk, correspond to different perspective functions for the mapping of (δ, μ) to [Formula: see text]. Furthermore, variations in δ and μ will be associated with different cost-benefit trade-offs for a given application. Therefore, pure mathematics alone does not provide the specification of a general form for the perspective function. This implies that the question of the merits of the odds ratio versus relative risk cannot be resolved in a general way. Expressions are obtained for the marginal sum dependence and the relations between various effect size measures, including the simple matching coefficient, odds ratio, relative risk, Yule's Q, ϕ, and Goodman and Kruskal's τc|r. We also show that Gini information gain (IGG) is equivalent to ϕ2 in the classification and regression tree (CART) algorithm. Then, IGG can yield misleading results due to the dependence on marginal sums. Monte Carlo methods facilitate the detailed specification of stochastic effects in the data acquisition process and provide a practical way to estimate the confidence interval for an effect size.

摘要

我们表明,两分量比例代表制为解释 2×2 列联表的性质提供了必要的框架。这对应于将表分解为比例和对角行或列和矩阵的乘积。获得了比例变化的行和列和不变量测度。从几何上看,这些对应于标准一单形中两个点向量的位移,这些位移简化为质心坐标表示,[公式:见正文]。然后,效果大小度量,如优势比和相对风险,对应于(δ,μ)到[公式:见正文]映射的不同透视函数。此外,对于给定的应用,δ和μ的变化将与不同的成本效益权衡相关。因此,纯粹的数学本身并不能为透视函数的一般形式提供规范。这意味着,优势比与相对风险的优点不能以一般方式解决。得到了边际和依赖关系以及各种效果大小度量之间的关系,包括简单匹配系数、优势比、相对风险、Yule 的 Q、ϕ 和 Goodman 和 Kruskal 的 τc|r。我们还表明,基尼信息增益(IGG)在分类和回归树(CART)算法中等同于ϕ2。然后,由于对边际和的依赖,IGG 可能会产生误导性结果。蒙特卡罗方法有助于在数据采集过程中详细说明随机效应,并为效果大小的置信区间提供实用的估计方法。

相似文献

1
Factoring a 2 x 2 contingency table.对 2x2 列联表进行因式分解。
PLoS One. 2019 Oct 25;14(10):e0224460. doi: 10.1371/journal.pone.0224460. eCollection 2019.
2
Nonoverlap proportion and the representation of point-biserial variation.非重叠比例与点双列变异的表示。
PLoS One. 2020 Dec 28;15(12):e0244517. doi: 10.1371/journal.pone.0244517. eCollection 2020.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Goodman and Kruskal's Gamma Coefficient for Ordinalized Bivariate Normal Distributions. Goodman 和 Kruskal 的有序双变量正态分布 Gamma 系数。
Psychometrika. 2020 Dec;85(4):905-925. doi: 10.1007/s11336-020-09730-5. Epub 2020 Oct 27.
5
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
6
Simple methods of determining confidence intervals for functions of estimates in published results.确定已发表结果中估计量函数置信区间的简单方法。
PLoS One. 2014 May 28;9(5):e98498. doi: 10.1371/journal.pone.0098498. eCollection 2014.
7
A practical approach for comparing means of two groups without equal variance assumption.一种在不假设方差相等的情况下比较两组均值的实用方法。
Stat Med. 2002 Oct 30;21(20):3137-51. doi: 10.1002/sim.1238.
8
Covariate-adjusted confidence interval for the intraclass correlation coefficient.调整协变量后的组内相关系数置信区间。
Contemp Clin Trials. 2013 Sep;36(1):244-53. doi: 10.1016/j.cct.2013.07.003. Epub 2013 Jul 16.
9
How accurate is the Pearson r-from-Z approximation? A Monte Carlo simulation study.皮尔逊r值的Z近似法有多准确?一项蒙特卡洛模拟研究。
J Gen Psychol. 2012 Apr-Jun;139(2):68-77. doi: 10.1080/00221309.2012.661376.
10
Applications of Monte Carlo Simulation in Modelling of Biochemical Processes蒙特卡罗模拟在生化过程建模中的应用

引用本文的文献

1
A parametric framework for multidimensional linear measurement error regression.多维线性测量误差回归的参数框架。
PLoS One. 2022 Jan 21;17(1):e0262148. doi: 10.1371/journal.pone.0262148. eCollection 2022.
2
Nonoverlap proportion and the representation of point-biserial variation.非重叠比例与点双列变异的表示。
PLoS One. 2020 Dec 28;15(12):e0244517. doi: 10.1371/journal.pone.0244517. eCollection 2020.

本文引用的文献

1
Five ways to fix statistics.修正统计数据的五种方法。
Nature. 2017 Nov 30;551(7682):557-559. doi: 10.1038/d41586-017-07522-z.
2
Big data need big theory too.大数据也需要大理论。
Philos Trans A Math Phys Eng Sci. 2016 Nov 13;374(2080). doi: 10.1098/rsta.2016.0153.
3
Confidence intervals for correlations when data are not normal.数据非正态时相关性的置信区间。
Behav Res Methods. 2017 Feb;49(1):294-309. doi: 10.3758/s13428-016-0702-8.
4
Comments on Fifty Years of Classification and Regression Trees.关于分类与回归树五十年的评论
Int Stat Rev. 2014 Dec 1;82(3):359-361. doi: 10.1111/insr.12060.
5
Exploring factors associated with pressure ulcers: a data mining approach.探索与压疮相关的因素:一种数据挖掘方法。
Int J Nurs Stud. 2015 Jan;52(1):102-11. doi: 10.1016/j.ijnurstu.2014.08.002. Epub 2014 Aug 18.
6
Conditional asymmetric linkage disequilibrium (ALD): extending the biallelic r2 measure.条件非对称连锁不平衡(ALD):扩展双等位基因r2测量方法
Genetics. 2014 Sep;198(1):321-31. doi: 10.1534/genetics.114.165266. Epub 2014 Jul 14.
7
Converting an odds ratio to a range of plausible relative risks for better communication of research findings.将比值比转换为一系列合理的相对风险,以便更好地传达研究结果。
BMJ. 2014 Jan 24;348:f7450. doi: 10.1136/bmj.f7450.
8
Points of significance: error bars.重要点:误差线。
Nat Methods. 2013 Oct;10(10):921-2. doi: 10.1038/nmeth.2659.
9
Effect sizes for 2×2 contingency tables.2×2 列联表的效应量。
PLoS One. 2013;8(3):e58777. doi: 10.1371/journal.pone.0058777. Epub 2013 Mar 7.
10
The use of classification trees for bioinformatics.分类树在生物信息学中的应用。
Wiley Interdiscip Rev Data Min Knowl Discov. 2011 Jan;1(1):55-63. doi: 10.1002/widm.14. Epub 2011 Jan 6.