• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

非重叠比例与点双列变异的表示。

Nonoverlap proportion and the representation of point-biserial variation.

机构信息

Vector Analytics, LLC, Wilmington, DE, United States of America.

出版信息

PLoS One. 2020 Dec 28;15(12):e0244517. doi: 10.1371/journal.pone.0244517. eCollection 2020.

DOI:10.1371/journal.pone.0244517
PMID:33370394
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7769483/
Abstract

We consider the problem of constructing a complete set of parameters that account for all of the degrees of freedom for point-biserial variation. We devise an algorithm where sort as an intrinsic property of both numbers and labels, is used to generate the parameters. Algebraically, point-biserial variation is represented by a Cartesian product of statistical parameters for two sets of [Formula: see text] data, and the difference between mean values (δ) corresponds to the representation of variation in the center of mass coordinates, (δ, μ). The existence of alternative effect size measures is explained by the fact that mathematical considerations alone do not specify a preferred coordinate system for the representation of point-biserial variation. We develop a novel algorithm for estimating the nonoverlap proportion (ρpb) of two sets of [Formula: see text] data. ρpb is obtained by sorting the labeled [Formula: see text] data and analyzing the induced order in the categorical data using a diagonally symmetric 2 × 2 contingency table. We examine the correspondence between ρpb and point-biserial correlation (rpb) for uniform and normal distributions. We identify the [Formula: see text], [Formula: see text], and [Formula: see text] representations for Pearson product-moment correlation, Cohen's d, and rpb. We compare the performance of rpb versus ρpb and the sample size proportion corrected correlation (rpbd), confirm that invariance with respect to the sample size proportion is important in the formulation of the effect size, and give an example where three parameters (rpbd, μ, ρpb) are needed to distinguish different forms of point-biserial variation in CART regression tree analysis. We discuss the importance of providing an assessment of cost-benefit trade-offs between relevant system parameters because 'substantive significance' is specified by mapping functional or engineering requirements into the effect size coordinates. Distributions and confidence intervals for the statistical parameters are obtained using Monte Carlo methods.

摘要

我们考虑构建一个完整的参数集,以解释点双列变异的所有自由度。我们设计了一种算法,其中排序作为数字和标签的固有属性,用于生成参数。从代数角度看,点双列变异由两组[公式:见文本]数据的统计参数的笛卡尔积表示,均值差(δ)对应于质心坐标(δ,μ)的变化表示。替代效应量测的存在可以解释为数学考虑本身并不能为点双列变异的表示指定一个首选的坐标系。我们开发了一种新的算法来估计两组[公式:见文本]数据的非重叠比例(ρpb)。ρpb 通过对标记的[公式:见文本]数据进行排序,并使用对角对称的 2×2 列联表分析分类数据中的诱导顺序来获得。我们检查了 ρpb 与点双列相关系数(rpb)在均匀分布和正态分布下的对应关系。我们确定了皮尔逊积差相关系数(rpb)、科恩氏 d 和 rpb 的[公式:见文本]、[公式:见文本]和[公式:见文本]表示。我们比较了 rpb 与 ρpb 以及样本大小比例校正相关系数(rpbd)的性能,确认了样本大小比例不变性在效应量的制定中很重要,并给出了一个例子,其中在 CART 回归树分析中需要三个参数(rpbd、μ、ρpb)来区分不同形式的点双列变异。我们讨论了提供对相关系统参数的成本效益权衡评估的重要性,因为“实质性意义”是通过将功能或工程要求映射到效应量坐标来指定的。使用蒙特卡罗方法获得统计参数的分布和置信区间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/f71326928b97/pone.0244517.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/7d2e2ab9e85c/pone.0244517.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/07ee737db013/pone.0244517.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/ba3de5783973/pone.0244517.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/c646f8814ff2/pone.0244517.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/2d415f17bc2a/pone.0244517.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/28fa61f763e0/pone.0244517.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/f71326928b97/pone.0244517.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/7d2e2ab9e85c/pone.0244517.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/07ee737db013/pone.0244517.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/ba3de5783973/pone.0244517.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/c646f8814ff2/pone.0244517.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/2d415f17bc2a/pone.0244517.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/28fa61f763e0/pone.0244517.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9757/7769483/f71326928b97/pone.0244517.g007.jpg

相似文献

1
Nonoverlap proportion and the representation of point-biserial variation.非重叠比例与点双列变异的表示。
PLoS One. 2020 Dec 28;15(12):e0244517. doi: 10.1371/journal.pone.0244517. eCollection 2020.
2
Factoring a 2 x 2 contingency table.对 2x2 列联表进行因式分解。
PLoS One. 2019 Oct 25;14(10):e0224460. doi: 10.1371/journal.pone.0224460. eCollection 2019.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Point-biserial correlation: Interval estimation, hypothesis testing, meta-analysis, and sample size determination.点二列相关:区间估计、假设检验、元分析和样本量确定。
Br J Math Stat Psychol. 2020 Nov;73 Suppl 1:113-144. doi: 10.1111/bmsp.12189. Epub 2019 Sep 30.
5
A short note on the maximal point-biserial correlation under non-normality.关于非正态下最大点二列相关的简短说明。
Br J Math Stat Psychol. 2016 Nov;69(3):344-351. doi: 10.1111/bmsp.12075.
6
A novel particle filtering method for estimation of pulse pressure variation during spontaneous breathing.一种用于估计自主呼吸期间脉压变异的新型粒子滤波方法。
Biomed Eng Online. 2016 Aug 11;15(1):94. doi: 10.1186/s12938-016-0214-x.
7
Validation of a virtual source model of medical linac for Monte Carlo dose calculation using multi-threaded Geant4.利用多线程 Geant4 对医用直线加速器虚拟源模型进行蒙特卡罗剂量计算的验证
Phys Med Biol. 2018 Apr 13;63(8):085008. doi: 10.1088/1361-6560/aab7a1.
8
Inferring the Number of Attributes for the Exploratory DINA Model.推断探索性 DINA 模型的属性数量。
Psychometrika. 2021 Mar;86(1):30-64. doi: 10.1007/s11336-021-09750-9. Epub 2021 Mar 22.
9
Investigating energy deposition within cell populations using Monte Carlo simulations.利用蒙特卡罗模拟研究细胞群体中的能量沉积。
Phys Med Biol. 2018 Aug 1;63(15):155018. doi: 10.1088/1361-6560/aacf7b.
10
Phylogenetic Flexibility via Hall-Type Inequalities and Submodularity.基于 Hall 型不等式与次模性的系统发育灵活性。
Bull Math Biol. 2019 Feb;81(2):598-617. doi: 10.1007/s11538-018-0419-1. Epub 2018 Mar 27.

引用本文的文献

1
Correlation and causation for cardiothoracic surgeons: part 4-distinguishing relationships in data.心胸外科医生的相关性与因果关系:第4部分——区分数据中的关系
Indian J Thorac Cardiovasc Surg. 2025 Mar;41(3):371-380. doi: 10.1007/s12055-024-01889-1. Epub 2025 Feb 8.
2
A parametric framework for multidimensional linear measurement error regression.多维线性测量误差回归的参数框架。
PLoS One. 2022 Jan 21;17(1):e0262148. doi: 10.1371/journal.pone.0262148. eCollection 2022.

本文引用的文献

1
Factoring a 2 x 2 contingency table.对 2x2 列联表进行因式分解。
PLoS One. 2019 Oct 25;14(10):e0224460. doi: 10.1371/journal.pone.0224460. eCollection 2019.
2
Measuring Distribution Similarities Between Samples: A Distribution-Free Overlapping Index.测量样本间的分布相似性:一种无分布重叠指数。
Front Psychol. 2019 May 21;10:1089. doi: 10.3389/fpsyg.2019.01089. eCollection 2019.
3
The Meaningfulness of Effect Sizes in Psychological Research: Differences Between Sub-Disciplines and the Impact of Potential Biases.
效应量在心理学研究中的意义:子学科之间的差异及潜在偏差的影响。
Front Psychol. 2019 Apr 11;10:813. doi: 10.3389/fpsyg.2019.00813. eCollection 2019.
4
Reducing Bias and Error in the Correlation Coefficient Due to Nonnormality.减少由于非正态性导致的相关系数偏差和误差。
Educ Psychol Meas. 2015 Oct;75(5):785-804. doi: 10.1177/0013164414557639. Epub 2014 Nov 11.
5
A short note on the maximal point-biserial correlation under non-normality.关于非正态下最大点二列相关的简短说明。
Br J Math Stat Psychol. 2016 Nov;69(3):344-351. doi: 10.1111/bmsp.12075.
6
On effect size.关于效应量。
Psychol Methods. 2012 Jun;17(2):137-52. doi: 10.1037/a0028086. Epub 2012 Apr 30.
7
Effect size estimates: current use, calculations, and interpretation.效应量估计:当前使用、计算和解释。
J Exp Psychol Gen. 2012 Feb;141(1):2-18. doi: 10.1037/a0024338. Epub 2011 Aug 8.
8
Effect size, confidence interval and statistical significance: a practical guide for biologists.效应量、置信区间与统计显著性:生物学家实用指南
Biol Rev Camb Philos Soc. 2007 Nov;82(4):591-605. doi: 10.1111/j.1469-185X.2007.00027.x.
9
When effect sizes disagree: the case of r and d.当效应量不一致时:r与d的情况
Psychol Methods. 2006 Dec;11(4):386-401. doi: 10.1037/1082-989X.11.4.386.
10
The effect of misclassification on the estimation of association: a review.错误分类对关联估计的影响:综述
Int J Methods Psychiatr Res. 2005;14(2):92-101. doi: 10.1002/mpr.20.