Suppr超能文献

多元免疫荧光直方图之间的概率分箱与一致性检验:扩展卡方检验

Probability binning and testing agreement between multivariate immunofluorescence histograms: extending the chi-squared test.

作者信息

Baggerly K A

机构信息

Department of Biostatistics, M. D. Anderson Cancer Center, Houston, Texas 77030-4009, USA.

出版信息

Cytometry. 2001 Oct 1;45(2):141-50. doi: 10.1002/1097-0320(20011001)45:2<141::aid-cyto1156>3.0.co;2-m.

Abstract

BACKGROUND

A key problem in immunohistochemistry is assessing when two sample histograms are significantly different. One test that is commonly used for this purpose in the univariate case is the chi-squared test. Comparing multivariate distributions is qualitatively harder, as the "curse of dimensionality" means that the number of bins can grow exponentially. For the chi-squared test to be useful, data-dependent binning methods must be employed. An example of how this can be done is provided by the "probability binning" method of Roederer et al. (1,2,3).

METHODS

We derive the theoretical distribution of the probability binning statistic, giving it a more rigorous foundation. We show that the null distribution is a scaled chi-square, and show how it can be related to the standard chi-squared statistic.

RESULTS

A small simulation shows how the theoretical results can be used to (a) modify the probability binning statistic to make it more sensitive and (b) suggest variant statistics which, while still exploiting the data-dependent strengths of the probability binning procedure, may be easier to work with.

CONCLUSIONS

The probability binning procedure effectively uses adaptive binning to locate structure in high-dimensional data. The derivation of a theoretical basis provides a more detailed interpretation of its behavior and renders the probability binning method more flexible.

摘要

背景

免疫组织化学中的一个关键问题是评估两个样本直方图何时存在显著差异。在单变量情况下,常用于此目的的一种检验是卡方检验。比较多变量分布在定性上更难,因为“维度诅咒”意味着箱数会呈指数增长。为使卡方检验有用,必须采用依赖数据的分箱方法。Roederer等人(1,2,3)的“概率分箱”方法提供了一个如何做到这一点的示例。

方法

我们推导了概率分箱统计量的理论分布,为其提供了更严格的基础。我们表明零分布是一个缩放后的卡方分布,并展示了它如何与标准卡方统计量相关。

结果

一个小型模拟展示了理论结果如何用于(a)修改概率分箱统计量以使其更敏感,以及(b)提出变体统计量,这些统计量虽然仍利用概率分箱过程中依赖数据的优势,但可能更易于使用。

结论

概率分箱过程有效地利用自适应分箱来定位高维数据中的结构。理论基础的推导为其行为提供了更详细的解释,并使概率分箱方法更灵活。

相似文献

1
Probability binning and testing agreement between multivariate immunofluorescence histograms: extending the chi-squared test.
Cytometry. 2001 Oct 1;45(2):141-50. doi: 10.1002/1097-0320(20011001)45:2<141::aid-cyto1156>3.0.co;2-m.
2
Probability binning comparison: a metric for quantitating multivariate distribution differences.
Cytometry. 2001 Sep 1;45(1):47-55. doi: 10.1002/1097-0320(20010901)45:1<47::aid-cyto1143>3.0.co;2-a.
3
Probability binning comparison: a metric for quantitating univariate distribution differences.
Cytometry. 2001 Sep 1;45(1):37-46. doi: 10.1002/1097-0320(20010901)45:1<37::aid-cyto1142>3.0.co;2-e.
5
[Statistical analysis of pharmacological data: use of cumulative chi-squared statistic].
Nihon Yakurigaku Zasshi. 1997 Dec;110(6):341-6. doi: 10.1254/fpj.110.341.
6
Quantile-function based null distribution in resampling based multiple testing.
Stat Appl Genet Mol Biol. 2006;5:Article14. doi: 10.2202/1544-6115.1199. Epub 2006 May 21.
7
Categorical independence tests for large sparse r-way contingency tables.
Percept Mot Skills. 2002 Oct;95(2):606-10. doi: 10.2466/pms.2002.95.2.606.
8
Chi-squared and Fisher-Irwin tests of two-by-two tables with small sample recommendations.
Stat Med. 2007 Aug 30;26(19):3661-75. doi: 10.1002/sim.2832.
9
Frequency difference gating: a multivariate method for identifying subsets that differ between samples.
Cytometry. 2001 Sep 1;45(1):56-64. doi: 10.1002/1097-0320(20010901)45:1<56::aid-cyto1144>3.0.co;2-9.
10
A multivariate two-sample mean test for small sample size and missing data.
Biometrics. 2006 Sep;62(3):877-85. doi: 10.1111/j.1541-0420.2006.00533.x.

引用本文的文献

1
Earth Mover's Distance (EMD): A True Metric for Comparing Biomarker Expression Levels in Cell Populations.
PLoS One. 2016 Mar 23;11(3):e0151859. doi: 10.1371/journal.pone.0151859. eCollection 2016.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验