• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

countfitteR:用于评估DNA损伤的计数分布的高效选择

countfitteR: efficient selection of count distributions to assess DNA damage.

作者信息

Chilimoniuk Jarosław, Gosiewska Alicja, Słowik Jadwiga, Weiss Romano, Deckert P Markus, Rödiger Stefan, Burdukiewicz Michał

机构信息

Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland.

Faculty of Natural Sciences, Brandenburg University of Technology Cottbus-Senftenberg, Senftenberg, Germany.

出版信息

Ann Transl Med. 2021 Apr;9(7):528. doi: 10.21037/atm-20-6363.

DOI:10.21037/atm-20-6363
PMID:33987226
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8105836/
Abstract

BACKGROUND

DNA double-strand breaks can be counted as discrete foci by imaging techniques. In personalized medicine and pharmacology, the analysis of counting data is relevant for numerous applications, e.g., for cancer and aging research and the evaluation of drug efficacy. By default, it is assumed to follow the Poisson distribution. This assumption, however, may lead to biased results and faulty conclusions in datasets with excess zero values (zero-inflation), a variance larger than the mean (overdispersion), or both. In such cases, the assumption of a Poisson distribution would skew the estimation of mean and variance, and other models like the negative binomial (NB), zero-inflated Poisson or zero-inflated NB distributions should be employed. The model chosen has an influence on the parameter estimation (mean value and confidence interval). Yet the choice of the suitable distribution model is not trivial.

METHODS

To support, simplify and objectify this process, we have developed the countfitteR software as an R package. We used a Bayesian approach for distribution model selection and the shiny web application framework for interactive data analysis.

RESULTS

We show the application of our software based on examples of DNA double-strand break count data from phenotypic imaging by multiplex fluorescence microscopy. In analyzing numerous datasets of molecular pharmacological markers (phosphorylated histone H2AX and p53 binding protein), countfitteR demonstrated an equal or superior statistical performance compared to the usually employed two-step procedure, with an overall power of up to 98%. In addition, it still gave information in cases with no result at all from the two-step procedure. In our data sample we found that the NB distribution was the most frequent, with the Poisson distribution taking second place.

CONCLUSIONS

countfitteR can perform an automated distribution model selection and thus support the data analysis and lead to objective statistically verifiable estimated values. Originally designed for the analysis of foci in biomedical image data, countfitteR can be used in a variety of areas where non-Poisson distributed counting data is prevalent.

摘要

背景

DNA双链断裂可通过成像技术计为离散的病灶。在个性化医学和药理学中,计数数据分析在众多应用中具有相关性,例如癌症和衰老研究以及药物疗效评估。默认情况下,假定其遵循泊松分布。然而,在具有过多零值(零膨胀)、方差大于均值(过度离散)或两者兼有的数据集中,这一假设可能导致有偏差的结果和错误的结论。在这种情况下,泊松分布的假设会使均值和方差的估计产生偏差,应采用其他模型,如负二项分布(NB)、零膨胀泊松分布或零膨胀NB分布。所选模型会对参数估计(均值和置信区间)产生影响。然而,选择合适的分布模型并非易事。

方法

为了支持、简化并使这一过程客观化,我们开发了countfitteR软件作为一个R包。我们采用贝叶斯方法进行分布模型选择,并使用闪亮的网络应用框架进行交互式数据分析。

结果

我们基于多重荧光显微镜表型成像的DNA双链断裂计数数据示例展示了我们软件的应用。在分析众多分子药理学标志物(磷酸化组蛋白H2AX和p53结合蛋白)的数据集时,countfitteR与通常采用的两步法相比,表现出同等或更优的统计性能,总体效能高达98%。此外,在两步法完全没有结果的情况下,它仍能提供信息。在我们的数据样本中,我们发现NB分布最为常见,泊松分布位居第二。

结论

countfitteR可以执行自动分布模型选择,从而支持数据分析并得出客观的、经统计验证的估计值。countfitteR最初设计用于分析生物医学图像数据中的病灶,可用于非泊松分布计数数据普遍存在的各种领域。

相似文献

1
countfitteR: efficient selection of count distributions to assess DNA damage.countfitteR:用于评估DNA损伤的计数分布的高效选择
Ann Transl Med. 2021 Apr;9(7):528. doi: 10.21037/atm-20-6363.
2
On performance of parametric and distribution-free models for zero-inflated and over-dispersed count responses.关于零膨胀和过度分散计数响应的参数模型和非参数模型的性能。
Stat Med. 2015 Oct 30;34(24):3235-45. doi: 10.1002/sim.6560. Epub 2015 Jun 15.
3
Marginalized multilevel hurdle and zero-inflated models for overdispersed and correlated count data with excess zeros.用于具有过多零值的过度分散和相关计数数据的边缘化多级障碍模型和零膨胀模型。
Stat Med. 2014 Nov 10;33(25):4402-19. doi: 10.1002/sim.6237. Epub 2014 Jun 23.
4
Evaluation of negative binomial and zero-inflated negative binomial models for the analysis of zero-inflated count data: application to the telemedicine for children with medical complexity trial.零膨胀计数数据的负二项式和零膨胀负二项式模型评估:在医疗复杂性儿童远程医疗试验中的应用。
Trials. 2023 Sep 27;24(1):613. doi: 10.1186/s13063-023-07648-8.
5
Analyzing hospitalization data: potential limitations of Poisson regression.分析住院数据:泊松回归的潜在局限性
Nephrol Dial Transplant. 2015 Aug;30(8):1244-9. doi: 10.1093/ndt/gfv071. Epub 2015 Mar 25.
6
Zero-inflated and hurdle models of count data with extra zeros: examples from an HIV-risk reduction intervention trial.带有额外零值的计数数据的零膨胀和障碍模型:来自 HIV 风险降低干预试验的实例。
Am J Drug Alcohol Abuse. 2011 Sep;37(5):367-75. doi: 10.3109/00952990.2011.597280.
7
Modelling overdispersion and Markovian features in count data.对计数数据中的过离散和马尔可夫特征进行建模。
J Pharmacokinet Pharmacodyn. 2009 Oct;36(5):461-77. doi: 10.1007/s10928-009-9131-y. Epub 2009 Oct 2.
8
Nonlinear mixed-effects modeling of longitudinal count data: Bayesian inference about median counts based on the marginal zero-inflated discrete Weibull distribution.基于边缘零膨胀离散 Weibull 分布的纵向计数数据的非线性混合效应建模:基于边缘零膨胀离散 Weibull 分布的中位数计数的贝叶斯推断。
Stat Med. 2021 Oct 15;40(23):5078-5095. doi: 10.1002/sim.9112. Epub 2021 Jun 21.
9
Selecting the right statistical model for analysis of insect count data by using information theoretic measures.利用信息论方法为昆虫计数数据分析选择合适的统计模型。
Bull Entomol Res. 2006 Oct;96(5):479-88.
10
Analyzing clustered count data with a cluster specific random effect zero-inflated Conway-Maxwell-Poisson distribution.使用具有特定聚类随机效应的零膨胀康威-麦克斯韦-泊松分布分析聚类计数数据。
J Appl Stat. 2018;45(5):799-814. doi: 10.1080/02664763.2017.1312299. Epub 2017 Apr 8.

引用本文的文献

1
Challenges and opportunities in processing NanoString nCounter data.处理NanoString nCounter数据中的挑战与机遇。
Comput Struct Biotechnol J. 2024 Apr 30;23:1951-1958. doi: 10.1016/j.csbj.2024.04.061. eCollection 2024 Dec.
2
Radiation dose estimation with time-since-exposure uncertainty using the [Formula: see text]-H2AX biomarker.利用 [公式:见正文] -H2AX 生物标志物估算暴露时间不确定的辐射剂量。
Sci Rep. 2022 Nov 18;12(1):19877. doi: 10.1038/s41598-022-24331-1.

本文引用的文献

1
DNA damage response signaling pathways and targets for radiotherapy sensitization in cancer.癌症放疗增敏的 DNA 损伤反应信号通路和靶点。
Signal Transduct Target Ther. 2020 May 1;5(1):60. doi: 10.1038/s41392-020-0150-x.
2
Copy Number Amplification of DNA Damage Repair Pathways Potentiates Therapeutic Resistance in Cancer.DNA 损伤修复途径的拷贝数扩增增强了癌症的治疗抵抗性。
Theranostics. 2020 Mar 4;10(9):3939-3951. doi: 10.7150/thno.39341. eCollection 2020.
3
In-depth mining of clinical data: the construction of clinical prediction model with R.
临床数据的深度挖掘:使用R构建临床预测模型。
Ann Transl Med. 2019 Dec;7(23):796. doi: 10.21037/atm.2019.08.63.
4
Pan-Cancer Analysis of Potential Synthetic Lethal Drug Targets Specific to Alterations in DNA Damage Response.针对DNA损伤反应改变的潜在合成致死药物靶点的泛癌分析
Front Oncol. 2019 Oct 25;9:1136. doi: 10.3389/fonc.2019.01136. eCollection 2019.
5
Altering DNA Repair to Improve Radiation Therapy: Specific and Multiple Pathway Targeting.改变DNA修复以改善放射治疗:特异性和多途径靶向
Front Oncol. 2019 Oct 10;9:1009. doi: 10.3389/fonc.2019.01009. eCollection 2019.
6
From Big Data to Precision Medicine.从大数据到精准医学。
Front Med (Lausanne). 2019 Mar 1;6:34. doi: 10.3389/fmed.2019.00034. eCollection 2019.
7
AutoFoci, an automated high-throughput foci detection approach for analyzing low-dose DNA double-strand break repair.自动焦点分析,一种用于分析低剂量 DNA 双链断裂修复的自动化高通量焦点检测方法。
Sci Rep. 2018 Nov 23;8(1):17282. doi: 10.1038/s41598-018-35660-5.
8
Homology-Directed Repair and the Role of BRCA1, BRCA2, and Related Proteins in Genome Integrity and Cancer.同源定向修复以及BRCA1、BRCA2和相关蛋白在基因组完整性和癌症中的作用。
Annu Rev Cancer Biol. 2018 Mar;2:313-336. doi: 10.1146/annurev-cancerbio-030617-050502. Epub 2017 Dec 1.
9
Prostate cancer in the era of "Omic" medicine: recognizing the importance of DNA damage repair pathways.“组学”医学时代的前列腺癌:认识DNA损伤修复途径的重要性。
Ann Transl Med. 2018 May;6(9):161. doi: 10.21037/atm.2018.05.06.
10
The Positive Relationship Between γH2AX and PD-L1 Expression in Lung Squamous Cell Carcinoma.肺鳞状细胞癌中γH2AX与PD-L1表达的正相关关系
In Vivo. 2018 Jan-Feb;32(1):171-177. doi: 10.21873/invivo.11221.