• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于环境数据集中概率分布的拟合优度评估和聚类的图形界面。

Graphical interface for goodness-of-fit evaluation and clustering of probability distributions in environmental datasets.

作者信息

Reba Felix, Saifudin Toha, Hendradi Rimuljo

机构信息

Doctoral Program of Mathematics and Natural Sciences, Faculty of Sciences and Technology, Universitas Airlangga, Surabaya, Indonesia.

Mathematics Department, Faculty of Sciences and Technology, Universitas Airlangga, Surabaya, Indonesia.

出版信息

MethodsX. 2025 Aug 27;15:103586. doi: 10.1016/j.mex.2025.103586. eCollection 2025 Dec.

DOI:10.1016/j.mex.2025.103586
PMID:40949830
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12423415/
Abstract

Goodness-of-Fit (GoF) tests are applied to assess the suitability of probability distributions for environmental data. However, classical methods such as Kolmogorov-Smirnov (KS) and Anderson-Darling (AD) often yield inconsistent outcomes in heterogeneous datasets. Previous studies employed clustering or mixture modeling separately, lacking integration with automated estimation and adaptive weighting. This study introduces a unified framework combining GoF evaluation, K-Means++ clustering, and a KS-weighted mixture model to enhance distribution selection. Seventeen univariate probability distributions were tested on chlorophyll concentration data from the Black Sea, with adequacy assessed via KS and AD tests and five information criteria. The framework was implemented via a MATLAB GUI to automate clustering, estimation, model selection, and evaluation steps. Tested across multiple sample sizes and extended to variables, the GUI demonstrated adaptability and robustness. Model performance showed that the KS-weighted mixture model provided stable fits for complex datasets, improving interpretability and reducing reliance on single-distribution assumptions. Integrates GoF testing, clustering, and mixture modeling Implements a reproducible workflow via MATLAB GUI Enhances robustness and positions mixture modeling within environmental data analysis.

摘要

拟合优度(GoF)检验用于评估概率分布对环境数据的适用性。然而,诸如柯尔莫哥洛夫-斯米尔诺夫(KS)和安德森- Darling(AD)等经典方法在异质数据集中往往会产生不一致的结果。以往的研究分别采用聚类或混合建模,缺乏与自动估计和自适应加权的整合。本研究引入了一个统一的框架,将GoF评估、K-Means++聚类和KS加权混合模型相结合,以增强分布选择。对来自黑海的叶绿素浓度数据测试了17种单变量概率分布,并通过KS和AD检验以及五个信息准则评估其充分性。该框架通过MATLAB GUI实现,以自动化聚类、估计、模型选择和评估步骤。在多个样本量上进行测试并扩展到变量,该GUI展示了适应性和稳健性。模型性能表明,KS加权混合模型为复杂数据集提供了稳定的拟合,提高了可解释性并减少了对单分布假设的依赖。集成了GoF测试、聚类和混合建模 通过MATLAB GUI实现了可重复的工作流程 增强了稳健性,并将混合建模定位在环境数据分析中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/d38229314dd1/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/879679249915/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/21d8517d6efb/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/946d5a2b7907/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/91ef419118ff/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/c846efe76698/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/11e66f8b91e2/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/0daa9ca69909/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/d38229314dd1/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/879679249915/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/21d8517d6efb/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/946d5a2b7907/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/91ef419118ff/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/c846efe76698/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/11e66f8b91e2/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/0daa9ca69909/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0481/12423415/d38229314dd1/gr7.jpg

相似文献

1
Graphical interface for goodness-of-fit evaluation and clustering of probability distributions in environmental datasets.用于环境数据集中概率分布的拟合优度评估和聚类的图形界面。
MethodsX. 2025 Aug 27;15:103586. doi: 10.1016/j.mex.2025.103586. eCollection 2025 Dec.
2
ASAS-NANP symposium: mathematical modeling in animal nutrition: synthetic database generation for non-normal multivariate distributions: a rank-based method with application to ruminant methane emissions.美国动物科学学会-北美猪营养大会研讨会:动物营养中的数学建模:非正态多元分布的综合数据库生成:一种基于秩的方法及其在反刍动物甲烷排放中的应用
J Anim Sci. 2025 Jan 4;103. doi: 10.1093/jas/skaf136.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
6
Participation in environmental enhancement and conservation activities for health and well-being in adults: a review of quantitative and qualitative evidence.成年人参与促进环境改善和保护活动对健康与福祉的影响:定量和定性证据综述
Cochrane Database Syst Rev. 2016 May 21;2016(5):CD010351. doi: 10.1002/14651858.CD010351.pub2.
7
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
8
Post-pandemic planning for maternity care for local, regional, and national maternity systems across the four nations: a mixed-methods study.针对四个地区的地方、区域和国家孕产妇保健系统的疫情后规划:一项混合方法研究。
Health Soc Care Deliv Res. 2025 Sep;13(35):1-25. doi: 10.3310/HHTE6611.
9
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
10
Audit and feedback: effects on professional practice.审核与反馈:对专业实践的影响
Cochrane Database Syst Rev. 2025 Mar 25;3(3):CD000259. doi: 10.1002/14651858.CD000259.pub4.

本文引用的文献

1
Data-driven determination of zooplankton bioregions and robustness analysis.基于数据驱动的浮游动物生物区域确定及稳健性分析。
MethodsX. 2024 Apr 2;12:102676. doi: 10.1016/j.mex.2024.102676. eCollection 2024 Jun.
2
DetEdit: A graphical user interface for annotating and editing events detected in long-term acoustic monitoring data.DetEdit:一个用于注释和编辑长期声学监测数据中检测到的事件的图形用户界面。
PLoS Comput Biol. 2020 Jan 13;16(1):e1007598. doi: 10.1371/journal.pcbi.1007598. eCollection 2020 Jan.
3
Resurrecting the ecological underpinnings of ocean plankton blooms.
复苏海洋浮游生物爆发的生态基础。
Ann Rev Mar Sci. 2014;6:167-94. doi: 10.1146/annurev-marine-052913-021325. Epub 2013 Sep 25.