Extension of mixture-of-experts networks for binary classification of hierarchical data.

Authors

Ng Shu-Kay, McLachlan Geoffrey J

Affiliation

Department of Mathematics, University of Queensland, Brisbane, Qld 4072, Australia.

Publication information

Artif Intell Med. 2007 Sep;41(1):57-67. doi: 10.1016/j.artmed.2007.06.001. Epub 2007 Jul 16.

DOI: 10.1016/j.artmed.2007.06.001
PMID: 17629686

Abstract

OBJECTIVE

For many applied problems in the context of medically relevant artificial intelligence, the data collected exhibit a hierarchical or clustered structure. Ignoring the interdependence between hierarchical data can result in misleading classification. In this paper, we extend the mechanism for mixture-of-experts (ME) networks for binary classification of hierarchical data. Another extension is to quantify cluster-specific information on data hierarchy by random effects via the generalized linear mixed-effects model (GLMM).
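
As a rough illustration of the idea, a mixture-of-experts model for a binary outcome with cluster-level random effects could be written as follows. This is a generic sketch, not the formulation in the paper: the symbols (cluster index i, observation index j, gating weights v_h, expert weights w_h, random effects a_{hi} and b_{hi}, variance sigma_h^2) are all assumptions introduced here, and the abstract does not state the exact form or distribution used.

```latex
% Generic sketch (assumed notation): m experts, cluster i, observation j.
\begin{align}
  P(y_{ij} = 1 \mid x_{ij})
    &= \sum_{h=1}^{m} \pi_h(x_{ij})\, \theta_h(x_{ij}), \\
  \pi_h(x_{ij})
    &= \frac{\exp\!\left(v_h^{\top} x_{ij} + a_{hi}\right)}
            {\sum_{l=1}^{m} \exp\!\left(v_l^{\top} x_{ij} + a_{li}\right)}, \\
  \log \frac{\theta_h(x_{ij})}{1 - \theta_h(x_{ij})}
    &= w_h^{\top} x_{ij} + b_{hi},
  \qquad b_{hi} \sim N\!\left(0, \sigma_h^{2}\right).
\end{align}
```

Setting all a_{hi} and b_{hi} to zero recovers an ordinary ME network that treats the observations as independent.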

METHODS AND MATERIAL

The extension of ME networks is implemented by allowing for correlation in the hierarchical data in both the gating and expert networks via the GLMM. The proposed model is illustrated using a real thyroid disease data set. In our study, we consider 7652 thyroid diagnosis records from 1984 to early 1987 with complete information on 20 attribute values. We obtain 10 independent random splits of the data into a training set and a test set in the proportions 85% and 15%. The test sets are used to assess the generalization performance of the proposed model, based on the percentage of misclassifications. For comparison, the results obtained from the ME network with independence assumption are also included.
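
The evaluation protocol described above (10 independent random 85%/15% splits, scored by the percentage of misclassifications on each test set) can be sketched as below. This is a minimal sketch under stated assumptions: the function names, the seed, and the plain unstratified shuffling are hypothetical choices made here, since the abstract does not say how the splits were drawn.

```python
import random


def random_splits(n_records, n_splits=10, train_fraction=0.85, seed=0):
    """Yield (train_idx, test_idx) index lists for independent random splits.

    Assumption: simple shuffling without stratification or cluster awareness.
    """
    rng = random.Random(seed)
    n_train = round(n_records * train_fraction)
    for _ in range(n_splits):
        indices = list(range(n_records))
        rng.shuffle(indices)
        yield indices[:n_train], indices[n_train:]


def misclassification_rate(y_true, y_pred):
    """Percentage of predictions that differ from the true labels."""
    errors = sum(t != p for t, p in zip(y_true, y_pred))
    return 100.0 * errors / len(y_true)


# 7652 records, as reported in the abstract.
splits = list(random_splits(7652))
train_idx, test_idx = splits[0]
```

In use, a model would be fitted on the records in `train_idx` for each split, and `misclassification_rate` applied to its predictions on `test_idx`; averaging the 10 rates gives a single summary figure.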

RESULTS

With the thyroid disease data, the misclassification rate on test sets for the extended ME network is 8.9%, compared to 13.9% for the ME network. In addition, based on model selection methods described in Section 2, a network with two experts is selected. These two expert networks can be considered as modeling two groups of patients with high and low incidence rates. Significant variation among the predicted cluster-specific random effects is detected in the patient group with low incidence rate.

CONCLUSIONS

It is shown that the extended ME network outperforms the ME network for binary classification of hierarchical data. With the thyroid disease data, useful information on the relative log odds of patients with diagnosed conditions at different periods can be evaluated. This information can be taken into consideration for the assessment of treatment planning of the disease. The proposed extended ME network thus facilitates a more general approach to incorporate data hierarchy mechanism in network modeling.

Similar articles

1. Extension of mixture-of-experts networks for binary classification of hierarchical data.
Artif Intell Med. 2007 Sep;41(1):57-67. doi: 10.1016/j.artmed.2007.06.001. Epub 2007 Jul 16.
2. Mixture of experts classification using a hierarchical mixture model.
Neural Comput. 2002 Sep;14(9):2221-44. doi: 10.1162/089976602320264060.
3. Neural networks for longitudinal studies in Alzheimer's disease.
Artif Intell Med. 2006 Mar;36(3):245-55. doi: 10.1016/j.artmed.2005.10.007. Epub 2006 Jan 19.
4. Similarity classifier with generalized mean applied to medical data.
Comput Biol Med. 2006 Sep;36(9):1026-40. doi: 10.1016/j.compbiomed.2005.05.008. Epub 2005 Sep 12.
5. A decision support system to facilitate management of patients with acute gastrointestinal bleeding.
Artif Intell Med. 2008 Mar;42(3):247-59. doi: 10.1016/j.artmed.2007.10.003. Epub 2007 Dec 11.
6. A hierarchical Binomial-Poisson model for the analysis of a crossover design for correlated binary data when the number of trials is dose-dependent.
J Biopharm Stat. 2005;15(2):225-39.
7. An incremental EM-based learning approach for on-line prediction of hospital resource utilization.
Artif Intell Med. 2006 Mar;36(3):257-67. doi: 10.1016/j.artmed.2005.07.003. Epub 2005 Oct 6.
8. Bayesian neural networks for bivariate binary data: an application to prostate cancer study.
Stat Med. 2005 Dec 15;24(23):3645-62. doi: 10.1002/sim.2214.
9. Adaptive classification of two-dimensional gel electrophoretic spot patterns by neural networks and cluster analysis.
Electrophoresis. 1997 Dec;18(15):2749-54. doi: 10.1002/elps.1150181508.
10. Modeling the electrophoretic mobility of analytes in binary solvent electrolyte systems in capillary electrophoresis using an artificial neural network.
Pharmazie. 2005 Sep;60(9):656-60.

Cited by

1. Enhancing Cardiovascular Risk Prediction: Development of an Advanced Xgboost Model with Hospital-Level Random Effects.
Bioengineering (Basel). 2024 Oct 18;11(10):1039. doi: 10.3390/bioengineering11101039.
2. Risk factors and obstetric complications of large for gestational age births with adjustments for community effects: results from a new cohort study.
BMC Public Health. 2010 Aug 6;10:460. doi: 10.1186/1471-2458-10-460.
3. Integrative mixture of experts to combine clinical factors and gene markers.
Bioinformatics. 2010 May 1;26(9):1192-8. doi: 10.1093/bioinformatics/btq107. Epub 2010 Mar 11.