• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于知识整合生物标志物发现的贝叶斯规则学习的可调结构先验。

Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery.

作者信息

Balasubramanian Jeya Balaji, Gopalakrishnan Vanathi

机构信息

Intelligent Systems Program, School of Computing and Information, University of Pittsburgh, Pittsburgh, PA 15260, United States.

Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15206, United States.

出版信息

World J Clin Oncol. 2018 Sep 14;9(5):98-109. doi: 10.5306/wjco.v9.i5.98.

DOI:10.5306/wjco.v9.i5.98
PMID:30254965
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6153126/
Abstract

AIM

To develop a framework to incorporate background domain knowledge into classification rule learning for knowledge discovery in biomedicine.

METHODS

Bayesian rule learning (BRL) is a rule-based classifier that uses a greedy best-first search over a space of Bayesian belief-networks (BN) to find the optimal BN to explain the input dataset, and then infers classification rules from this BN. BRL uses a Bayesian score to evaluate the quality of BNs. In this paper, we extended the Bayesian score to include informative structure priors, which encodes our prior domain knowledge about the dataset. We call this extension of BRL as BRL. The structure prior has a λ hyperparameter that allows the user to tune the degree of incorporation of the prior knowledge in the model learning process. We studied the effect of λ on model learning using a simulated dataset and a real-world lung cancer prognostic biomarker dataset, by measuring the degree of incorporation of our specified prior knowledge. We also monitored its effect on the model predictive performance. Finally, we compared BRL to other state-of-the-art classifiers commonly used in biomedicine.

RESULTS

We evaluated the degree of incorporation of prior knowledge into BRL, with simulated data by measuring the Graph Edit Distance between the true data-generating model and the model learned by BRL. We specified the true model using informative structure priors. We observed that by increasing the value of λ we were able to increase the influence of the specified structure priors on model learning. A large value of λ of BRL caused it to return the true model. This also led to a gain in predictive performance measured by area under the receiver operator characteristic curve (AUC). We then obtained a publicly available real-world lung cancer prognostic biomarker dataset and specified a known biomarker from literature [the epidermal growth factor receptor () gene]. We again observed that larger values of λ led to an increased incorporation of EGFR into the final BRL model. This relevant background knowledge also led to a gain in AUC.

CONCLUSION

BRL enables tunable structure priors to be incorporated during Bayesian classification rule learning that integrates data and knowledge as demonstrated using lung cancer biomarker data.

摘要

目的

开发一个框架,将背景领域知识纳入生物医学知识发现的分类规则学习中。

方法

贝叶斯规则学习(BRL)是一种基于规则的分类器,它在贝叶斯信念网络(BN)空间上使用贪婪最佳优先搜索来找到最优的BN以解释输入数据集,然后从该BN中推断分类规则。BRL使用贝叶斯分数来评估BN的质量。在本文中,我们扩展了贝叶斯分数以纳入信息结构先验,其对我们关于数据集的先验领域知识进行编码。我们将BRL的这种扩展称为BRL。结构先验有一个λ超参数,允许用户在模型学习过程中调整先验知识的纳入程度。我们通过测量指定先验知识的纳入程度,使用模拟数据集和真实世界的肺癌预后生物标志物数据集研究了λ对模型学习的影响。我们还监测了其对模型预测性能的影响。最后,我们将BRL与生物医学中常用的其他最新分类器进行了比较。

结果

我们通过测量真实数据生成模型与BRL学习的模型之间的图编辑距离,用模拟数据评估了先验知识纳入BRL的程度。我们使用信息结构先验指定了真实模型。我们观察到,通过增加λ的值,我们能够增加指定结构先验对模型学习的影响。BRL的大λ值使其返回真实模型。这也导致在通过接受者操作特征曲线(AUC)下面积测量的预测性能方面有所提高。然后,我们获得了一个公开可用的真实世界肺癌预后生物标志物数据集,并从文献中指定了一个已知的生物标志物[表皮生长因子受体()基因]。我们再次观察到,较大的λ值导致EGFR更多地纳入最终的BRL模型。这种相关的背景知识也导致AUC增加。

结论

如使用肺癌生物标志物数据所证明的,BRL能够在贝叶斯分类规则学习期间纳入可调整的结构先验,从而整合数据和知识。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/b32b51ef487a/WJCO-9-98-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/167d6511980d/WJCO-9-98-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/485c189a342c/WJCO-9-98-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/ba069ea35a38/WJCO-9-98-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/1f6ed6485fba/WJCO-9-98-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/07ea8ce5fac1/WJCO-9-98-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/70dfc907fdb6/WJCO-9-98-g0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/ecbd576e3af6/WJCO-9-98-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/d32cb45f5818/WJCO-9-98-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/5e967ac9cb8d/WJCO-9-98-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/6ad2a4286f12/WJCO-9-98-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/b32b51ef487a/WJCO-9-98-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/167d6511980d/WJCO-9-98-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/485c189a342c/WJCO-9-98-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/ba069ea35a38/WJCO-9-98-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/1f6ed6485fba/WJCO-9-98-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/07ea8ce5fac1/WJCO-9-98-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/70dfc907fdb6/WJCO-9-98-g0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/ecbd576e3af6/WJCO-9-98-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/d32cb45f5818/WJCO-9-98-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/5e967ac9cb8d/WJCO-9-98-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/6ad2a4286f12/WJCO-9-98-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc00/6153126/b32b51ef487a/WJCO-9-98-g007.jpg

相似文献

1
Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery.用于知识整合生物标志物发现的贝叶斯规则学习的可调结构先验。
World J Clin Oncol. 2018 Sep 14;9(5):98-109. doi: 10.5306/wjco.v9.i5.98.
2
Learning Parsimonious Classification Rules from Gene Expression Data Using Bayesian Networks with Local Structure.使用具有局部结构的贝叶斯网络从基因表达数据中学习简洁分类规则。
Data (Basel). 2017 Mar;2(1). doi: 10.3390/data2010005. Epub 2017 Jan 18.
3
Incorporating biological prior knowledge for Bayesian learning via maximal knowledge-driven information priors.通过最大知识驱动信息先验将生物先验知识纳入贝叶斯学习。
BMC Bioinformatics. 2017 Dec 28;18(Suppl 14):552. doi: 10.1186/s12859-017-1893-4.
4
Incorporating expert knowledge when learning Bayesian network structure: a medical case study.在学习贝叶斯网络结构时纳入专家知识:一个医学案例研究。
Artif Intell Med. 2011 Nov;53(3):181-204. doi: 10.1016/j.artmed.2011.08.004. Epub 2011 Sep 29.
5
Prediction models for clustered data with informative priors for the random effects: a simulation study.具有信息先验的随机效应聚集数据的预测模型:一项模拟研究。
BMC Med Res Methodol. 2018 Aug 6;18(1):83. doi: 10.1186/s12874-018-0543-5.
6
cMRI-BED: A novel informatics framework for cardiac MRI biomarker extraction and discovery applied to pediatric cardiomyopathy classification.心脏磁共振成像生物标志物提取与发现的新型信息学框架——应用于儿童心肌病分类的cMRI-BED
Biomed Eng Online. 2015;14 Suppl 2(Suppl 2):S7. doi: 10.1186/1475-925X-14-S2-S7. Epub 2015 Aug 13.
7
Selective model averaging with bayesian rule learning for predictive biomedicine.用于预测性生物医学的贝叶斯规则学习选择性模型平均法。
AMIA Jt Summits Transl Sci Proc. 2014 Apr 7;2014:17-22. eCollection 2014.
8
Bayesian rule learning for biomedical data mining.贝叶斯规则学习在生物医学数据挖掘中的应用。
Bioinformatics. 2010 Mar 1;26(5):668-75. doi: 10.1093/bioinformatics/btq005. Epub 2010 Jan 14.
9
A novel approach to modeling multifactorial diseases using Ensemble Bayesian Rule classifiers.基于集成贝叶斯规则分类器的多因素疾病建模新方法。
J Biomed Inform. 2020 Jul;107:103455. doi: 10.1016/j.jbi.2020.103455. Epub 2020 Jun 1.
10
Machine learning for improved pathological staging of prostate cancer: a performance comparison on a range of classifiers.机器学习在前列腺癌病理分期中的应用:一系列分类器的性能比较。
Artif Intell Med. 2012 May;55(1):25-35. doi: 10.1016/j.artmed.2011.11.003. Epub 2011 Dec 27.

本文引用的文献

1
Learning Parsimonious Classification Rules from Gene Expression Data Using Bayesian Networks with Local Structure.使用具有局部结构的贝叶斯网络从基因表达数据中学习简洁分类规则。
Data (Basel). 2017 Mar;2(1). doi: 10.3390/data2010005. Epub 2017 Jan 18.
2
Selective model averaging with bayesian rule learning for predictive biomedicine.用于预测性生物医学的贝叶斯规则学习选择性模型平均法。
AMIA Jt Summits Transl Sci Proc. 2014 Apr 7;2014:17-22. eCollection 2014.
3
NCBI GEO: archive for functional genomics data sets--update.NCBI GEO:功能基因组学数据集存档 - 更新。
Nucleic Acids Res. 2013 Jan;41(Database issue):D991-5. doi: 10.1093/nar/gks1193. Epub 2012 Nov 27.
4
Epidermal growth factor receptor (EGFR) in lung cancer: an overview and update.表皮生长因子受体(EGFR)在肺癌中的作用:概述与最新进展。
J Thorac Dis. 2010 Mar;2(1):48-51.
5
Application of an efficient Bayesian discretization method to biomedical data.高效贝叶斯离散化方法在生物医学数据中的应用。
BMC Bioinformatics. 2011 Jul 28;12:309. doi: 10.1186/1471-2105-12-309.
6
Identification of a novel biomarker, SEMA5A, for non-small cell lung carcinoma in nonsmoking women.鉴定一种新型生物标志物 SEMA5A 用于非吸烟女性的非小细胞肺癌。
Cancer Epidemiol Biomarkers Prev. 2010 Oct;19(10):2590-7. doi: 10.1158/1055-9965.EPI-10-0332. Epub 2010 Aug 27.
7
Bayesian rule learning for biomedical data mining.贝叶斯规则学习在生物医学数据挖掘中的应用。
Bioinformatics. 2010 Mar 1;26(5):668-75. doi: 10.1093/bioinformatics/btq005. Epub 2010 Jan 14.
8
Improving classification performance with discretization on biomedical datasets.通过对生物医学数据集进行离散化来提高分类性能。
AMIA Annu Symp Proc. 2008 Nov 6;2008:445-9.
9
Network inference using informative priors.使用信息先验进行网络推断。
Proc Natl Acad Sci U S A. 2008 Sep 23;105(38):14313-8. doi: 10.1073/pnas.0802272105. Epub 2008 Sep 17.
10
Clinical and biological features associated with epidermal growth factor receptor gene mutations in lung cancers.肺癌中与表皮生长因子受体基因突变相关的临床和生物学特征。
J Natl Cancer Inst. 2005 Mar 2;97(5):339-46. doi: 10.1093/jnci/dji055.