• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DPPred:一个具有简洁判别模式的有效预测框架。

DPPred: An Effective Prediction Framework with Concise Discriminative Patterns.

作者信息

Shang Jingbo, Jiang Meng, Tong Wenzhu, Xiao Jinfeng, Peng Jian, Han Jiawei

机构信息

Department of Computer Science in University of Illinois at Urbana-Champaign, IL, USA.

出版信息

IEEE Trans Knowl Data Eng. 2018 Jul;30(7):1226-1239. doi: 10.1109/TKDE.2017.2757476. Epub 2017 Sep 28.

DOI:10.1109/TKDE.2017.2757476
PMID:30745791
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6368191/
Abstract

In the literature, two series of models have been proposed to address prediction problems including classification and regression. Simple models, such as generalized linear models, have ordinary performance but strong interpretability on a set of simple features. The other series, including tree-based models, organize numerical, categorical and high dimensional features into a comprehensive structure with rich interpretable information in the data. In this paper, we propose a novel Discriminative Pattern-based Prediction framework (DPPred) to accomplish the prediction tasks by taking their advantages of both effectiveness and interpretability. Specifically, DPPred adopts the concise discriminative patterns that are on the prefix paths from the root to leaf nodes in the tree-based models. DPPred selects a limited number of the useful discriminative patterns by searching for the most effective pattern combination to fit generalized linear models. Extensive experiments show that in many scenarios, DPPred provides competitive accuracy with the state-of-the-art as well as the valuable interpretability for developers and experts. In particular, taking a clinical application dataset as a case study, our DPPred outperforms the baselines by using only 40 concise discriminative patterns out of a potentially exponentially large set of patterns.

摘要

在文献中,已经提出了两类模型来解决包括分类和回归在内的预测问题。简单模型,如广义线性模型,性能普通,但对一组简单特征具有很强的可解释性。另一类模型,包括基于树的模型,将数值、分类和高维特征组织成一个综合结构,数据中具有丰富的可解释信息。在本文中,我们提出了一种新颖的基于判别模式的预测框架(DPPred),通过利用其有效性和可解释性的优势来完成预测任务。具体而言,DPPred采用基于树的模型中从根节点到叶节点的前缀路径上的简洁判别模式。DPPred通过搜索最有效的模式组合来拟合广义线性模型,从而选择有限数量的有用判别模式。大量实验表明,在许多情况下,DPPred提供了与现有技术相当的准确率,同时为开发者和专家提供了有价值的可解释性。特别是,以一个临床应用数据集为例,我们的DPPred仅使用潜在的指数级大量模式中的40个简洁判别模式就优于基线。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/7e2b845e2658/nihms843146f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/6d7dd7e089b1/nihms843146f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/2171fcc3e72a/nihms843146f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/05290295a601/nihms843146f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/4df7e6ce5b52/nihms843146f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/3da834b6278f/nihms843146f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/7e2b845e2658/nihms843146f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/6d7dd7e089b1/nihms843146f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/2171fcc3e72a/nihms843146f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/05290295a601/nihms843146f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/4df7e6ce5b52/nihms843146f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/3da834b6278f/nihms843146f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f58/6368191/7e2b845e2658/nihms843146f6.jpg

相似文献

1
DPPred: An Effective Prediction Framework with Concise Discriminative Patterns.DPPred:一个具有简洁判别模式的有效预测框架。
IEEE Trans Knowl Data Eng. 2018 Jul;30(7):1226-1239. doi: 10.1109/TKDE.2017.2757476. Epub 2017 Sep 28.
2
DPClass: An Effective but Concise Discriminative Patterns-Based Classification Framework.DPClass:一种有效但简洁的基于判别模式的分类框架。
Proc SIAM Int Conf Data Min. 2016 May;2016:567-575. doi: 10.1137/1.9781611974348.64.
3
Building interpretable predictive models for pediatric hospital readmission using Tree-Lasso logistic regression.使用树套索逻辑回归构建用于儿科医院再入院的可解释预测模型。
Artif Intell Med. 2016 Sep;72:12-21. doi: 10.1016/j.artmed.2016.07.003. Epub 2016 Jul 29.
4
Describe, Spot and Explain: Interpretable Representation Learning for Discriminative Visual Reasoning.描述、发现与解释:用于判别式视觉推理的可解释表示学习。
IEEE Trans Image Process. 2023;32:2481-2492. doi: 10.1109/TIP.2023.3268001. Epub 2023 May 8.
5
Interpretable Decision Sets: A Joint Framework for Description and Prediction.可解释决策集:用于描述与预测的联合框架
KDD. 2016 Aug;2016:1675-1684. doi: 10.1145/2939672.2939874.
6
TNT: An Interpretable Tree-Network-Tree Learning Framework using Knowledge Distillation.TNT:一种使用知识蒸馏的可解释树-网络-树学习框架。
Entropy (Basel). 2020 Oct 24;22(11):1203. doi: 10.3390/e22111203.
7
Identification of clinically relevant features in hypertensive patients using penalized regression: a case study of cardiovascular events.使用惩罚回归识别高血压患者的临床相关特征:心血管事件的案例研究。
Med Biol Eng Comput. 2019 Sep;57(9):2011-2026. doi: 10.1007/s11517-019-02007-9. Epub 2019 Jul 25.
8
Weighted discriminative collaborative competitive representation for robust image classification.加权判别协同竞争表示在鲁棒图像分类中的应用。
Neural Netw. 2020 May;125:104-120. doi: 10.1016/j.neunet.2020.01.020. Epub 2020 Feb 10.
9
Topic Modeling for Interpretable Text Classification From EHRs.用于电子健康记录可解释文本分类的主题建模
Front Big Data. 2022 May 4;5:846930. doi: 10.3389/fdata.2022.846930. eCollection 2022.
10
Imbalanced target prediction with pattern discovery on clinical data repositories.基于临床数据存储库的模式发现进行不平衡目标预测。
BMC Med Inform Decis Mak. 2017 Apr 20;17(1):47. doi: 10.1186/s12911-017-0443-3.

本文引用的文献

1
Mining Discriminative Patterns to Predict Health Status for Cardiopulmonary Patients.挖掘判别模式以预测心肺疾病患者的健康状况。
ACM BCB. 2016 Oct;2016:41-49. doi: 10.1145/2975167.2975171.
2
Crowdsourced analysis of clinical trial data to predict amyotrophic lateral sclerosis progression.众包分析临床试验数据以预测肌萎缩侧索硬化症进展。
Nat Biotechnol. 2015 Jan;33(1):51-7. doi: 10.1038/nbt.3051. Epub 2014 Nov 2.
3
DROP: an SVM domain linker predictor trained with optimal features selected by random forest.DROP:一种使用随机森林选择的最佳特征训练的 SVM 域链接器预测器。
Bioinformatics. 2011 Feb 15;27(4):487-94. doi: 10.1093/bioinformatics/btq700. Epub 2010 Dec 17.
4
Placebo-controlled phase I/II studies of minocycline in amyotrophic lateral sclerosis.米诺环素治疗肌萎缩侧索硬化症的安慰剂对照I/II期研究。
Neurology. 2004 May 25;62(10):1845-7. doi: 10.1212/01.wnl.0000125321.92112.7e.
5
Minocycline inhibits cytochrome c release and delays progression of amyotrophic lateral sclerosis in mice.米诺环素可抑制细胞色素c的释放,并延缓小鼠肌萎缩侧索硬化症的进展。
Nature. 2002 May 2;417(6884):74-8. doi: 10.1038/417074a.
6
The spectrum kernel: a string kernel for SVM protein classification.谱核:一种用于支持向量机蛋白质分类的字符串核。
Pac Symp Biocomput. 2002:564-75.