Optimal Sparse Regression Trees.

Authors

Zhang Rui, Xin Rui, Seltzer Margo, Rudin Cynthia

Affiliations

Duke University.

University of British Columbia.

Publication information

Proc AAAI Conf Artif Intell. 2023 Jun;37(9):11270-11279. doi: 10.1609/aaai.v37i9.26334.

DOI:10.1609/aaai.v37i9.26334
PMID:38650922
Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11034802/
Abstract

Regression trees are one of the oldest forms of AI models, and their predictions can be made without a calculator, which makes them broadly useful, particularly for high-stakes applications. Within the large literature on regression trees, there has been little effort towards full provable optimization, mainly due to the computational hardness of the problem. This work proposes a dynamic-programming-with-bounds approach to the construction of provably-optimal sparse regression trees. We leverage a novel lower bound based on an optimal solution to the k-Means clustering algorithm on one dimensional data. We are often able to find optimal sparse trees in seconds, even for challenging datasets that involve large numbers of samples and highly-correlated features.
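
The abstract mentions a lower bound built on an optimal solution to k-Means on one-dimensional data. The following is a minimal sketch of that subproblem, not the authors' implementation: a plain O(k·n²) dynamic program that returns the smallest possible within-cluster sum of squared errors for sorted 1D values. The second function shows one way such a value could bound a sparse-tree objective from below, under the assumption (not stated in the abstract) that the objective is mean squared error plus a per-leaf penalty `lam`; the names `kmeans_1d_sse`, `sparse_tree_lower_bound`, `lam`, and `max_leaves` are hypothetical.

```python
from typing import List


def kmeans_1d_sse(values: List[float], k: int) -> float:
    """Minimum within-cluster sum of squared errors for k clusters in 1D.

    In one dimension an optimal clustering consists of contiguous runs of
    the sorted values, so dynamic programming over split points is exact.
    """
    if k <= 0 or not values:
        raise ValueError("need k >= 1 and at least one value")
    xs = sorted(values)
    n = len(xs)
    k = min(k, n)  # more clusters than points cannot reduce the error further

    # Prefix sums of x and x^2 give the SSE of any run xs[i:j] in O(1).
    s1 = [0.0] * (n + 1)
    s2 = [0.0] * (n + 1)
    for i, x in enumerate(xs):
        s1[i + 1] = s1[i] + x
        s2[i + 1] = s2[i] + x * x

    def run_sse(i: int, j: int) -> float:
        """SSE of xs[i:j] around its own mean (requires j > i)."""
        total = s1[j] - s1[i]
        return max(0.0, (s2[j] - s2[i]) - total * total / (j - i))

    inf = float("inf")
    # best[j] = minimum SSE for the first j points using c clusters.
    best = [0.0] + [inf] * n  # c = 0: only the empty prefix is feasible
    for c in range(1, k + 1):
        nxt = [inf] * (n + 1)
        for j in range(c, n + 1):
            nxt[j] = min(best[i] + run_sse(i, j) for i in range(c - 1, j))
        best = nxt
    return best[n]


def sparse_tree_lower_bound(labels: List[float], lam: float, max_leaves: int) -> float:
    """Assumed objective: mean squared error + lam * (number of leaves).

    A tree with k leaves splits the samples into k groups, each predicted by
    its mean, so its squared error is at least the optimal 1D k-Means SSE of
    the labels. Minimizing over k gives a bound that ignores the features.
    """
    n = len(labels)
    return min(
        kmeans_1d_sse(labels, k) / n + lam * k
        for k in range(1, min(max_leaves, n) + 1)
    )
```

Because the bound ignores which splits the features actually permit, it can be loose, but a search procedure can use it to discard subproblems whose bound already exceeds the best objective found so far.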

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e43e/11034802/ff86f5d72480/nihms-1982232-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e43e/11034802/0ebe75e96054/nihms-1982232-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e43e/11034802/fdaf419a835a/nihms-1982232-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e43e/11034802/3586240636a4/nihms-1982232-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e43e/11034802/e20611917fa7/nihms-1982232-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e43e/11034802/c04cae541ec4/nihms-1982232-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e43e/11034802/999f5b3715fb/nihms-1982232-f0007.jpg

Similar articles

1. Optimal Sparse Regression Trees.
   Proc AAAI Conf Artif Intell. 2023 Jun;37(9):11270-11279. doi: 10.1609/aaai.v37i9.26334.
2. Optimal Sparse Survival Trees.
   Proc Mach Learn Res. 2024 May;238:352-360.
3. Fast Sparse Decision Tree Optimization via Reference Ensembles.
   Proc AAAI Conf Artif Intell. 2022;36(9):9604-9613. doi: 10.1609/aaai.v36i9.21194. Epub 2022 Jun 28.
4. Fast Optimization of Weighted Sparse Decision Trees for use in Optimal Treatment Regimes and Optimal Policy Design.
   CEUR Workshop Proc. 2022 Oct;3318.
5. SIESTA: enhancing searches for optimal supertrees and species trees.
   BMC Genomics. 2018 May 8;19(Suppl 5):252. doi: 10.1186/s12864-018-4621-1.
6. BWM*: A Novel, Provable, Ensemble-based Dynamic Programming Algorithm for Sparse Approximations of Computational Protein Design.
   J Comput Biol. 2016 Jun;23(6):413-24. doi: 10.1089/cmb.2015.0194. Epub 2016 Jan 8.
7. Minimization-Aware Recursive K*: A Novel, Provable Algorithm that Accelerates Ensemble-Based Protein Design and Provably Approximates the Energy Landscape.
   J Comput Biol. 2020 Apr;27(4):550-564. doi: 10.1089/cmb.2019.0315. Epub 2019 Dec 6.
8. Detecting Meaningful Clusters From High-Dimensional Data: A Strongly Consistent Sparse Center-Based Clustering Approach.
   IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):2894-2908. doi: 10.1109/TPAMI.2020.3047489. Epub 2022 May 5.
9. Provable randomized rounding for minimum-similarity diversification.
   Data Min Knowl Discov. 2022;36(2):709-738. doi: 10.1007/s10618-021-00811-2. Epub 2022 Jan 4.
10. A mixed integer linear programming model to reconstruct phylogenies from single nucleotide polymorphism haplotypes under the maximum parsimony criterion.
    Algorithms Mol Biol. 2013 Jan 23;8(1):3. doi: 10.1186/1748-7188-8-3.

Cited by

1. Predicting Affinity Through Homology (PATH): Interpretable binding affinity prediction with persistent homology.
   PLoS Comput Biol. 2025 Jun 27;21(6):e1013216. doi: 10.1371/journal.pcbi.1013216. eCollection 2025 Jun.
2. Optimal Sparse Survival Trees.
   Proc Mach Learn Res. 2024 May;238:352-360.
