• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深山老林。

Deep forest.

作者信息

Zhou Zhi-Hua, Feng Ji

机构信息

National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China.

出版信息

Natl Sci Rev. 2019 Jan;6(1):74-86. doi: 10.1093/nsr/nwy108. Epub 2018 Oct 8.

DOI:10.1093/nsr/nwy108
PMID:34691833
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8291612/
Abstract

Current deep-learning models are mostly built upon neural networks, i.e. multiple layers of parameterized differentiable non-linear modules that can be trained by backpropagation. In this paper, we explore the possibility of building deep models based on non-differentiable modules such as decision trees. After a discussion about the mystery behind deep neural networks, particularly by contrasting them with shallow neural networks and traditional machine-learning techniques such as decision trees and boosting machines, we conjecture that the success of deep neural networks owes much to three characteristics, i.e. layer-by-layer processing, in-model feature transformation and sufficient model complexity. On one hand, our conjecture may offer inspiration for theoretical understanding of deep learning; on the other hand, to verify the conjecture, we propose an approach that generates deep forest holding these characteristics. This is a decision-tree ensemble approach, with fewer hyper-parameters than deep neural networks, and its model complexity can be automatically determined in a data-dependent way. Experiments show that its performance is quite robust to hyper-parameter settings, such that in most cases, even across different data from different domains, it is able to achieve excellent performance by using the same default setting. This study opens the door to deep learning based on non-differentiable modules without gradient-based adjustment, and exhibits the possibility of constructing deep models without backpropagation.

摘要

当前的深度学习模型大多基于神经网络构建,即由多层可参数化的可微非线性模块组成,这些模块可通过反向传播进行训练。在本文中,我们探索基于诸如决策树等不可微模块构建深度模型的可能性。在讨论了深度神经网络背后的奥秘之后,特别是通过将它们与浅层神经网络以及诸如决策树和提升机器等传统机器学习技术进行对比,我们推测深度神经网络的成功很大程度上归功于三个特征,即逐层处理、模型内特征变换和足够的模型复杂性。一方面,我们的推测可能为深度学习的理论理解提供启示;另一方面,为了验证这一推测,我们提出了一种生成具有这些特征的深度森林的方法。这是一种决策树集成方法,其超参数比深度神经网络少,并且其模型复杂性可以以数据依赖的方式自动确定。实验表明,其性能对超参数设置相当稳健,以至于在大多数情况下,即使是来自不同领域的不同数据,通过使用相同的默认设置也能够实现优异的性能。这项研究为基于不可微模块且无需基于梯度调整的深度学习打开了大门,并展示了无需反向传播构建深度模型的可能性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/111def4817c8/nwy108fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/a2a1cb88b35b/nwy108fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/e71171b866a0/nwy108fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/6e0fb30980f4/nwy108fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/dea8e3399ad8/nwy108fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/9c3f1c26a498/nwy108fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/df095779e936/nwy108fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/28aff62604f0/nwy108fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/111def4817c8/nwy108fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/a2a1cb88b35b/nwy108fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/e71171b866a0/nwy108fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/6e0fb30980f4/nwy108fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/dea8e3399ad8/nwy108fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/9c3f1c26a498/nwy108fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/df095779e936/nwy108fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/28aff62604f0/nwy108fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2ee/8291612/111def4817c8/nwy108fig8.jpg

相似文献

1
Deep forest.深山老林。
Natl Sci Rev. 2019 Jan;6(1):74-86. doi: 10.1093/nsr/nwy108. Epub 2018 Oct 8.
2
Sparse Coding Driven Deep Decision Tree Ensembles for Nucleus Segmentation in Digital Pathology Images.稀疏编码驱动的深度决策树集成算法在数字病理图像中的细胞核分割。
IEEE Trans Image Process. 2021;30:8088-8101. doi: 10.1109/TIP.2021.3112057. Epub 2021 Sep 27.
3
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用:以新生儿呼吸暂停预测为例的研究
Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.
4
DANTE: Deep alternations for training neural networks.DANTE:深度神经网络训练的交替法。
Neural Netw. 2020 Nov;131:127-143. doi: 10.1016/j.neunet.2020.07.026. Epub 2020 Jul 28.
5
Deep-learning: investigating deep neural networks hyper-parameters and comparison of performance to shallow methods for modeling bioactivity data.深度学习:研究深度神经网络超参数以及将其性能与用于生物活性数据建模的浅层方法进行比较。
J Cheminform. 2017 Jun 28;9(1):42. doi: 10.1186/s13321-017-0226-y.
6
BCDForest: a boosting cascade deep forest model towards the classification of cancer subtypes based on gene expression data.BCDForest:基于基因表达数据的癌症亚型分类的提升级联深森林模型。
BMC Bioinformatics. 2018 Apr 11;19(Suppl 5):118. doi: 10.1186/s12859-018-2095-4.
7
Biologically plausible deep learning - But how far can we go with shallow networks?生物学上合理的深度学习——但我们可以在浅层网络中走多远?
Neural Netw. 2019 Oct;118:90-101. doi: 10.1016/j.neunet.2019.06.001. Epub 2019 Jun 20.
8
A Deep Neural Network-Based Model for Quantitative Evaluation of the Effects of Swimming Training.基于深度神经网络的游泳训练效果定量评估模型
Comput Intell Neurosci. 2022 Sep 30;2022:5508365. doi: 10.1155/2022/5508365. eCollection 2022.
9
Applying Deep Neural Networks and Ensemble Machine Learning Methods to Forecast Airborne Pollen.应用深度神经网络和集成机器学习方法预测气传花粉。
Int J Environ Res Public Health. 2019 Jun 4;16(11):1992. doi: 10.3390/ijerph16111992.
10
Deep convolutional forest: a dynamic deep ensemble approach for spam detection in text.深度卷积森林:一种用于文本中垃圾邮件检测的动态深度集成方法。
Complex Intell Systems. 2022;8(6):4897-4909. doi: 10.1007/s40747-022-00741-6. Epub 2022 Apr 26.

引用本文的文献

1
Enhancing noninvasive pancreatic cystic neoplasm diagnosis with multimodal machine learning.利用多模态机器学习增强非侵入性胰腺囊性肿瘤诊断
Sci Rep. 2025 May 12;15(1):16398. doi: 10.1038/s41598-025-01502-4.
2
Patent value prediction in biomedical textiles: A method based on a fusion of machine learning models.生物医学纺织品中的专利价值预测:一种基于机器学习模型融合的方法。
PLoS One. 2025 Apr 24;20(4):e0322182. doi: 10.1371/journal.pone.0322182. eCollection 2025.
3
Deep cross entropy fusion for pulmonary nodule classification based on ultrasound Imagery.

本文引用的文献

1
Active Learning by Querying Informative and Representative Examples.主动学习通过查询信息丰富且具有代表性的示例。
IEEE Trans Pattern Anal Mach Intell. 2014 Oct;36(10):1936-49. doi: 10.1109/TPAMI.2014.2307881.
2
Improving EMG based classification of basic hand movements using EMD.使用经验模态分解(EMD)改进基于肌电图(EMG)的基本手部动作分类
Annu Int Conf IEEE Eng Med Biol Soc. 2013;2013:5754-7. doi: 10.1109/EMBC.2013.6610858.
3
A fast learning algorithm for deep belief nets.一种用于深度信念网络的快速学习算法。
基于超声图像的肺部结节分类深度交叉熵融合
Front Oncol. 2025 Apr 4;15:1514779. doi: 10.3389/fonc.2025.1514779. eCollection 2025.
4
DeepMethyGene: a deep-learning model to predict gene expression using DNA methylations.深度甲基化基因:一种利用DNA甲基化预测基因表达的深度学习模型。
BMC Bioinformatics. 2025 Apr 8;26(1):99. doi: 10.1186/s12859-025-06115-2.
5
Utilizing machine learning algorithms for predicting Anxiety-Depression Comorbidity Syndrome in Gastroenterology Inpatients (ADCS-GI).利用机器学习算法预测胃肠病住院患者的焦虑-抑郁共病综合征(ADCS-GI)。
BMC Psychiatry. 2025 Mar 18;25(1):253. doi: 10.1186/s12888-025-06666-x.
6
An optimized deep-forest algorithm using a modified differential evolution optimization algorithm: A case of host-pathogen protein-protein interaction prediction.一种使用改进差分进化优化算法的优化深度森林算法:宿主-病原体蛋白质-蛋白质相互作用预测实例
Comput Struct Biotechnol J. 2025 Jan 26;27:595-611. doi: 10.1016/j.csbj.2025.01.020. eCollection 2025.
7
Estimation of leaf area index by combining multi-source remote sensing data and machine learning optimization model.结合多源遥感数据与机器学习优化模型估算叶面积指数
Front Plant Sci. 2025 Jan 15;15:1505414. doi: 10.3389/fpls.2024.1505414. eCollection 2024.
8
Substantially underestimated global health risks of current ozone pollution.当前臭氧污染对全球健康的风险被严重低估。
Nat Commun. 2025 Jan 2;16(1):102. doi: 10.1038/s41467-024-55450-0.
9
abnization platform: A monoclonal antibody heterologization server based on rational design and artificial intelligence-assisted computation.异常化平台:一种基于理性设计和人工智能辅助计算的单克隆抗体异源化服务器。
Comput Struct Biotechnol J. 2024 Aug 22;23:3222-3231. doi: 10.1016/j.csbj.2024.08.013. eCollection 2024 Dec.
10
Full-coverage estimation of CO concentrations in China via multisource satellite data and Deep Forest model.基于多源卫星数据和深度森林模型的中国一氧化碳浓度全范围估计
Sci Data. 2024 Nov 14;11(1):1231. doi: 10.1038/s41597-024-04063-9.
Neural Comput. 2006 Jul;18(7):1527-54. doi: 10.1162/neco.2006.18.7.1527.