• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Random Bits Forest: a Strong Classifier/Regressor for Big Data.

作者信息

Wang Yi, Li Yi, Pu Weilin, Wen Kathryn, Shugart Yin Yao, Xiong Momiao, Jin Li

机构信息

Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200433, China.

Unit on Statistical Genomics, Division of Intramural Division Programs, National Institute of Mental Health, National Institutes of Health, Bethesda, MD, USA.

出版信息

Sci Rep. 2016 Jul 22;6:30086. doi: 10.1038/srep30086.

DOI:10.1038/srep30086
PMID:27444562
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4957112/
Abstract

Efficiency, memory consumption, and robustness are common problems with many popular methods for data analysis. As a solution, we present Random Bits Forest (RBF), a classification and regression algorithm that integrates neural networks (for depth), boosting (for width), and random forests (for prediction accuracy). Through a gradient boosting scheme, it first generates and selects ~10,000 small, 3-layer random neural networks. These networks are then fed into a modified random forest algorithm to obtain predictions. Testing with datasets from the UCI (University of California, Irvine) Machine Learning Repository shows that RBF outperforms other popular methods in both accuracy and robustness, especially with large datasets (N > 1000). The algorithm also performed highly in testing with an independent data set, a real psoriasis genome-wide association study (GWAS).

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df3c/4957112/b70f81e15ed3/srep30086-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df3c/4957112/624eacee84ee/srep30086-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df3c/4957112/160cb10fc76d/srep30086-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df3c/4957112/f8660838f89f/srep30086-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df3c/4957112/b70f81e15ed3/srep30086-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df3c/4957112/624eacee84ee/srep30086-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df3c/4957112/160cb10fc76d/srep30086-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df3c/4957112/f8660838f89f/srep30086-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df3c/4957112/b70f81e15ed3/srep30086-f4.jpg

相似文献

1
Random Bits Forest: a Strong Classifier/Regressor for Big Data.
Sci Rep. 2016 Jul 22;6:30086. doi: 10.1038/srep30086.
2
An extensive experimental survey of regression methods.回归方法的广泛实验调查。
Neural Netw. 2019 Mar;111:11-34. doi: 10.1016/j.neunet.2018.12.010. Epub 2018 Dec 21.
3
Performance Evaluation of Data-driven Intelligent Algorithms for Big data Ecosystem.大数据生态系统中数据驱动智能算法的性能评估
Wirel Pers Commun. 2022;126(3):2403-2423. doi: 10.1007/s11277-021-09362-7. Epub 2022 Aug 23.
4
Knowledge mining from clinical datasets using rough sets and backpropagation neural network.使用粗糙集和反向传播神经网络从临床数据集中进行知识挖掘。
Comput Math Methods Med. 2015;2015:460189. doi: 10.1155/2015/460189. Epub 2015 Mar 4.
5
Accurate Segmentation of CT Male Pelvic Organs via Regression-Based Deformable Models and Multi-Task Random Forests.通过基于回归的可变形模型和多任务随机森林实现CT男性盆腔器官的精确分割
IEEE Trans Med Imaging. 2016 Jun;35(6):1532-43. doi: 10.1109/TMI.2016.2519264. Epub 2016 Jan 18.
6
Boosting random subspace method.增强随机子空间法
Neural Netw. 2008 Nov;21(9):1344-62. doi: 10.1016/j.neunet.2007.12.046. Epub 2008 Jan 6.
7
Sensitivity versus accuracy in multiclass problems using memetic Pareto evolutionary neural networks.使用混合帕累托进化神经网络处理多类问题时的灵敏度与准确性
IEEE Trans Neural Netw. 2010 May;21(5):750-70. doi: 10.1109/TNN.2010.2041468. Epub 2010 Mar 11.
8
Predicting runtimes of bioinformatics tools based on historical data: five years of Galaxy usage.基于历史数据预测生物信息学工具的运行时间:Galaxy 使用的五年。
Bioinformatics. 2019 Sep 15;35(18):3453-3460. doi: 10.1093/bioinformatics/btz054.
9
Data classification with radial basis function networks based on a novel kernel density estimation algorithm.基于一种新型核密度估计算法的径向基函数网络数据分类
IEEE Trans Neural Netw. 2005 Jan;16(1):225-36. doi: 10.1109/TNN.2004.836229.
10
A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.一种使用域转移深度卷积神经网络的新型端到端生物医学图像分类器。
Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.

引用本文的文献

1
Prevalence of malnutrition and associated factors in Chinese children and adolescents aged 3-14 years using machine learning algorithms.使用机器学习算法评估中国3至14岁儿童和青少年的营养不良患病率及相关因素
J Glob Health. 2025 Jul 21;15:04204. doi: 10.7189/jogh.15.04204.
2
RCE-IFE: recursive cluster elimination with intra-cluster feature elimination.RCE-IFE:带簇内特征消除的递归簇消除
PeerJ Comput Sci. 2025 Feb 7;11:e2528. doi: 10.7717/peerj-cs.2528. eCollection 2025.
3
Machine learning predicts the short-term requirement for invasive ventilation among Australian critically ill COVID-19 patients.

本文引用的文献

1
Gradient boosting machines, a tutorial.梯度提升机,教程。
Front Neurorobot. 2013 Dec 4;7:21. doi: 10.3389/fnbot.2013.00021. eCollection 2013.
2
Representation learning: a review and new perspectives.表示学习:综述与新视角。
IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1798-828. doi: 10.1109/TPAMI.2013.50.
3
Quantitative structure-activity relationship models for ready biodegradability of chemicals.化学品好氧生物降解性的定量构效关系模型。
机器学习预测澳大利亚危重症 COVID-19 患者短期需要有创通气。
PLoS One. 2022 Oct 26;17(10):e0276509. doi: 10.1371/journal.pone.0276509. eCollection 2022.
4
Feature Selection and Feature Stability Measurement Method for High-Dimensional Small Sample Data Based on Big Data Technology.基于大数据技术的高维小样本数据特征选择与特征稳定性测量方法。
Comput Intell Neurosci. 2021 Sep 23;2021:3597051. doi: 10.1155/2021/3597051. eCollection 2021.
5
A multi-hazard map-based flooding, gully erosion, forest fires, and earthquakes in Iran.伊朗多灾害地图:洪水、沟蚀、森林火灾和地震。
Sci Rep. 2021 Jul 21;11(1):14889. doi: 10.1038/s41598-021-94266-6.
6
A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases.人工智能和机器学习在自身免疫性疾病中的应用的系统评价。
NPJ Digit Med. 2020 Mar 9;3:30. doi: 10.1038/s41746-020-0229-3. eCollection 2020.
7
Nuclear Norm Clustering: a promising alternative method for clustering tasks.核范聚类:聚类任务的一种很有前途的替代方法。
Sci Rep. 2018 Jul 18;8(1):10873. doi: 10.1038/s41598-018-29246-4.
J Chem Inf Model. 2013 Apr 22;53(4):867-78. doi: 10.1021/ci4000213. Epub 2013 Mar 27.
4
Basic statistical analysis in genetic case-control studies.遗传病例对照研究中的基本统计学分析。
Nat Protoc. 2011 Feb;6(2):121-33. doi: 10.1038/nprot.2010.182. Epub 2011 Feb 3.
5
Psoriasis prediction from genome-wide SNP profiles.基于全基因组单核苷酸多态性(SNP)图谱预测银屑病
BMC Dermatol. 2011 Jan 7;11:1. doi: 10.1186/1471-5945-11-1.
6
Accurate telemonitoring of Parkinson's disease progression by noninvasive speech tests.通过非侵入性语音测试准确监测帕金森病的进展。
IEEE Trans Biomed Eng. 2010 Apr;57(4):884-93. doi: 10.1109/TBME.2009.2036000. Epub 2009 Nov 20.
7
Sequence and haplotype analysis supports HLA-C as the psoriasis susceptibility 1 gene.序列和单倍型分析支持HLA - C作为银屑病1型易感基因。
Am J Hum Genet. 2006 May;78(5):827-851. doi: 10.1086/503821. Epub 2006 Mar 31.
8
Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes.使用人工神经网络与逻辑回归预测医学结果的优缺点。
J Clin Epidemiol. 1996 Nov;49(11):1225-31. doi: 10.1016/s0895-4356(96)00002-9.
9
Oliguric acute renal failure in malignant hypertension.恶性高血压所致少尿型急性肾衰竭
Am J Med. 1972 Feb;52(2):187-97. doi: 10.1016/0002-9343(72)90068-x.
10
Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach.比较两条或多条相关的受试者工作特征曲线下的面积:一种非参数方法。
Biometrics. 1988 Sep;44(3):837-45.