• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

注入噪声的神经网络在小样本表达数据的应用中显示出前景。

Noise-injected neural networks show promise for use on small-sample expression data.

作者信息

Hua Jianping, Lowey James, Xiong Zixiang, Dougherty Edward R

机构信息

Computational Biology Division, Translational Genomics Research Institute, Phoenix, USA.

出版信息

BMC Bioinformatics. 2006 May 31;7:274. doi: 10.1186/1471-2105-7-274.

DOI:10.1186/1471-2105-7-274
PMID:16737545
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1524820/
Abstract

BACKGROUND

Overfitting the data is a salient issue for classifier design in small-sample settings. This is why selecting a classifier from a constrained family of classifiers, ones that do not possess the potential to too finely partition the feature space, is typically preferable. But overfitting is not merely a consequence of the classifier family; it is highly dependent on the classification rule used to design a classifier from the sample data. Thus, it is possible to consider families that are rather complex but for which there are classification rules that perform well for small samples. Such classification rules can be advantageous because they facilitate satisfactory classification when the class-conditional distributions are not easily separated and the sample is not large. Here we consider neural networks, from the perspectives of classical design based solely on the sample data and from noise-injection-based design.

RESULTS

This paper provides an extensive simulation-based comparative study of noise-injected neural-network design. It considers a number of different feature-label models across various small sample sizes using varying amounts of noise injection. Besides comparing noise-injected neural-network design to classical neural-network design, the paper compares it to a number of other classification rules. Our particular interest is with the use of microarray data for expression-based classification for diagnosis and prognosis. To that end, we consider noise-injected neural-network design as it relates to a study of survivability of breast cancer patients.

CONCLUSION

The conclusion is that in many instances noise-injected neural network design is superior to the other tested methods, and in almost all cases it does not perform substantially worse than the best of the other methods. Since the amount of noise injected is consequential, the effect of differing amounts of injected noise must be considered.

摘要

背景

在小样本情况下,数据过拟合是分类器设计中的一个突出问题。这就是为什么通常更倾向于从受约束的分类器族中选择分类器,这些分类器没有过度精细划分特征空间的可能性。但过拟合不仅仅是分类器族的结果;它高度依赖于用于从样本数据设计分类器的分类规则。因此,有可能考虑一些相当复杂的分类器族,但存在适用于小样本的分类规则。这样的分类规则可能是有利的,因为当类条件分布不容易分离且样本量不大时,它们有助于实现令人满意的分类。在这里,我们从仅基于样本数据的经典设计以及基于噪声注入的设计这两个角度来考虑神经网络。

结果

本文提供了一项基于广泛模拟的噪声注入神经网络设计的比较研究。它考虑了各种小样本量下的多种不同特征 - 标签模型,并使用了不同量的噪声注入。除了将噪声注入神经网络设计与经典神经网络设计进行比较外,本文还将其与其他一些分类规则进行了比较。我们特别感兴趣的是使用微阵列数据进行基于表达的诊断和预后分类。为此,我们考虑噪声注入神经网络设计与乳腺癌患者生存能力研究的关系。

结论

结论是,在许多情况下,噪声注入神经网络设计优于其他测试方法,并且在几乎所有情况下,其表现都不会比其他最佳方法差太多。由于注入的噪声量很重要,所以必须考虑不同注入噪声量的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/6ba900137bb2/1471-2105-7-274-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/a5a55a44b5fa/1471-2105-7-274-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/852a22bc736f/1471-2105-7-274-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/dca219a24a98/1471-2105-7-274-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/4c1c0b594f0f/1471-2105-7-274-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/ee646ed1392b/1471-2105-7-274-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/53364750d584/1471-2105-7-274-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/344b12e7a6a6/1471-2105-7-274-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/09ca4b5f6f66/1471-2105-7-274-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/6ba900137bb2/1471-2105-7-274-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/a5a55a44b5fa/1471-2105-7-274-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/852a22bc736f/1471-2105-7-274-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/dca219a24a98/1471-2105-7-274-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/4c1c0b594f0f/1471-2105-7-274-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/ee646ed1392b/1471-2105-7-274-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/53364750d584/1471-2105-7-274-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/344b12e7a6a6/1471-2105-7-274-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/09ca4b5f6f66/1471-2105-7-274-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a0d/1524820/6ba900137bb2/1471-2105-7-274-9.jpg

相似文献

1
Noise-injected neural networks show promise for use on small-sample expression data.注入噪声的神经网络在小样本表达数据的应用中显示出前景。
BMC Bioinformatics. 2006 May 31;7:274. doi: 10.1186/1471-2105-7-274.
2
Optimal number of features as a function of sample size for various classification rules.针对各种分类规则,作为样本大小函数的最优特征数量。
Bioinformatics. 2005 Apr 15;21(8):1509-15. doi: 10.1093/bioinformatics/bti171. Epub 2004 Nov 30.
3
Classifier design for computer-aided diagnosis: effects of finite sample size on the mean performance of classical and neural network classifiers.用于计算机辅助诊断的分类器设计:有限样本量对经典分类器和神经网络分类器平均性能的影响。
Med Phys. 1999 Dec;26(12):2654-68. doi: 10.1118/1.598805.
4
On the statistical assessment of classifiers using DNA microarray data.关于使用DNA微阵列数据对分类器进行统计评估
BMC Bioinformatics. 2006 Aug 19;7:387. doi: 10.1186/1471-2105-7-387.
5
Using rule-based machine learning for candidate disease gene prioritization and sample classification of cancer gene expression data.基于规则的机器学习在候选疾病基因优先级和癌症基因表达数据样本分类中的应用。
PLoS One. 2012;7(7):e39932. doi: 10.1371/journal.pone.0039932. Epub 2012 Jul 11.
6
Feature selection and nearest centroid classification for protein mass spectrometry.蛋白质质谱的特征选择与最近质心分类
BMC Bioinformatics. 2005 Mar 23;6:68. doi: 10.1186/1471-2105-6-68.
7
Breast cancer tumor type recognition using graph feature selection technique and radial basis function neural network with optimal structure.基于图特征选择技术和具有最优结构的径向基函数神经网络的乳腺癌肿瘤类型识别
J Cancer Res Ther. 2018 Apr-Jun;14(3):625-633. doi: 10.4103/0973-1482.183561.
8
Kernel-imbedded Gaussian processes for disease classification using microarray gene expression data.使用微阵列基因表达数据的用于疾病分类的核嵌入高斯过程。
BMC Bioinformatics. 2007 Feb 28;8:67. doi: 10.1186/1471-2105-8-67.
9
Is cross-validation better than resubstitution for ranking genes?在对基因进行排名时,交叉验证是否比重替代法更好?
Bioinformatics. 2004 Jan 22;20(2):253-8. doi: 10.1093/bioinformatics/btg399.
10
Systematic benchmarking of microarray data classification: assessing the role of non-linearity and dimensionality reduction.微阵列数据分类的系统基准测试:评估非线性和降维的作用。
Bioinformatics. 2004 Nov 22;20(17):3185-95. doi: 10.1093/bioinformatics/bth383. Epub 2004 Jul 1.

引用本文的文献

1
Deep Learning and Machine Learning with Grid Search to Predict Later Occurrence of Breast Cancer Metastasis Using Clinical Data.利用临床数据,通过深度学习和带网格搜索的机器学习预测乳腺癌转移的后期发生情况。
J Clin Med. 2022 Sep 29;11(19):5772. doi: 10.3390/jcm11195772.
2
A Machine Learning Application to Predict Early Lung Involvement in Scleroderma: A Feasibility Evaluation.一种用于预测硬皮病早期肺部受累的机器学习应用:可行性评估。
Diagnostics (Basel). 2021 Oct 12;11(10):1880. doi: 10.3390/diagnostics11101880.
3
A novel artificial neural network method for biomedical prediction based on matrix pseudo-inversion.

本文引用的文献

1
Small sample issues for microarray-based classification.基于微阵列分类的小样本问题。
Comp Funct Genomics. 2001;2(1):28-34. doi: 10.1002/cfg.62.
2
Using additive noise in back-propagation training.在反向传播训练中使用加性噪声。
IEEE Trans Neural Netw. 1992;3(1):24-38. doi: 10.1109/72.105415.
3
Similarities of error regularization, sigmoid gain scaling, target smoothing, and training with jitter.误差正则化、Sigmoid增益缩放、目标平滑以及带抖动训练的相似性。
基于矩阵伪逆的生物医学预测新型人工神经网络方法。
J Biomed Inform. 2014 Apr;48:114-21. doi: 10.1016/j.jbi.2013.12.009. Epub 2013 Dec 18.
4
Comparison of the levels of accuracy of an artificial neural network model and a logistic regression model for the diagnosis of acute appendicitis.人工神经网络模型与逻辑回归模型在急性阑尾炎诊断中的准确性水平比较。
J Med Syst. 2007 Oct;31(5):357-64. doi: 10.1007/s10916-007-9077-9.
IEEE Trans Neural Netw. 1995;6(3):529-38. doi: 10.1109/72.377960.
4
Anisotropic noise injection for input variables relevance determination.用于输入变量相关性确定的各向异性噪声注入。
IEEE Trans Neural Netw. 2000;11(6):1201-12. doi: 10.1109/72.883393.
5
k-nearest neighbors directed noise injection in multilayer perceptron training.
IEEE Trans Neural Netw. 2000;11(2):504-11. doi: 10.1109/72.839019.
6
Optimal number of features as a function of sample size for various classification rules.针对各种分类规则,作为样本大小函数的最优特征数量。
Bioinformatics. 2005 Apr 15;21(8):1509-15. doi: 10.1093/bioinformatics/bti171. Epub 2004 Nov 30.
7
Is cross-validation valid for small-sample microarray classification?交叉验证对小样本微阵列分类是否有效?
Bioinformatics. 2004 Feb 12;20(3):374-80. doi: 10.1093/bioinformatics/btg419.
8
A gene-expression signature as a predictor of survival in breast cancer.一种作为乳腺癌生存预测指标的基因表达特征。
N Engl J Med. 2002 Dec 19;347(25):1999-2009. doi: 10.1056/NEJMoa021967.
9
Strong feature sets from small samples.来自小样本的强大特征集。
J Comput Biol. 2002;9(1):127-46. doi: 10.1089/10665270252833226.
10
Gene expression profiling predicts clinical outcome of breast cancer.基因表达谱分析可预测乳腺癌的临床预后。
Nature. 2002 Jan 31;415(6871):530-6. doi: 10.1038/415530a.