• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用自动编码器作为深度神经网络的权重初始化方法进行疾病检测。

Using autoencoders as a weight initialization method on deep neural networks for disease detection.

机构信息

Faculty of Engineering, University of Porto, Rua Dr. Roberto Frias, s/n, Porto, 4200-465, Portugal.

INESC TEC - Institute for Systems and Computer Engineering, Technology and Science, Porto, Portugal.

出版信息

BMC Med Inform Decis Mak. 2020 Aug 20;20(Suppl 5):141. doi: 10.1186/s12911-020-01150-w.

DOI:10.1186/s12911-020-01150-w
PMID:32819347
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7439655/
Abstract

BACKGROUND

As of today, cancer is still one of the most prevalent and high-mortality diseases, summing more than 9 million deaths in 2018. This has motivated researchers to study the application of machine learning-based solutions for cancer detection to accelerate its diagnosis and help its prevention. Among several approaches, one is to automatically classify tumor samples through their gene expression analysis.

METHODS

In this work, we aim to distinguish five different types of cancer through RNA-Seq datasets: thyroid, skin, stomach, breast, and lung. To do so, we have adopted a previously described methodology, with which we compare the performance of 3 different autoencoders (AEs) used as a deep neural network weight initialization technique. Our experiments consist in assessing two different approaches when training the classification model - fixing the weights after pre-training the AEs, or allowing fine-tuning of the entire network - and two different strategies for embedding the AEs into the classification network, namely by only importing the encoding layers, or by inserting the complete AE. We then study how varying the number of layers in the first strategy, the AEs latent vector dimension, and the imputation technique in the data preprocessing step impacts the network's overall classification performance. Finally, with the goal of assessing how well does this pipeline generalize, we apply the same methodology to two additional datasets that include features extracted from images of malaria thin blood smears, and breast masses cell nuclei. We also discard the possibility of overfitting by using held-out test sets in the images datasets.

RESULTS

The methodology attained good overall results for both RNA-Seq and image extracted data. We outperformed the established baseline for all the considered datasets, achieving an average F score of 99.03, 89.95, and 98.84 and an MCC of 0.99, 0.84, and 0.98, for the RNA-Seq (when detecting thyroid cancer), the Malaria, and the Wisconsin Breast Cancer data, respectively.

CONCLUSIONS

We observed that the approach of fine-tuning the weights of the top layers imported from the AE reached higher results, for all the presented experiences, and all the considered datasets. We outperformed all the previous reported results when comparing to the established baselines.

摘要

背景

截至今日,癌症仍是最普遍且死亡率最高的疾病之一,仅 2018 年就有超过 900 万人死于癌症。这促使研究人员研究将基于机器学习的解决方案应用于癌症检测,以加速其诊断并帮助预防癌症。在几种方法中,一种是通过基因表达分析自动对肿瘤样本进行分类。

方法

在这项工作中,我们旨在通过 RNA-Seq 数据集区分五种不同类型的癌症:甲状腺、皮肤、胃、乳腺和肺。为此,我们采用了之前描述的方法,通过该方法比较了 3 种不同的自动编码器 (AE) 作为深度神经网络权重初始化技术的性能。我们的实验包括评估两种不同的方法来训练分类模型——在预训练 AE 后固定权重,或允许整个网络微调——以及两种将 AE 嵌入分类网络的不同策略,即仅导入编码层,或插入完整的 AE。然后,我们研究了在第一种策略中改变第一层的数量、AE 潜在向量的维度以及数据预处理步骤中的插补技术如何影响网络的整体分类性能。最后,为了评估该管道的泛化能力,我们将相同的方法应用于另外两个包含从疟疾薄血涂片图像和乳腺肿块细胞核中提取的特征的数据集。我们还通过在图像数据集中使用保留的测试集来排除过拟合的可能性。

结果

该方法在 RNA-Seq 和图像提取数据方面均取得了良好的整体结果。我们在所有考虑的数据集上都超过了既定的基线,分别获得了 99.03、89.95 和 98.84 的平均 F 分数和 0.99、0.84 和 0.98 的 MCC,用于 RNA-Seq(检测甲状腺癌)、疟疾和威斯康星州乳腺癌数据。

结论

我们观察到,在所有提出的经验和所有考虑的数据集上,微调从 AE 导入的顶层权重的方法都达到了更高的结果。与既定的基线相比,我们在所有比较中都超过了之前报道的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b61/7439655/9779365895d7/12911_2020_1150_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b61/7439655/df22c0134331/12911_2020_1150_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b61/7439655/3f882b812a1f/12911_2020_1150_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b61/7439655/dd57bf7c2243/12911_2020_1150_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b61/7439655/9779365895d7/12911_2020_1150_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b61/7439655/df22c0134331/12911_2020_1150_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b61/7439655/3f882b812a1f/12911_2020_1150_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b61/7439655/dd57bf7c2243/12911_2020_1150_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b61/7439655/9779365895d7/12911_2020_1150_Fig4_HTML.jpg

相似文献

1
Using autoencoders as a weight initialization method on deep neural networks for disease detection.使用自动编码器作为深度神经网络的权重初始化方法进行疾病检测。
BMC Med Inform Decis Mak. 2020 Aug 20;20(Suppl 5):141. doi: 10.1186/s12911-020-01150-w.
2
Deep Convolution Neural Network for Malignancy Detection and Classification in Microscopic Uterine Cervix Cell Images.用于子宫颈细胞显微图像中恶性肿瘤检测与分类的深度卷积神经网络
Asian Pac J Cancer Prev. 2019 Nov 1;20(11):3447-3456. doi: 10.31557/APJCP.2019.20.11.3447.
3
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用:以新生儿呼吸暂停预测为例的研究
Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.
4
Autoencoder and restricted Boltzmann machine for transfer learning in functional magnetic resonance imaging task classification.用于功能磁共振成像任务分类中迁移学习的自动编码器和受限玻尔兹曼机
Heliyon. 2023 Jul 16;9(7):e18086. doi: 10.1016/j.heliyon.2023.e18086. eCollection 2023 Jul.
5
Optimizing time prediction and error classification in early melanoma detection using a hybrid RCNN-LSTM model.使用混合 RCNN-LSTM 模型优化早期黑色素瘤检测中的时间预测和误差分类。
Microsc Res Tech. 2024 Aug;87(8):1789-1809. doi: 10.1002/jemt.24559. Epub 2024 Mar 22.
6
Deep Convolutional Neural Networks for breast cancer screening.深度学习卷积神经网络在乳腺癌筛查中的应用。
Comput Methods Programs Biomed. 2018 Apr;157:19-30. doi: 10.1016/j.cmpb.2018.01.011. Epub 2018 Jan 11.
7
DEGnext: classification of differentially expressed genes from RNA-seq data using a convolutional neural network with transfer learning.DEGnext:使用具有迁移学习的卷积神经网络对 RNA-seq 数据进行差异表达基因分类。
BMC Bioinformatics. 2022 Jan 6;23(1):17. doi: 10.1186/s12859-021-04527-4.
8
Recognition of words from brain-generated signals of speech-impaired people: Application of autoencoders as a neural Turing machine controller in deep neural networks.语音障碍人士脑生成信号的单词识别:自编码器作为神经图灵机控制器在深度神经网络中的应用。
Neural Netw. 2020 Jan;121:186-207. doi: 10.1016/j.neunet.2019.07.012. Epub 2019 Sep 9.
9
Brain tumor classification for MR images using transfer learning and fine-tuning.基于迁移学习和微调的磁共振图像脑肿瘤分类。
Comput Med Imaging Graph. 2019 Jul;75:34-46. doi: 10.1016/j.compmedimag.2019.05.001. Epub 2019 May 18.
10
Breath analysis based early gastric cancer classification from deep stacked sparse autoencoder neural network.基于深度堆叠稀疏自动编码器神经网络的呼吸分析早期胃癌分类。
Sci Rep. 2021 Feb 17;11(1):4014. doi: 10.1038/s41598-021-83184-2.

引用本文的文献

1
BCED-Net: Breast Cancer Ensemble Diagnosis Network using transfer learning and the XGBoost classifier with mammography images.BCED-Net:使用迁移学习和带有乳房X光图像的XGBoost分类器的乳腺癌综合诊断网络。
Osong Public Health Res Perspect. 2024 Oct;15(5):409-419. doi: 10.24171/j.phrp.2023.0361. Epub 2024 Sep 10.
2
Towards reliable hepatocytic anatomy segmentation in laparoscopic cholecystectomy using U-Net with Auto-Encoder.基于 U-Net 与自动编码器的腹腔镜胆囊切除术肝组织可靠分割。
Surg Endosc. 2023 Sep;37(9):7358-7369. doi: 10.1007/s00464-023-10306-4. Epub 2023 Jul 25.
3
Automated screening of potential organ donors using a temporal machine learning model.

本文引用的文献

1
Identification of 12 cancer types through genome deep learning.通过基因组深度学习鉴定 12 种癌症类型。
Sci Rep. 2019 Nov 21;9(1):17256. doi: 10.1038/s41598-019-53989-3.
2
Cancer classification of single-cell gene expression data by neural network.基于神经网络的单细胞基因表达数据的癌症分类。
Bioinformatics. 2020 Mar 1;36(5):1360-1366. doi: 10.1093/bioinformatics/btz772.
3
DeepCC: a novel deep learning-based framework for cancer molecular subtype classification.DeepCC:一种基于深度学习的新型癌症分子亚型分类框架。
使用时间机器学习模型对潜在器官捐献者进行自动筛查。
Sci Rep. 2023 May 25;13(1):8459. doi: 10.1038/s41598-023-35270-w.
4
Deep Learning for Robust Adaptive Inverse Control of Nonlinear Dynamic Systems: Improved Settling Time with an Autoencoder.深度学习在非线性动态系统鲁棒自适应逆控制中的应用:自动编码器改进了 Settling Time。
Sensors (Basel). 2022 Aug 9;22(16):5935. doi: 10.3390/s22165935.
5
Deep ConvNet: Non-Random Weight Initialization for Repeatable Determinism, Examined with FSGM.深度卷积网络:具有可重复性确定性的非随机权重初始化,用 FSGM 进行检验。
Sensors (Basel). 2021 Jul 13;21(14):4772. doi: 10.3390/s21144772.
Oncogenesis. 2019 Aug 16;8(9):44. doi: 10.1038/s41389-019-0157-8.
4
Single-cell RNA-seq denoising using a deep count autoencoder.基于深度计数自编码器的单细胞 RNA-seq 去噪。
Nat Commun. 2019 Jan 23;10(1):390. doi: 10.1038/s41467-018-07931-2.
5
A semi-supervised deep learning method based on stacked sparse auto-encoder for cancer prediction using RNA-seq data.基于堆叠稀疏自动编码器的半监督深度学习方法在 RNA-seq 数据癌症预测中的应用。
Comput Methods Programs Biomed. 2018 Nov;166:99-105. doi: 10.1016/j.cmpb.2018.10.004. Epub 2018 Oct 5.
6
A deep auto-encoder model for gene expression prediction.用于基因表达预测的深度自动编码器模型。
BMC Genomics. 2017 Nov 17;18(Suppl 9):845. doi: 10.1186/s12864-017-4226-0.
7
Mobile-Based Analysis of Malaria-Infected Thin Blood Smears: Automated Species and Life Cycle Stage Determination.基于移动设备的疟疾感染薄血涂片分析:自动鉴定物种和生活史阶段。
Sensors (Basel). 2017 Sep 21;17(10):2167. doi: 10.3390/s17102167.
8
RNA Sequencing and Analysis.RNA测序与分析。
Cold Spring Harb Protoc. 2015 Apr 13;2015(11):951-69. doi: 10.1101/pdb.top084970.
9
Machine learning applications in cancer prognosis and prediction.机器学习在癌症预后和预测中的应用。
Comput Struct Biotechnol J. 2014 Nov 15;13:8-17. doi: 10.1016/j.csbj.2014.11.005. eCollection 2015.
10
Comparison of the predicted and observed secondary structure of T4 phage lysozyme.T4噬菌体溶菌酶预测二级结构与观察到的二级结构的比较。
Biochim Biophys Acta. 1975 Oct 20;405(2):442-51. doi: 10.1016/0005-2795(75)90109-9.