Denoising Autoencoder, A Deep Learning Algorithm, Aids the Identification of A Novel Molecular Signature of Lung Adenocarcinoma.

Affiliations

Department of Thoracic Surgery, Jiangsu Province People's Hospital and the First Affiliated Hospital of Nanjing Medical University, Nanjing 210029, China.

State Key Laboratory of Bioelectronics, School of Biological Sciences and Medical Engineering, Southeast University, Nanjing 210096, China.

Publication information

Genomics Proteomics Bioinformatics. 2020 Aug;18(4):468-480. doi: 10.1016/j.gpb.2019.02.003. Epub 2020 Dec 18.

DOI:10.1016/j.gpb.2019.02.003

PMID:33346087

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8242334/

Abstract

Precise biomarker development is a key step in disease management. However, most published biomarkers were derived from relatively small numbers of samples using supervised approaches. Recent advances in unsupervised machine learning promise to leverage very large datasets to better predict disease biomarkers. The denoising autoencoder (DA) is an unsupervised deep learning algorithm and a stochastic version of the autoencoder. Its principle is to force the hidden layer of the autoencoder to capture more robust features by reconstructing a clean input from a corrupted one. Here, a DA model was applied to integrated transcriptomic data from 13 published lung cancer studies, comprising 1916 human lung tissue samples. Using DA, we discovered a multi-gene molecular signature for lung adenocarcinoma (ADC). In independent validation cohorts, the proposed signature proved to be an effective classifier of lung cancer histological subtypes. The signature also predicted clinical outcome in lung ADC independently of traditional prognostic factors. More importantly, it exhibited superior prognostic power compared with other published prognostic genes. Our study suggests that unsupervised learning is helpful for biomarker development in the era of precision medicine.
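The DA principle described in the abstract (corrupt the input, then reconstruct the clean version) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the synthetic data, the masking-noise corruption, the tied weights, the layer size, the learning rate, and the helper `top_features_for_node` are all assumptions chosen for illustration. Ranking features by absolute weight is only one common heuristic for reading a gene set off a hidden node, and the abstract does not say how the published signature was derived.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_denoising_autoencoder(X, n_hidden=8, corruption=0.2,
                                lr=0.5, epochs=200, seed=0):
    """Train a one-hidden-layer denoising autoencoder with tied weights.

    X is (n_samples, n_features) with values in [0, 1].
    All hyperparameters here are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    n_samples, n_features = X.shape
    W = rng.normal(0.0, 0.1, size=(n_features, n_hidden))
    b_h = np.zeros(n_hidden)
    b_o = np.zeros(n_features)
    losses = []
    for _ in range(epochs):
        # Corrupt the input: randomly zero out a fraction of the entries.
        mask = rng.random(X.shape) >= corruption
        X_noisy = X * mask
        # Encode the corrupted input, then decode with the transposed weights.
        H = sigmoid(X_noisy @ W + b_h)
        X_hat = sigmoid(H @ W.T + b_o)
        # Cross-entropy reconstruction loss against the CLEAN input.
        eps = 1e-9
        loss = -np.mean(np.sum(X * np.log(X_hat + eps)
                               + (1 - X) * np.log(1 - X_hat + eps), axis=1))
        losses.append(loss)
        # Backpropagation (sigmoid output + cross-entropy gives X_hat - X).
        d_out = (X_hat - X) / n_samples
        d_hid = (d_out @ W) * H * (1 - H)
        grad_W = X_noisy.T @ d_hid + d_out.T @ H  # encoder + decoder terms
        W -= lr * grad_W
        b_h -= lr * d_hid.sum(axis=0)
        b_o -= lr * d_out.sum(axis=0)
    return W, b_h, b_o, losses

def top_features_for_node(W, node, k=5):
    """Hypothetical helper: indices of the k features with largest |weight|."""
    return np.argsort(-np.abs(W[:, node]))[:k]

# Synthetic stand-in for an expression matrix: 200 samples x 20 features
# driven by 3 latent factors (made-up data, not from the study).
rng = np.random.default_rng(42)
latent = rng.random((200, 3))
loadings = rng.random((3, 20))
X = sigmoid(latent @ loadings - 2.0)

W, b_h, b_o, losses = train_denoising_autoencoder(X)
top = top_features_for_node(W, node=0)
print(f"loss: first epoch {losses[0]:.3f}, last epoch {losses[-1]:.3f}")
print("highest-weight features for hidden node 0:", top)
```

Because the loss compares the reconstruction with the uncorrupted `X` rather than with `X_noisy`, the hidden layer cannot simply copy its input, which is the robustness argument the abstract makes.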

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a18f/8242334/f85492cab437/gr1.jpg
