基于拷贝数变异和染色质 3D 结构的癌症类型预测的卷积神经网络方法。

Cancer type prediction based on copy number aberration and chromatin 3D structure with convolutional neural networks.

机构信息

Key Laboratory of Systems Biomedicine, Shanghai Center for Systems Biomedicine, Shanghai Jiaotong University, Shanghai, 200240, China.

School of Information Technologies, University of Sydney, Sydney, NSW, 2006, Australia.

出版信息

BMC Genomics. 2018 Aug 13;19(Suppl 6):565. doi: 10.1186/s12864-018-4919-z.

DOI:10.1186/s12864-018-4919-z

PMID:30367576

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6101087/

Abstract

BACKGROUND

With the developments of DNA sequencing technology, large amounts of sequencing data have been produced that provides unprecedented opportunities for advanced association studies between somatic mutations and cancer types/subtypes which further contributes to more accurate somatic mutation based cancer typing (SMCT). In existing SMCT methods however, the absence of high-level feature extraction is a major obstacle in improving the classification performance.

RESULTS

We propose DeepCNA, an advanced convolutional neural network (CNN) based classifier, which utilizes copy number aberrations (CNAs) and HiC data, to address this issue. DeepCNA first pre-process the CNA data by clipping, zero padding and reshaping. Then, the processed data is fed into a CNN classifier, which extracts high-level features for accurate classification. Experimental results on the COSMIC CNA dataset indicate that 2D CNN with both cell lines of HiC data lead to the best performance. We further compare DeepCNA with three widely adopted classifiers, and demonstrate that DeepCNA has at least 78% improvement of performance.

CONCLUSIONS

This paper demonstrates the advantages and potential of the proposed DeepCNA model for processing of somatic point mutation based gene data, and proposes that its usage may be extended to other complex genotype-phenotype association studies.

摘要

背景

随着 DNA 测序技术的发展，产生了大量的测序数据，为体细胞突变与癌症类型/亚型之间的高级关联研究提供了前所未有的机会，这进一步促进了更准确的基于体细胞突变的癌症分型（SMCT）。然而，在现有的 SMCT 方法中，缺乏高级特征提取是提高分类性能的主要障碍。

结果

我们提出了 DeepCNA，一种基于先进的卷积神经网络（CNN）的分类器，它利用拷贝数异常（CNAs）和 HiC 数据来解决这个问题。DeepCNA 首先通过裁剪、零填充和重塑来预处理 CNA 数据。然后，将处理后的数据输入到 CNN 分类器中，该分类器提取高级特征以进行准确分类。在 COSMIC CNA 数据集上的实验结果表明，使用 HiC 数据的 2D CNN 可以获得最佳性能。我们进一步将 DeepCNA 与三种广泛应用的分类器进行比较，证明 DeepCNA 的性能至少提高了 78%。

结论

本文证明了所提出的 DeepCNA 模型在基于体细胞点突变的基因数据处理方面的优势和潜力，并提出其用途可能扩展到其他复杂的基因型-表型关联研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15da/6101087/89fb2c1c470f/12864_2018_4919_Fig1_HTML.jpg

相似文献

Cancer type prediction based on copy number aberration and chromatin 3D structure with convolutional neural networks.基于拷贝数变异和染色质 3D 结构的癌症类型预测的卷积神经网络方法。

BMC Genomics. 2018 Aug 13;19(Suppl 6):565. doi: 10.1186/s12864-018-4919-z.

DeepGene: an advanced cancer type classifier based on deep learning and somatic point mutations.DeepGene：一种基于深度学习和体细胞点突变的先进癌症类型分类器。

BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):476. doi: 10.1186/s12859-016-1334-9.

Inferring single-cell copy number profiles through cross-cell segmentation of read counts.通过读取计数的跨细胞分割推断单细胞拷贝数谱。

BMC Genomics. 2024 Jan 2;25(1):25. doi: 10.1186/s12864-023-09901-5.

Convolutional neural network models for cancer type prediction based on gene expression.基于基因表达的癌症类型预测卷积神经网络模型。

BMC Med Genomics. 2020 Apr 3;13(Suppl 5):44. doi: 10.1186/s12920-020-0677-2.

A deep dive into understanding tumor foci classification using multiparametric MRI based on convolutional neural network.基于卷积神经网络，深入探究利用多参数磁共振成像进行肿瘤病灶分类。

Med Phys. 2020 Sep;47(9):4077-4086. doi: 10.1002/mp.14255. Epub 2020 Jun 12.

Deep convolutional neural network based hyperspectral brain tissue classification.基于深度卷积神经网络的高光谱脑组织分类。

J Xray Sci Technol. 2023;31(4):777-796. doi: 10.3233/XST-230045.

fMRI volume classification using a 3D convolutional neural network robust to shifted and scaled neuronal activations.使用对移位和缩放神经元激活具有鲁棒性的 3D 卷积神经网络进行 fMRI 体积分类。

Neuroimage. 2020 Dec;223:117328. doi: 10.1016/j.neuroimage.2020.117328. Epub 2020 Sep 5.

Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data.评估单细胞 DNA 测序数据中拷贝数变异检测方法的性能。

PLoS Comput Biol. 2020 Jul 13;16(7):e1008012. doi: 10.1371/journal.pcbi.1008012. eCollection 2020 Jul.

DeepSSV: detecting somatic small variants in paired tumor and normal sequencing data with convolutional neural network.DeepSSV：使用卷积神经网络检测配对肿瘤和正常测序数据中的体细胞小变异。

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa272.

Cervical cell classification with graph convolutional network.基于图卷积网络的宫颈细胞分类

Comput Methods Programs Biomed. 2021 Jan;198:105807. doi: 10.1016/j.cmpb.2020.105807. Epub 2020 Oct 22.

引用本文的文献

3D genome contributes to MHC-II neoantigen prediction.三维基因组影响 MHC-II 新抗原预测。

BMC Genomics. 2024 Sep 26;25(Suppl 2):889. doi: 10.1186/s12864-024-10687-3.

Using Copy Number Variation Data and Neural Networks to Predict Cancer Metastasis Origin Achieves High Area under the Curve Value with a Trade-Off in Precision.利用拷贝数变异数据和神经网络预测癌症转移起源在精度上有所权衡的情况下实现了较高的曲线下面积值。

Curr Issues Mol Biol. 2024 Aug 1;46(8):8301-8319. doi: 10.3390/cimb46080490.

Prediction of Alzheimer's Disease Based on 3D Genome Selected circRNA.基于 3D 基因组选择的 circRNA 预测阿尔茨海默病。

J Prev Alzheimers Dis. 2024;11(4):1055-1062. doi: 10.14283/jpad.2024.52.

Enhancing cancer stage prediction through hybrid deep neural networks: a comparative study.通过混合深度神经网络增强癌症分期预测：一项比较研究。

Front Big Data. 2024 Mar 22;7:1359703. doi: 10.3389/fdata.2024.1359703. eCollection 2024.

Deep learning in cancer genomics and histopathology.深度学习在癌症基因组学和组织病理学中的应用。

Genome Med. 2024 Mar 27;16(1):44. doi: 10.1186/s13073-024-01315-6.

3D genome-selected microRNAs to improve Alzheimer's disease prediction.用于改善阿尔茨海默病预测的3D基因组选择的微小RNA

Front Neurol. 2023 Feb 13;14:1059492. doi: 10.3389/fneur.2023.1059492. eCollection 2023.

BoT-Net: a lightweight bag of tricks-based neural network for efficient LncRNA-miRNA interaction prediction.BoT-Net：一种基于轻量级技巧的神经网络，用于高效的 LncRNA-miRNA 相互作用预测。

Interdiscip Sci. 2022 Dec;14(4):841-862. doi: 10.1007/s12539-022-00535-x. Epub 2022 Aug 10.

Multiclass Cancer Prediction Based on Copy Number Variation Using Deep Learning.基于深度学习的拷贝数变异的多癌症预测。

Comput Intell Neurosci. 2022 Jun 9;2022:4742986. doi: 10.1155/2022/4742986. eCollection 2022.

Secure tumor classification by shallow neural network using homomorphic encryption.利用同态加密实现浅层神经网络的肿瘤分类安全。

BMC Genomics. 2022 Apr 9;23(1):284. doi: 10.1186/s12864-022-08469-w.

Deep Learning-Based Pan-Cancer Classification Model Reveals Tissue-of-Origin Specific Gene Expression Signatures.基于深度学习的泛癌分类模型揭示组织起源特异性基因表达特征

Cancers (Basel). 2022 Feb 24;14(5):1185. doi: 10.3390/cancers14051185.

本文引用的文献

DeepGene: an advanced cancer type classifier based on deep learning and somatic point mutations.DeepGene：一种基于深度学习和体细胞点突变的先进癌症类型分类器。

BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):476. doi: 10.1186/s12859-016-1334-9.

Fully Convolutional Networks for Semantic Segmentation.全卷积网络用于语义分割。

IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.

COSMIC: exploring the world's knowledge of somatic mutations in human cancer.COSMIC：探索全球关于人类癌症体细胞突变的知识。

Nucleic Acids Res. 2015 Jan;43(Database issue):D805-11. doi: 10.1093/nar/gku1075. Epub 2014 Oct 29.

Chromosomal instability, aneuploidy, and cancer.染色体不稳定性、非整倍体与癌症。

Front Oncol. 2014 Jun 19;4:161. doi: 10.3389/fonc.2014.00161. eCollection 2014.

Copy number variation detection using next generation sequencing read counts.使用下一代测序读段计数进行拷贝数变异检测。

BMC Bioinformatics. 2014 Apr 14;15:109. doi: 10.1186/1471-2105-15-109.

Pan-cancer patterns of somatic copy number alteration.体细胞拷贝数改变的泛癌模式

Nat Genet. 2013 Oct;45(10):1134-40. doi: 10.1038/ng.2760.

The causes and consequences of genetic heterogeneity in cancer evolution.癌症进化中遗传异质性的原因和后果。

Nature. 2013 Sep 19;501(7467):338-45. doi: 10.1038/nature12625.

Potential risks of pharmacy compounding.药剂配制的潜在风险。

Drugs R D. 2013 Mar;13(1):1-8. doi: 10.1007/s40268-013-0005-9.

Functional genomic analysis of chromosomal aberrations in a compendium of 8000 cancer genomes.对 8000 个癌症基因组中染色体畸变的功能基因组分析。

Genome Res. 2013 Feb;23(2):217-27. doi: 10.1101/gr.140301.112. Epub 2012 Nov 6.

Circulating tumor cells, disease recurrence and survival in newly diagnosed breast cancer.新诊断乳腺癌中的循环肿瘤细胞、疾病复发与生存情况

Breast Cancer Res. 2012 Oct 22;14(5):R133. doi: 10.1186/bcr3333.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于拷贝数变异和染色质 3D 结构的癌症类型预测的卷积神经网络方法。

Cancer type prediction based on copy number aberration and chromatin 3D structure with convolutional neural networks.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献