Suppr超能文献

基于深度学习的无创性癌症检测,通过整合个体游离 DNA 读取的 DNA 序列和甲基化信息。

DISMIR: Deep learning-based noninvasive cancer detection by integrating DNA sequence and methylation information of individual cell-free DNA reads.

机构信息

Ministry of Education Key Laboratory of Bioinformatics; Center for Synthetic and Systems Biology; Bioinformatics Division, Beijing National Research Center for Information Science and Technology; Department of Automation, Tsinghua University, Beijing 100084, China.

出版信息

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab250.

Abstract

Detecting cancer signals in cell-free DNA (cfDNA) high-throughput sequencing data is emerging as a novel noninvasive cancer detection method. Due to the high cost of sequencing, it is crucial to make robust and precise predictions with low-depth cfDNA sequencing data. Here we propose a novel approach named DISMIR, which can provide ultrasensitive and robust cancer detection by integrating DNA sequence and methylation information in plasma cfDNA whole-genome bisulfite sequencing (WGBS) data. DISMIR introduces a new feature termed as 'switching region' to define cancer-specific differentially methylated regions, which can enrich the cancer-related signal at read-resolution. DISMIR applies a deep learning model to predict the source of every single read based on its DNA sequence and methylation state and then predicts the risk that the plasma donor is suffering from cancer. DISMIR exhibited high accuracy and robustness on hepatocellular carcinoma detection by plasma cfDNA WGBS data even at ultralow sequencing depths. Further analysis showed that DISMIR tends to be insensitive to alterations of single CpG sites' methylation states, which suggests DISMIR could resist to technical noise of WGBS. All these results showed DISMIR with the potential to be a precise and robust method for low-cost early cancer detection.

摘要

在无细胞游离 DNA(cfDNA)高通量测序数据中检测癌症信号正在成为一种新的无创癌症检测方法。由于测序成本高,因此使用低深度 cfDNA 测序数据进行稳健而精确的预测至关重要。在这里,我们提出了一种名为 DISMIR 的新方法,该方法可以通过整合血浆 cfDNA 全基因组亚硫酸氢盐测序(WGBS)数据中的 DNA 序列和甲基化信息,提供超灵敏和稳健的癌症检测。DISMIR 引入了一个新特征,称为“切换区”,以定义癌症特异性差异甲基化区域,从而在读取分辨率上富集与癌症相关的信号。DISMIR 应用深度学习模型基于其 DNA 序列和甲基化状态预测每个读取的来源,然后预测血浆供体是否患有癌症的风险。即使在超低测序深度下,DISMIR 也能在基于血浆 cfDNA WGBS 数据的肝细胞癌检测中表现出高精度和稳健性。进一步的分析表明,DISMIR 往往对单个 CpG 位点甲基化状态的改变不敏感,这表明 DISMIR 可以抵抗 WGBS 的技术噪声。所有这些结果表明,DISMIR 有可能成为一种用于低成本早期癌症检测的精确和稳健的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3db1/8575022/3988739f972c/bbab250f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验