Han Bo-Wei, Yang Xu, Qu Shou-Fang, Guo Zhi-Wei, Huang Li-Min, Li Kun, Ouyang Guo-Jun, Cai Geng-Xi, Xiao Wei-Wei, Weng Rong-Tao, Xu Shun, Huang Jie, Yang Xue-Xi, Wu Ying-Song
Key Laboratory of Antibody Engineering of Guangdong Higher Education Institutes, School of Laboratory Medicine and Biotechnology, Southern Medical University, Guangzhou, China.
Division of in vitro Diagnostic Reagents, National Institutes for Food and Drug Control (NIFDC), Beijing, China.
Front Med (Lausanne). 2021 Dec 3;8:684238. doi: 10.3389/fmed.2021.684238. eCollection 2021.
Cell-free DNA (cfDNA) serves as a footprint of the nucleosome occupancy status of transcription start sites (TSSs), and has been subject to wide development for use in noninvasive health monitoring and disease detection. However, the requirement for high sequencing depth limits its clinical use. Here, we introduce a deep-learning pipeline designed for TSS coverage profiles generated from shallow cfDNA sequencing called the Autoencoder of cfDNA TSS (AECT) coverage profile. AECT outperformed existing single-cell sequencing imputation algorithms in terms of improvements to TSS coverage accuracy and the capture of latent biological features that distinguish sex or tumor status. We built classifiers for the detection of breast and rectal cancer using AECT-imputed shallow sequencing data, and their performance was close to that achieved by high-depth sequencing, suggesting that AECT could provide a broadly applicable noninvasive screening approach with high accuracy and at a moderate cost.
游离DNA(cfDNA)作为转录起始位点(TSS)核小体占据状态的印记,已在非侵入性健康监测和疾病检测中得到广泛开发应用。然而,对高测序深度的要求限制了其临床应用。在此,我们介绍一种深度学习流程,专为从浅层cfDNA测序生成的TSS覆盖谱设计,称为cfDNA TSS自动编码器(AECT)覆盖谱。在提高TSS覆盖准确性以及捕获区分性别或肿瘤状态的潜在生物学特征方面,AECT优于现有的单细胞测序插补算法。我们使用AECT插补的浅层测序数据构建了用于检测乳腺癌和直肠癌的分类器,其性能接近深度测序所达到的性能,这表明AECT可以提供一种广泛适用的非侵入性筛查方法,具有高精度且成本适中。