School of Biomedical Engineering, School of Ophthalmology & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou, 325027, China.
Clin Epigenetics. 2021 Oct 24;13(1):197. doi: 10.1186/s13148-021-01183-6.
5-Hydroxymethylcytosine (5hmC) is a significant DNA epigenetic modification. However, the 5hmC modification alterations in genomic regions encoding long non-coding RNA (lncRNA) and their clinical significance remain poorly characterized.
A three-phase discovery-modeling-validation study was conducted to explore the potential of the plasma-derived 5hmC modification level in genomic regions encoding lncRNAs as a superior alternative biomarker for cancer diagnosis and surveillance. Genome-wide 5hmC profiles in the plasma circulating cell-free DNA of 1632 cancer and 1379 non-cancerous control samples from different cancer types and multiple centers were repurposed and characterized. A large number of altered 5hmC modifications were distributed at genomic regions encoding lncRNAs in cancerous compared with healthy subjects. Furthermore, most 5hmC-modified lncRNA genes were cancer-specific, with only a relatively small number of 5hmC-modified lncRNA genes shared by various cancer types. A 5hmC-LncRNA diagnostic score (5hLD-score) comprising 39 tissue-shared 5hmC-modified lncRNA gene markers was developed using elastic net regularization. The 5hLD-score was able to accurately distinguish tumors from healthy controls with an area under the curve (AUC) of 0.963 [95% confidence interval (CI) 0.940-0.985] and 0.912 (95% CI 0.837-0.987) in the training and internal validation cohorts, respectively. Results from three independent validations confirmed the robustness and stability of the 5hLD-score with an AUC of 0.851 (95% CI 0.786-0.916) in Zhang's non-small cell lung cancer cohort, AUC of 0.887 (95% CI 0.852-0.922) in Tian's esophageal cancer cohort, and AUC of 0.768 (95% CI 0.746-0.790) in Cai's hepatocellular carcinoma cohort. In addition, a significant association was identified between the 5hLD-score and the progression from hepatitis to liver cancer. Finally, lncRNA genes modified by tissue-specific 5hmC alteration were again found to be capable of identifying the origin and location of tumors.
The present study will contribute to the ongoing effort to understand the transcriptional programs of lncRNA genes, as well as facilitate the development of novel invasive genomic tools for early cancer detection and surveillance.
5-羟甲基胞嘧啶(5hmC)是一种重要的 DNA 表观遗传修饰。然而,基因组区域编码长非编码 RNA(lncRNA)的 5hmC 修饰改变及其临床意义仍知之甚少。
进行了一项三阶段的发现-建模-验证研究,以探索血浆来源的 lncRNA 基因基因组区域 5hmC 修饰水平作为癌症诊断和监测的优异替代生物标志物的潜力。重新利用和表征了来自不同癌症类型和多个中心的 1632 例癌症和 1379 例非癌症对照样本的血浆循环无细胞 DNA 的全基因组 5hmC 图谱。与健康受试者相比,在癌症患者中,大量改变的 5hmC 修饰分布在编码 lncRNA 的基因组区域。此外,大多数 5hmC 修饰的 lncRNA 基因是癌症特异性的,只有相对较少的 5hmC 修饰的 lncRNA 基因存在于各种癌症类型中。使用弹性网络正则化开发了一个包含 39 个组织共享 5hmC 修饰的 lncRNA 基因标记的 5hmC-LncRNA 诊断评分(5hLD-score)。在训练和内部验证队列中,5hLD-score 能够以 0.963(95%置信区间[CI]0.940-0.985)和 0.912(95%CI0.837-0.987)的曲线下面积(AUC)准确地区分肿瘤与健康对照,在三个独立验证中,5hLD-score 的 AUC 分别为 0.851(95%CI0.786-0.916)在 Zhang 的非小细胞肺癌队列,AUC 为 0.887(95%CI0.852-0.922)在 Tian 的食管癌队列,和 AUC 为 0.768(95%CI0.746-0.790)在 Cai 的肝细胞癌队列。此外,还发现 5hLD-score 与从肝炎到肝癌的进展之间存在显著关联。最后,再次发现组织特异性 5hmC 改变修饰的 lncRNA 基因能够识别肿瘤的起源和位置。
本研究将有助于理解 lncRNA 基因转录程序的努力,并促进开发用于早期癌症检测和监测的新型侵袭性基因组工具。