• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LightCpG:一种基于单细胞全基因组序列数据的多视角 CpG 位点检测方法。

LightCpG: a multi-view CpG sites detection on single-cell whole genome sequence data.

机构信息

School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China.

School of Chemical Engineering and Technology, Tianjin University, Tianjin, China.

出版信息

BMC Genomics. 2019 Apr 23;20(1):306. doi: 10.1186/s12864-019-5654-9.

DOI:10.1186/s12864-019-5654-9
PMID:31014252
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6480911/
Abstract

BACKGROUND

DNA methylation plays an important role in multiple biological processes that are closely related to human health. The study of DNA methylation can provide an insight into the mechanism behind human health and can also have a positive effect on the assessment of human health status. However, the available sequencing technology is limited by incomplete CpG coverage. Therefore, it is crucial to discover an efficient and convenient method capable of distinguishing between the states of CpG sites. Previous studies focused on identifying methylation states of the CpG sites in single cell, which only evaluated sequence information or structural information.

RESULTS

In this paper, we propose a novel model, LightCpG, which combines the positional features with the sequence and structural features to provide information on the CpG sites at two stages. Next, we used the LightGBM model for training of the CpG site identification, and further utilized sample extraction and merged features to reduce the training time. Our results indicate that our method achieves outstanding performance in recognition of DNA methylation. The average AUC values of our method using the 25 human hepatocellular carcinoma cells (HCC) cell datasets and six human heptoplastoma-derived (HepG2) cell datasets were 0.9616 and 0.9213, respectively. Moreover, the average training times for our method on the HCC and HepG2 datasets were 8.3 and 5.06 s, respectively. Furthermore, the computational complexity of our model was much lower compared with other available methods that detect methylation states of the CpG sites.

CONCLUSIONS

In summary, LightCpG is an accurate model for identifying the DNA methylation status of CpG sites in single cells. Furthermore, three types of feature extraction methods and two strategies used in LightCpG are helpful for other prediction problems.

摘要

背景

DNA 甲基化在与人类健康密切相关的多种生物学过程中发挥着重要作用。对 DNA 甲基化的研究可以深入了解人类健康背后的机制,也可以对评估人类健康状况产生积极影响。然而,现有的测序技术受到 CpG 覆盖不完全的限制。因此,发现一种能够区分 CpG 位点状态的高效便捷方法至关重要。以前的研究集中在识别单细胞中 CpG 位点的甲基化状态,这些研究仅评估了序列信息或结构信息。

结果

在本文中,我们提出了一种新的模型 LightCpG,该模型将位置特征与序列和结构特征相结合,提供了 CpG 位点在两个阶段的信息。接下来,我们使用 LightGBM 模型对 CpG 位点进行识别训练,并进一步利用样本提取和合并特征来减少训练时间。我们的结果表明,我们的方法在识别 DNA 甲基化方面表现出色。使用 25 个人肝癌细胞(HCC)细胞数据集和 6 个人肝母细胞瘤衍生(HepG2)细胞数据集,我们方法的平均 AUC 值分别为 0.9616 和 0.9213。此外,我们的方法在 HCC 和 HepG2 数据集上的平均训练时间分别为 8.3 和 5.06 秒。此外,与其他检测 CpG 位点甲基化状态的可用方法相比,我们模型的计算复杂度要低得多。

结论

总之,LightCpG 是一种用于识别单细胞中 CpG 位点 DNA 甲基化状态的准确模型。此外,LightCpG 中使用的三种特征提取方法和两种策略有助于解决其他预测问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5393/6480911/d5bfc63dc7b2/12864_2019_5654_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5393/6480911/701a24b5e65c/12864_2019_5654_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5393/6480911/d5bfc63dc7b2/12864_2019_5654_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5393/6480911/701a24b5e65c/12864_2019_5654_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5393/6480911/d5bfc63dc7b2/12864_2019_5654_Fig2_HTML.jpg

相似文献

1
LightCpG: a multi-view CpG sites detection on single-cell whole genome sequence data.LightCpG:一种基于单细胞全基因组序列数据的多视角 CpG 位点检测方法。
BMC Genomics. 2019 Apr 23;20(1):306. doi: 10.1186/s12864-019-5654-9.
2
iCpG-Pos: an accurate computational approach for identification of CpG sites using positional features on single-cell whole genome sequence data.iCpG-Pos:一种使用单细胞全基因组序列数据上的位置特征来识别 CpG 位点的准确计算方法。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad474.
3
Exploring genome-wide DNA methylation profiles altered in hepatocellular carcinoma using Infinium HumanMethylation 450 BeadChips.利用 Infinium HumanMethylation 450 BeadChips 探索肝癌中全基因组 DNA 甲基化谱的改变。
Epigenetics. 2013 Jan;8(1):34-43. doi: 10.4161/epi.23062. Epub 2012 Dec 3.
4
[Screening for differential methylation status by CpG island microarray in the hepatocellular carcinoma cell lines].[利用CpG岛微阵列筛选肝癌细胞系中的差异甲基化状态]
Zhonghua Zhong Liu Za Zhi. 2008 Dec;30(12):891-6.
5
Frequent hypermethylation of CpG islands and loss of expression of the 14-3-3 sigma gene in human hepatocellular carcinoma.人肝细胞癌中CpG岛频繁发生高甲基化及14-3-3西格玛基因表达缺失。
Oncogene. 2000 Nov 2;19(46):5298-302. doi: 10.1038/sj.onc.1203898.
6
Linking the epigenome to the genome: correlation of different features to DNA methylation of CpG islands.将表观基因组与基因组联系起来:不同特征与 CpG 岛 DNA 甲基化的相关性。
PLoS One. 2012;7(4):e35327. doi: 10.1371/journal.pone.0035327. Epub 2012 Apr 30.
7
Using Illumina Infinium HumanMethylation 450K BeadChip to explore genome‑wide DNA methylation profiles in a human hepatocellular carcinoma cell line.采用 Illumina Infinium HumanMethylation 450K BeadChip 技术探索人肝癌细胞系的全基因组 DNA 甲基化图谱。
Mol Med Rep. 2018 Nov;18(5):4446-4456. doi: 10.3892/mmr.2018.9441. Epub 2018 Sep 3.
8
5' CpG island methylation analysis identifies the MAGE-A1 and MAGE-A3 genes as potential markers of HCC.5' 甲基化分析确定MAGE - A1和MAGE - A3基因是肝癌的潜在标志物。
Clin Biochem. 2006 Mar;39(3):259-66. doi: 10.1016/j.clinbiochem.2006.01.014. Epub 2006 Mar 6.
9
Identification of cell type-specific methylation signals in bulk whole genome bisulfite sequencing data.批量全基因组亚硫酸氢盐测序数据中细胞类型特异性甲基化信号的鉴定。
Genome Biol. 2020 Jul 1;21(1):156. doi: 10.1186/s13059-020-02065-5.
10
Classification of early and late stage liver hepatocellular carcinoma patients from their genomics and epigenomics profiles.从基因组学和表观基因组学特征对早期和晚期肝癌患者进行分类。
PLoS One. 2019 Sep 6;14(9):e0221476. doi: 10.1371/journal.pone.0221476. eCollection 2019.

引用本文的文献

1
MambaCpG: an accurate model for single-cell DNA methylation status imputation using mamba.曼巴CpG:一种使用曼巴进行单细胞DNA甲基化状态插补的精确模型。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf360.
2
UNet with Attention Networks: A Novel Deep Learning Approach for DNA Methylation Prediction in HeLa Cells.带有注意力网络的UNet:一种用于预测HeLa细胞中DNA甲基化的新型深度学习方法。
Genes (Basel). 2025 May 28;16(6):655. doi: 10.3390/genes16060655.
3
Genome-wide methylome modeling via generative AI incorporating long- and short-range interactions.

本文引用的文献

1
BoostMe accurately predicts DNA methylation values in whole-genome bisulfite sequencing of multiple human tissues.BoostMe 能够准确预测多种人类组织全基因组亚硫酸氢盐测序中的 DNA 甲基化值。
BMC Genomics. 2018 May 23;19(1):390. doi: 10.1186/s12864-018-4766-y.
2
DNA methylation-based classification of central nervous system tumours.基于 DNA 甲基化的中枢神经系统肿瘤分类。
Nature. 2018 Mar 22;555(7697):469-474. doi: 10.1038/nature26000. Epub 2018 Mar 14.
3
The association between copy number aberration, DNA methylation and gene expression in tumor samples.
通过整合长程和短程相互作用的生成式人工智能进行全基因组甲基化组建模。
Sci Adv. 2025 Apr 11;11(15):eadt4152. doi: 10.1126/sciadv.adt4152.
4
Deep learning imputes DNA methylation states in single cells and enhances the detection of epigenetic alterations in schizophrenia.深度学习可推断单细胞中的DNA甲基化状态,并增强对精神分裂症表观遗传改变的检测。
Cell Genom. 2025 Mar 12;5(3):100774. doi: 10.1016/j.xgen.2025.100774. Epub 2025 Feb 21.
5
CpGFuse: a holistic approach for accurate identification of methylation states of DNA CpG sites.CpGFuse:一种用于准确识别DNA CpG位点甲基化状态的整体方法。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf063.
6
Advancing epigenetic profiling in cervical cancer: machine learning techniques for classifying DNA methylation patterns.宫颈癌中表观遗传谱分析的进展:用于分类DNA甲基化模式的机器学习技术
3 Biotech. 2024 Nov;14(11):264. doi: 10.1007/s13205-024-04107-2. Epub 2024 Oct 9.
7
GraphCpG: imputation of single-cell methylomes based on locus-aware neighboring subgraphs.GraphCpG:基于位置感知邻域子图的单细胞甲基化组数据插补。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad533.
8
iCpG-Pos: an accurate computational approach for identification of CpG sites using positional features on single-cell whole genome sequence data.iCpG-Pos:一种使用单细胞全基因组序列数据上的位置特征来识别 CpG 位点的准确计算方法。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad474.
9
Completing Single-Cell DNA Methylome Profiles Transfer Learning Together With KL-Divergence.结合KL散度的迁移学习完成单细胞DNA甲基化组图谱分析
Front Genet. 2022 Jul 22;13:910439. doi: 10.3389/fgene.2022.910439. eCollection 2022.
10
Computational Methods for Single-cell DNA Methylome Analysis.单细胞 DNA 甲基化组分析的计算方法。
Genomics Proteomics Bioinformatics. 2023 Feb;21(1):48-66. doi: 10.1016/j.gpb.2022.05.007. Epub 2022 Jun 17.
肿瘤样本中拷贝数变异、DNA 甲基化和基因表达之间的关联。
Nucleic Acids Res. 2018 Apr 6;46(6):3009-3018. doi: 10.1093/nar/gky131.
4
A Novel Computational Method for Detecting DNA Methylation Sites with DNA Sequence Information and Physicochemical Properties.一种基于 DNA 序列信息和理化性质的新型 DNA 甲基化位点检测计算方法。
Int J Mol Sci. 2018 Feb 8;19(2):511. doi: 10.3390/ijms19020511.
5
Prediction of DNA methylation in the promoter of gene suppressor tumor.
Gene. 2018 Apr 20;651:166-173. doi: 10.1016/j.gene.2018.01.082. Epub 2018 Feb 9.
6
Genome-wide DNA methylation is predictive of outcome in juvenile myelomonocytic leukemia.全基因组 DNA 甲基化可预测少年粒单细胞白血病的预后。
Nat Commun. 2017 Dec 19;8(1):2127. doi: 10.1038/s41467-017-02178-9.
7
Mitochondrial DNA Methylation and Related Disease.线粒体 DNA 甲基化与相关疾病。
Adv Exp Med Biol. 2017;1038:117-132. doi: 10.1007/978-981-10-6674-0_9.
8
Statistical and integrative system-level analysis of DNA methylation data.统计和综合系统水平的 DNA 甲基化数据分析。
Nat Rev Genet. 2018 Mar;19(3):129-147. doi: 10.1038/nrg.2017.86. Epub 2017 Nov 13.
9
Simultaneous measurement of chromatin accessibility, DNA methylation, and nucleosome phasing in single cells.单细胞中染色质可及性、DNA甲基化和核小体相位的同步测量。
Elife. 2017 Jun 27;6:e23203. doi: 10.7554/eLife.23203.
10
DIRECTION: a machine learning framework for predicting and characterizing DNA methylation and hydroxymethylation in mammalian genomes.一种用于预测和描述哺乳动物基因组中 DNA 甲基化和羟甲基化的机器学习框架。
Bioinformatics. 2017 Oct 1;33(19):2986-2994. doi: 10.1093/bioinformatics/btx316.