• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从基因组学和表观基因组学特征对早期和晚期肝癌患者进行分类。

Classification of early and late stage liver hepatocellular carcinoma patients from their genomics and epigenomics profiles.

机构信息

Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India.

Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.

出版信息

PLoS One. 2019 Sep 6;14(9):e0221476. doi: 10.1371/journal.pone.0221476. eCollection 2019.

DOI:10.1371/journal.pone.0221476
PMID:31490960
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6730898/
Abstract

BACKGROUND

Liver Hepatocellular Carcinoma (LIHC) is one of the major cancers worldwide, responsible for millions of premature deaths every year. Prediction of clinical staging is vital to implement optimal therapeutic strategy and prognostic prediction in cancer patients. However, to date, no method has been developed for predicting the stage of LIHC from the genomic profile of samples.

METHODS

The Cancer Genome Atlas (TCGA) dataset of 173 early stage (stage-I), 177 late stage (stage-II, Stage-III and stage-IV) and 50 adjacent normal tissue samples for 60,483 RNA transcripts and 485,577 methylation CpG sites, was extensively analyzed to identify the key transcriptomic expression and methylation-based features using different feature selection techniques. Further, different classification models were developed based on selected key features to categorize different classes of samples implementing different machine learning algorithms.

RESULTS

In the current study, in silico models have been developed for classifying LIHC patients in the early vs. late stage and cancerous vs. normal samples using RNA expression and DNA methylation data. TCGA datasets were extensively analyzed to identify differentially expressed RNA transcripts and methylated CpG sites that can discriminate early vs. late stages and cancer vs. normal samples of LIHC with high precision. Naive Bayes model developed using 51 features that combine 21 CpG methylation sites and 30 RNA transcripts achieved maximum MCC (Matthew's correlation coefficient) 0.58 with an accuracy of 78.87% on the validation dataset in discrimination of early and late stage. Additionally, the prediction models developed based on 5 RNA transcripts and 5 CpG sites classify LIHC and normal samples with an accuracy of 96-98% and AUC (Area Under the Receiver Operating Characteristic curve) 0.99. Besides, multiclass models also developed for classifying samples in the normal, early and late stage of cancer and achieved an accuracy of 76.54% and AUC of 0.86.

CONCLUSION

Our study reveals stage prediction of LIHC samples with high accuracy based on the genomics and epigenomics profiling is a challenging task in comparison to the classification of cancerous and normal samples. Comprehensive analysis, differentially expressed RNA transcripts, methylated CpG sites in LIHC samples and prediction models are available from CancerLSP (http://webs.iiitd.edu.in/raghava/cancerlsp/).

摘要

背景

肝癌(LIHC)是全球主要癌症之一,每年导致数百万人过早死亡。预测临床分期对于癌症患者实施最佳治疗策略和预后预测至关重要。然而,迄今为止,尚无方法可从样本的基因组谱中预测 LIHC 的分期。

方法

对 173 例早期(I 期)、177 例晚期(II 期、III 期和 IV 期)和 50 例相邻正常组织样本的癌症基因组图谱(TCGA)数据集进行了广泛分析,以使用不同的特征选择技术识别关键转录组表达和基于甲基化的特征。此外,基于选定的关键特征开发了不同的分类模型,以使用不同的机器学习算法对不同类别的样本进行分类。

结果

在本研究中,使用 RNA 表达和 DNA 甲基化数据,为早期与晚期 LIHC 患者以及癌症与正常样本的分类开发了计算机模型。对 TCGA 数据集进行了广泛分析,以识别可区分 LIHC 早期与晚期以及癌症与正常样本的差异表达 RNA 转录物和甲基化 CpG 位点,具有高精度。使用结合了 21 个 CpG 甲基化位点和 30 个 RNA 转录物的 51 个特征开发的朴素贝叶斯模型在验证数据集中实现了最大 MCC(马修相关系数)0.58,准确率为 78.87%。此外,基于 5 个 RNA 转录物和 5 个 CpG 位点开发的预测模型可将 LIHC 和正常样本分类,准确率为 96-98%,AUC(接收器工作特征曲线下的面积)为 0.99。此外,还开发了用于分类正常、早期和晚期癌症样本的多类模型,准确率为 76.54%,AUC 为 0.86。

结论

与癌症和正常样本的分类相比,基于基因组学和表观基因组学谱对 LIHC 样本进行高精度分期预测是一项具有挑战性的任务。癌症 LSP(http://webs.iiitd.edu.in/raghava/cancerlsp/)提供了全面的分析、LIHC 样本中差异表达的 RNA 转录物、甲基化的 CpG 位点和预测模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a504/6730898/d9c52b8f1a16/pone.0221476.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a504/6730898/c620d5439fe3/pone.0221476.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a504/6730898/e09ce6476743/pone.0221476.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a504/6730898/5dcdb68d74a3/pone.0221476.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a504/6730898/d9c52b8f1a16/pone.0221476.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a504/6730898/c620d5439fe3/pone.0221476.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a504/6730898/e09ce6476743/pone.0221476.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a504/6730898/5dcdb68d74a3/pone.0221476.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a504/6730898/d9c52b8f1a16/pone.0221476.g004.jpg

相似文献

1
Classification of early and late stage liver hepatocellular carcinoma patients from their genomics and epigenomics profiles.从基因组学和表观基因组学特征对早期和晚期肝癌患者进行分类。
PLoS One. 2019 Sep 6;14(9):e0221476. doi: 10.1371/journal.pone.0221476. eCollection 2019.
2
Expression based biomarkers and models to classify early and late-stage samples of Papillary Thyroid Carcinoma.基于表达谱的生物标志物和模型,用于分类甲状腺乳头状癌的早期和晚期样本。
PLoS One. 2020 Apr 23;15(4):e0231629. doi: 10.1371/journal.pone.0231629. eCollection 2020.
3
Using epigenomics data to predict gene expression in lung cancer.利用表观基因组学数据预测肺癌中的基因表达。
BMC Bioinformatics. 2015;16 Suppl 5(Suppl 5):S10. doi: 10.1186/1471-2105-16-S5-S10. Epub 2015 Mar 18.
4
Using Illumina Infinium HumanMethylation 450K BeadChip to explore genome‑wide DNA methylation profiles in a human hepatocellular carcinoma cell line.采用 Illumina Infinium HumanMethylation 450K BeadChip 技术探索人肝癌细胞系的全基因组 DNA 甲基化图谱。
Mol Med Rep. 2018 Nov;18(5):4446-4456. doi: 10.3892/mmr.2018.9441. Epub 2018 Sep 3.
5
Integrative analysis of DNA methylation and gene expression reveals hepatocellular carcinoma-specific diagnostic biomarkers.整合 DNA 甲基化和基因表达分析揭示肝细胞癌特异性诊断生物标志物。
Genome Med. 2018 May 30;10(1):42. doi: 10.1186/s13073-018-0548-z.
6
CpG Methylation Signature Predicts Recurrence in Early-Stage Hepatocellular Carcinoma: Results From a Multicenter Study.CpG 甲基化特征可预测早期肝细胞癌的复发:来自多中心研究的结果。
J Clin Oncol. 2017 Mar;35(7):734-742. doi: 10.1200/JCO.2016.68.2153. Epub 2017 Jan 9.
7
Four differentially methylated gene pairs to predict the prognosis for early stage hepatocellular carcinoma patients.四个差异甲基化基因对预测早期肝细胞癌患者的预后。
J Cell Physiol. 2018 Sep;233(9):6583-6590. doi: 10.1002/jcp.26256. Epub 2018 Feb 28.
8
Deciphering dysregulation and CpG methylation in hepatocellular carcinoma using multi-omics and machine learning.利用多组学和机器学习破译肝细胞癌中的失调和 CpG 甲基化。
Epigenomics. 2024;16(13):961-983. doi: 10.1080/17501911.2024.2374702. Epub 2024 Jul 29.
9
Integrative analysis of DNA methylation and gene expression identify a six epigenetic driver signature for predicting prognosis in hepatocellular carcinoma.整合 DNA 甲基化和基因表达分析,鉴定出用于预测肝细胞癌预后的六个表观遗传驱动特征。
J Cell Physiol. 2019 Jul;234(7):11942-11950. doi: 10.1002/jcp.27882. Epub 2018 Dec 7.
10
A novel microRNA signature predicts survival in liver hepatocellular carcinoma after hepatectomy.一种新型 microRNA 标志物可预测肝癌切除术后的生存情况。
Sci Rep. 2018 May 21;8(1):7933. doi: 10.1038/s41598-018-26374-9.

引用本文的文献

1
Role of the USP family in autophagy regulation and cancer progression.USP家族在自噬调节和癌症进展中的作用。
Apoptosis. 2025 Jun;30(5-6):1133-1151. doi: 10.1007/s10495-025-02095-z. Epub 2025 Mar 5.
2
Comprehensive review and updated analysis of DNA methylation in hepatocellular carcinoma: From basic research to clinical application.肝细胞癌中 DNA 甲基化的综合回顾和最新分析:从基础研究到临床应用。
Clin Transl Med. 2024 Nov;14(11):e70066. doi: 10.1002/ctm2.70066.
3
Characterization of tumor suppressors and oncogenes evaluated from TCGA cancers.

本文引用的文献

1
Cancer statistics, 2019.癌症统计数据,2019 年。
CA Cancer J Clin. 2019 Jan;69(1):7-34. doi: 10.3322/caac.21551. Epub 2019 Jan 8.
2
Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries.全球癌症统计数据 2018:GLOBOCAN 对全球 185 个国家/地区 36 种癌症的发病率和死亡率的估计。
CA Cancer J Clin. 2018 Nov;68(6):394-424. doi: 10.3322/caac.21492. Epub 2018 Sep 12.
3
Analysis of the Cancer Genome Atlas Data Reveals Novel Putative ncRNAs Targets in Hepatocellular Carcinoma.
从TCGA癌症数据评估肿瘤抑制基因和癌基因的特征。
Am J Clin Exp Immunol. 2024 Aug 25;13(4):187-194. doi: 10.62347/XMZW6604. eCollection 2024.
4
Deciphering dysregulation and CpG methylation in hepatocellular carcinoma using multi-omics and machine learning.利用多组学和机器学习破译肝细胞癌中的失调和 CpG 甲基化。
Epigenomics. 2024;16(13):961-983. doi: 10.1080/17501911.2024.2374702. Epub 2024 Jul 29.
5
Uncovering myocardial infarction genetic signatures using GWAS exploration in Saudi and European cohorts.利用 GWAS 探索在沙特和欧洲队列中揭示心肌梗死的遗传特征。
Sci Rep. 2023 Dec 10;13(1):21866. doi: 10.1038/s41598-023-49105-1.
6
The use of SP/Neurokinin-1 as a Therapeutic Target in Colon and Rectal Cancer.SP/神经激肽-1 在结直肠癌中的治疗靶点作用。
Curr Med Chem. 2024;31(39):6487-6509. doi: 10.2174/0109298673261625230924114406.
7
In silico transcriptional analysis of asymptomatic and severe COVID-19 patients reveals the susceptibility of severe patients to other comorbidities and non-viral pathological conditions.无症状和重症新冠肺炎患者的计算机转录分析揭示了重症患者对其他合并症和非病毒病理状况的易感性。
Hum Gene (Amst). 2023 Feb;35:201135. doi: 10.1016/j.humgen.2022.201135. Epub 2022 Dec 16.
8
The Role of Artificial Intelligence in the Detection and Implementation of Biomarkers for Hepatocellular Carcinoma: Outlook and Opportunities.人工智能在肝细胞癌生物标志物检测与应用中的作用:展望与机遇
Cancers (Basel). 2023 May 26;15(11):2928. doi: 10.3390/cancers15112928.
9
Key therapeutic targets implicated at the early stage of hepatocellular carcinoma identified through machine-learning approaches.通过机器学习方法鉴定出肝癌早期涉及的关键治疗靶点。
Sci Rep. 2023 Mar 7;13(1):3840. doi: 10.1038/s41598-023-30720-x.
10
Artificial intelligence in the detection, characterisation and prediction of hepatocellular carcinoma: a narrative review.人工智能在肝细胞癌检测、特征描述及预测中的应用:一项叙述性综述
Transl Gastroenterol Hepatol. 2022 Oct 25;7:41. doi: 10.21037/tgh-20-242. eCollection 2022.
分析癌症基因组图谱数据揭示了肝癌中新型潜在 ncRNA 靶点。
Biomed Res Int. 2018 Jun 26;2018:2864120. doi: 10.1155/2018/2864120. eCollection 2018.
4
Computer-aided prediction of antigen presenting cell modulators for designing peptide-based vaccine adjuvants.基于计算机辅助预测抗原呈递细胞调节剂设计基于肽的疫苗佐剂
J Transl Med. 2018 Jul 3;16(1):181. doi: 10.1186/s12967-018-1560-1.
5
RAMP3 is a prognostic indicator of liver cancer and might reduce the adverse effect of TP53 mutation on survival.RAMP3 是肝癌的预后指标,可能降低 TP53 突变对生存的不良影响。
Future Oncol. 2018 Oct;14(25):2615-2625. doi: 10.2217/fon-2018-0296. Epub 2018 Jun 8.
6
Approach for Prediction of Antifungal Peptides.抗真菌肽的预测方法。
Front Microbiol. 2018 Feb 26;9:323. doi: 10.3389/fmicb.2018.00323. eCollection 2018.
7
CDCA5 regulates proliferation in hepatocellular carcinoma and has potential as a negative prognostic marker.CDCA5调节肝细胞癌的增殖,具有作为阴性预后标志物的潜力。
Onco Targets Ther. 2018 Feb 20;11:891-901. doi: 10.2147/OTT.S154754. eCollection 2018.
8
Intragenic DNA methylation of PITX1 and the adjacent long non-coding RNA C5orf66-AS1 are prognostic biomarkers in patients with head and neck squamous cell carcinomas.PITX1基因内DNA甲基化及相邻的长链非编码RNA C5orf66-AS1是头颈部鳞状细胞癌患者的预后生物标志物。
PLoS One. 2018 Feb 9;13(2):e0192742. doi: 10.1371/journal.pone.0192742. eCollection 2018.
9
Methionine adenosyltransferases in liver health and diseases.肝脏健康与疾病中的甲硫氨酸腺苷转移酶
Liver Res. 2017 Sep;1(2):103-111. doi: 10.1016/j.livres.2017.07.002.
10
Long Noncoding RNAs as a Key Player in Hepatocellular Carcinoma.长链非编码RNA在肝细胞癌中扮演关键角色。
Biomark Cancer. 2017 Nov 2;9:1179299X17737301. doi: 10.1177/1179299X17737301. eCollection 2017.