• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用肿瘤及肿瘤旁组织学正常组织的“组学”数据预测肺癌亚型

On Predicting lung cancer subtypes using 'omic' data from tumor and tumor-adjacent histologically-normal tissue.

作者信息

Pineda Arturo López, Ogoe Henry Ato, Balasubramanian Jeya Balaji, Rangel Escareño Claudia, Visweswaran Shyam, Herman James Gordon, Gopalakrishnan Vanathi

机构信息

Department of Biomedical Informatics, University of Pittsburgh School of Medicine, 5607 Baum Boulevard, 15206, Pittsburgh, PA, USA.

Department of Computational Genomics, National Institute of Genomic Medicine, Periferico Sur No. 4809, Col. Arenal Tepepan, Tlalpan, 14610, Mexico City, Mexico.

出版信息

BMC Cancer. 2016 Mar 4;16:184. doi: 10.1186/s12885-016-2223-3.

DOI:10.1186/s12885-016-2223-3
PMID:26944944
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4778315/
Abstract

BACKGROUND

Adenocarcinoma (ADC) and squamous cell carcinoma (SCC) are the most prevalent histological types among lung cancers. Distinguishing between these subtypes is critically important because they have different implications for prognosis and treatment. Normally, histopathological analyses are used to distinguish between the two, where the tissue samples are collected based on small endoscopic samples or needle aspirations. However, the lack of cell architecture in these small tissue samples hampers the process of distinguishing between the two subtypes. Molecular profiling can also be used to discriminate between the two lung cancer subtypes, on condition that the biopsy is composed of at least 50 % of tumor cells. However, for some cases, the tissue composition of a biopsy might be a mix of tumor and tumor-adjacent histologically normal tissue (TAHN). When this happens, a new biopsy is required, with associated cost, risks and discomfort to the patient. To avoid this problem, we hypothesize that a computational method can distinguish between lung cancer subtypes given tumor and TAHN tissue.

METHODS

Using publicly available datasets for gene expression and DNA methylation, we applied four classification tasks, depending on the possible combinations of tumor and TAHN tissue. First, we used a feature selector (ReliefF/Limma) to select relevant variables, which were then used to build a simple naïve Bayes classification model. Then, we evaluated the classification performance of our models by measuring the area under the receiver operating characteristic curve (AUC). Finally, we analyzed the relevance of the selected genes using hierarchical clustering and IPA® software for gene functional analysis.

RESULTS

All Bayesian models achieved high classification performance (AUC > 0.94), which were confirmed by hierarchical cluster analysis. From the genes selected, 25 (93 %) were found to be related to cancer (19 were associated with ADC or SCC), confirming the biological relevance of our method.

CONCLUSIONS

The results from this study confirm that computational methods using tumor and TAHN tissue can serve as a prognostic tool for lung cancer subtype classification. Our study complements results from other studies where TAHN tissue has been used as prognostic tool for prostate cancer. The clinical implications of this finding could greatly benefit lung cancer patients.

摘要

背景

腺癌(ADC)和鳞状细胞癌(SCC)是肺癌中最常见的组织学类型。区分这些亚型至关重要,因为它们对预后和治疗有不同的影响。通常,组织病理学分析用于区分两者,组织样本基于小的内镜样本或针吸活检采集。然而,这些小组织样本中缺乏细胞结构阻碍了区分这两种亚型的过程。分子谱分析也可用于区分两种肺癌亚型,前提是活检样本至少由50%的肿瘤细胞组成。然而,对于某些病例,活检的组织组成可能是肿瘤组织与肿瘤相邻的组织学正常组织(TAHN)的混合。当这种情况发生时,需要重新进行活检,这会给患者带来相关费用、风险和不适。为避免这个问题,我们假设一种计算方法可以在给定肿瘤组织和TAHN组织的情况下区分肺癌亚型。

方法

利用公开可用的基因表达和DNA甲基化数据集,根据肿瘤组织和TAHN组织的可能组合应用了四项分类任务。首先,我们使用一个特征选择器(ReliefF/Limma)来选择相关变量,然后用这些变量构建一个简单的朴素贝叶斯分类模型。然后,我们通过测量受试者工作特征曲线(AUC)下的面积来评估我们模型的分类性能。最后,我们使用层次聚类和IPA®软件进行基因功能分析,分析所选基因的相关性。

结果

所有贝叶斯模型均取得了较高的分类性能(AUC>0.94),这通过层次聚类分析得到了证实。从所选基因中发现,25个(93%)与癌症相关(19个与ADC或SCC相关),证实了我们方法的生物学相关性。

结论

本研究结果证实,使用肿瘤组织和TAHN组织的计算方法可作为肺癌亚型分类的预后工具。我们的研究补充了其他将TAHN组织用作前列腺癌预后工具的研究结果。这一发现的临床意义可能会使肺癌患者受益匪浅。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be91/4778315/69ef0a1183d4/12885_2016_2223_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be91/4778315/65c966816557/12885_2016_2223_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be91/4778315/001041c20b95/12885_2016_2223_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be91/4778315/69ef0a1183d4/12885_2016_2223_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be91/4778315/65c966816557/12885_2016_2223_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be91/4778315/001041c20b95/12885_2016_2223_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be91/4778315/69ef0a1183d4/12885_2016_2223_Fig3_HTML.jpg

相似文献

1
On Predicting lung cancer subtypes using 'omic' data from tumor and tumor-adjacent histologically-normal tissue.利用肿瘤及肿瘤旁组织学正常组织的“组学”数据预测肺癌亚型
BMC Cancer. 2016 Mar 4;16:184. doi: 10.1186/s12885-016-2223-3.
2
Proliferation genes in lung development associated with the prognosis of lung adenocarcinoma but not squamous cell carcinoma.肺发育中的增殖基因与肺腺癌的预后相关,但与肺鳞状细胞癌无关。
Cancer Sci. 2018 Feb;109(2):308-316. doi: 10.1111/cas.13456. Epub 2017 Dec 27.
3
Analysis of gene expression profiles of lung cancer subtypes with machine learning algorithms.基于机器学习算法的肺癌亚型基因表达谱分析。
Biochim Biophys Acta Mol Basis Dis. 2020 Aug 1;1866(8):165822. doi: 10.1016/j.bbadis.2020.165822. Epub 2020 Apr 28.
4
Identification of microRNAs differentially expressed between lung squamous cell carcinoma and lung adenocarcinoma.鉴定肺鳞癌和肺腺癌之间差异表达的 microRNAs。
Mol Med Rep. 2013 Aug;8(2):456-62. doi: 10.3892/mmr.2013.1517. Epub 2013 Jun 11.
5
A robust tool for discriminative analysis and feature selection in paired samples impacts the identification of the genes essential for reprogramming lung tissue to adenocarcinoma.一种强大的用于配对样本判别分析和特征选择的工具,影响了对将肺组织重编程为腺癌所必需的基因的识别。
BMC Genomics. 2011 Nov 30;12 Suppl 3(Suppl 3):S24. doi: 10.1186/1471-2164-12-S3-S24.
6
miRNAs expression profiling to distinguish lung squamous-cell carcinoma from adenocarcinoma subtypes.miRNAs 表达谱区分肺鳞癌与腺癌亚型。
J Cancer Res Clin Oncol. 2012 Oct;138(10):1641-50. doi: 10.1007/s00432-012-1240-0. Epub 2012 May 22.
7
Analysis of lncRNA expression profiles in non-small cell lung cancers (NSCLC) and their clinical subtypes.非小细胞肺癌(NSCLC)及其临床亚型中lncRNA表达谱分析。
Lung Cancer. 2014 Aug;85(2):110-5. doi: 10.1016/j.lungcan.2014.05.011. Epub 2014 May 22.
8
In silico comparative genomic analysis of two non-small cell lung cancer subtypes and their potentials for cancer classification.两种非小细胞肺癌亚型的计算机模拟比较基因组分析及其癌症分类潜力
Cancer Genomics Proteomics. 2014 Nov-Dec;11(6):303-10.
9
Identification of transcriptional subtypes in lung adenocarcinoma and squamous cell carcinoma through integrative analysis of microarray and RNA sequencing data.通过微阵列和 RNA 测序数据的综合分析鉴定肺腺癌和鳞状细胞癌中的转录亚型。
Sci Rep. 2021 Apr 22;11(1):8709. doi: 10.1038/s41598-021-88209-4.
10
Cancer Stemness-Based Prognostic Immune-Related Gene Signatures in Lung Adenocarcinoma and Lung Squamous Cell Carcinoma.基于癌症干性的肺腺癌和肺鳞癌预后免疫相关基因特征。
Front Endocrinol (Lausanne). 2021 Oct 21;12:755805. doi: 10.3389/fendo.2021.755805. eCollection 2021.

引用本文的文献

1
Computational identification of key genetic drivers in COPD: A first step towards uncovering candidate biomarkers in smokers.慢性阻塞性肺疾病关键基因驱动因素的计算识别:揭示吸烟者候选生物标志物的第一步。
Biochem Biophys Rep. 2025 Aug 1;43:102193. doi: 10.1016/j.bbrep.2025.102193. eCollection 2025 Sep.
2
Machine learning approaches for EGFR mutation status prediction in NSCLC: an updated systematic review.用于非小细胞肺癌中表皮生长因子受体突变状态预测的机器学习方法:一项更新的系统评价
Front Oncol. 2025 Jul 10;15:1576461. doi: 10.3389/fonc.2025.1576461. eCollection 2025.
3
Classification of lung cancer severity using gene expression data based on deep learning.

本文引用的文献

1
Staging of lung cancer: the role of noninvasive, minimally invasive and invasive techniques.肺癌分期:非侵入性、微创性和侵入性技术的作用。
Eur Respir J. 2015 Aug;46(2):521-31. doi: 10.1183/09031936.00126714. Epub 2015 May 14.
2
Genomic profiling toward precision medicine in non-small cell lung cancer: getting beyond EGFR.非小细胞肺癌精准医学的基因组分析:超越表皮生长因子受体
Pharmgenomics Pers Med. 2015 Feb 20;8:63-79. doi: 10.2147/PGPM.S52845. eCollection 2015.
3
ATDC induces an invasive switch in KRAS-induced pancreatic tumorigenesis.
基于深度学习的基因表达数据对肺癌严重程度的分类
BMC Med Inform Decis Mak. 2025 May 14;25(1):184. doi: 10.1186/s12911-025-03011-w.
4
Identification of shared pathogenic signatures of multiple sclerosis and chronic obstructive pulmonary disease: an integrated transcriptomic analysis of blood specimens.多发性硬化症和慢性阻塞性肺疾病共同致病特征的鉴定:血液样本的综合转录组分析
Mol Genet Genomics. 2024 Dec 27;300(1):8. doi: 10.1007/s00438-024-02215-5.
5
HE2Gene: image-to-RNA translation via multi-task learning for spatial transcriptomics data.HE2 基因:通过多任务学习进行空间转录组学数据的图像到 RNA 的翻译。
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae343.
6
Endometrial Cancer-Adjacent Tissues Express Higher Levels of Cancer-Promoting Genes than the Matched Tumors.子宫内膜癌旁组织表达的促癌基因水平高于相应肿瘤。
Genes (Basel). 2022 Sep 8;13(9):1611. doi: 10.3390/genes13091611.
7
Genome-Wide Plasma Cell-Free DNA Methylation Profiling Identifies Potential Biomarkers for Lung Cancer.全基因组血浆游离 DNA 甲基化分析鉴定肺癌潜在生物标志物
Dis Markers. 2019 Feb 5;2019:4108474. doi: 10.1155/2019/4108474. eCollection 2019.
8
Application of omics in predicting anti-TNF efficacy in rheumatoid arthritis.组学在预测类风湿关节炎抗 TNF 疗效中的应用。
Clin Rheumatol. 2018 Jan;37(1):13-23. doi: 10.1007/s10067-017-3639-0. Epub 2017 Jun 10.
ATDC在KRAS诱导的胰腺肿瘤发生过程中引发侵袭性转变。
Genes Dev. 2015 Jan 15;29(2):171-83. doi: 10.1101/gad.253591.114.
4
Improving CT-guided transthoracic biopsy of mediastinal lesions by diffusion-weighted magnetic resonance imaging.通过扩散加权磁共振成像改善CT引导下纵隔病变的经胸活检
Clinics (Sao Paulo). 2014 Nov;69(11):787-91. doi: 10.6061/clinics/2014(11)13.
5
Classification of lung cancer using ensemble-based feature selection and machine learning methods.基于集成特征选择和机器学习方法的肺癌分类
Mol Biosyst. 2015 Mar;11(3):791-800. doi: 10.1039/c4mb00659c. Epub 2014 Dec 16.
6
In silico comparative genomic analysis of two non-small cell lung cancer subtypes and their potentials for cancer classification.两种非小细胞肺癌亚型的计算机模拟比较基因组分析及其癌症分类潜力
Cancer Genomics Proteomics. 2014 Nov-Dec;11(6):303-10.
7
COSMIC: exploring the world's knowledge of somatic mutations in human cancer.COSMIC:探索全球关于人类癌症体细胞突变的知识。
Nucleic Acids Res. 2015 Jan;43(Database issue):D805-11. doi: 10.1093/nar/gku1075. Epub 2014 Oct 29.
8
Stratified randomization controls better for batch effects in 450K methylation analysis: a cautionary tale.在450K甲基化分析中,分层随机化能更好地控制批次效应:一个警示故事。
Front Genet. 2014 Oct 13;5:354. doi: 10.3389/fgene.2014.00354. eCollection 2014.
9
Visualization-aided classification ensembles discriminate lung adenocarcinoma and squamous cell carcinoma samples using their gene expression profiles.可视化辅助分类集成通过基因表达谱区分肺腺癌和鳞状细胞癌样本。
PLoS One. 2014 Oct 15;9(10):e110052. doi: 10.1371/journal.pone.0110052. eCollection 2014.
10
Suitability of small bronchoscopic tumour specimens for lung cancer genotyping.小支气管镜肿瘤标本用于肺癌基因分型的适用性
Respiration. 2014;88(5):371-7. doi: 10.1159/000366136. Epub 2014 Oct 9.