• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种强大的用于配对样本判别分析和特征选择的工具,影响了对将肺组织重编程为腺癌所必需的基因的识别。

A robust tool for discriminative analysis and feature selection in paired samples impacts the identification of the genes essential for reprogramming lung tissue to adenocarcinoma.

机构信息

Bioinformatics Institute, A-STAR, Singapore.

出版信息

BMC Genomics. 2011 Nov 30;12 Suppl 3(Suppl 3):S24. doi: 10.1186/1471-2164-12-S3-S24.

DOI:10.1186/1471-2164-12-S3-S24
PMID:22369099
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3377915/
Abstract

BACKGROUND

Lung cancer is the leading cause of cancer deaths in the world. The most common type of lung cancer is lung adenocarcinoma (AC). The genetic mechanisms of the early stages and lung AC progression steps are poorly understood. There is currently no clinically applicable gene test for the early diagnosis and AC aggressiveness. Among the major reasons for the lack of reliable diagnostic biomarkers are the extraordinary heterogeneity of the cancer cells, complex and poorly understudied interactions of the AC cells with adjacent tissue and immune system, gene variation across patient cohorts, measurement variability, small sample sizes and sub-optimal analytical methods. We suggest that gene expression profiling of the primary tumours and adjacent tissues (PT-AT) handled with a rational statistical and bioinformatics strategy of biomarker prediction and validation could provide significant progress in the identification of clinical biomarkers of AC. To minimise sample-to-sample variability, repeated multivariate measurements in the same object (organ or tissue, e.g. PT-AT in lung) across patients should be designed, but prediction and validation on the genome scale with small sample size is a great methodical challenge.

RESULTS

To analyse PT-AT relationships efficiently in the statistical modelling, we propose an Extreme Class Discrimination (ECD) feature selection method that identifies a sub-set of the most discriminative variables (e.g. expressed genes). Our method consists of a paired Cross-normalization (CN) step followed by a modified sign Wilcoxon test with multivariate adjustment carried out for each variable. Using an Affymetrix U133A microarray paired dataset of 27 AC patients, we reviewed the global reprogramming of the transcriptome in human lung AC tissue versus normal lung tissue, which is associated with about 2,300 genes discriminating the tissues with 100% accuracy. Cluster analysis applied to these genes resulted in four distinct gene groups which we classified as associated with (i) up-regulated genes in the mitotic cell cycle lung AC, (ii) silenced/suppressed gene specific for normal lung tissue, (iii) cell communication and cell motility and (iv) the immune system features. The genes related to mutagenesis, specific lung cancers, early stage of AC development, tumour aggressiveness and metabolic pathway alterations and adaptations of cancer cells are strongly enriched in the AC PT-AT discriminative gene set. Two AC diagnostic biomarkers SPP1 and CENPA were successfully validated on RT-RCR tissue array. ECD method was systematically compared to several alternative methods and proved to be of better performance and as well as it was validated by comparison of the predicted gene set with literature meta-signature.

CONCLUSIONS

We developed a method that identifies and selects highly discriminative variables from high dimensional data spaces of potential biomarkers based on a statistical analysis of paired samples when the number of samples is small. This method provides superior selection in comparison to conventional methods and can be widely used in different applications. Our method revealed at least 23 hundreds patho-biologically essential genes associated with the global transcriptional reprogramming of human lung epithelium cells and lung AC aggressiveness. This gene set includes many previously published AC biomarkers reflecting inherent disease complexity and specifies the mechanisms of carcinogenesis in the lung AC. SPP1, CENPA and many other PT-AT discriminative genes could be considered as the prospective diagnostic and prognostic biomarkers of lung AC.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/0847a3953d30/1471-2164-12-S3-S24-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/9142dbc1e50e/1471-2164-12-S3-S24-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/ed7f77d25d14/1471-2164-12-S3-S24-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/2d9e8c7f441f/1471-2164-12-S3-S24-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/791b3e7d3d09/1471-2164-12-S3-S24-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/67469e3e450d/1471-2164-12-S3-S24-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/d8eddf7ebcb8/1471-2164-12-S3-S24-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/1cf04dafac23/1471-2164-12-S3-S24-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/daab005cf25c/1471-2164-12-S3-S24-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/69be8d607e44/1471-2164-12-S3-S24-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/0847a3953d30/1471-2164-12-S3-S24-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/9142dbc1e50e/1471-2164-12-S3-S24-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/ed7f77d25d14/1471-2164-12-S3-S24-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/2d9e8c7f441f/1471-2164-12-S3-S24-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/791b3e7d3d09/1471-2164-12-S3-S24-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/67469e3e450d/1471-2164-12-S3-S24-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/d8eddf7ebcb8/1471-2164-12-S3-S24-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/1cf04dafac23/1471-2164-12-S3-S24-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/daab005cf25c/1471-2164-12-S3-S24-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/69be8d607e44/1471-2164-12-S3-S24-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8654/3377915/0847a3953d30/1471-2164-12-S3-S24-10.jpg
摘要

背景

肺癌是世界上癌症死亡的主要原因。最常见的肺癌类型是肺腺癌(AC)。肺癌早期阶段和 AC 进展步骤的遗传机制还了解甚少。目前,没有用于早期诊断和 AC 侵袭性的临床适用的基因检测。缺乏可靠诊断生物标志物的主要原因之一是癌细胞的非凡异质性、AC 细胞与相邻组织和免疫系统之间复杂且研究不足的相互作用、患者队列之间的基因变异、测量变异性、样本量小以及分析方法不理想。我们建议,通过对原发性肿瘤和相邻组织(PT-AT)进行基因表达谱分析,并采用合理的统计和生物信息学策略进行生物标志物预测和验证,可能会在识别 AC 的临床生物标志物方面取得重大进展。为了最大程度地减少样本间的变异性,应针对患者的同一对象(器官或组织,例如肺中的 PT-AT)设计重复的多变量测量,但在基因组范围内进行小样本量的预测和验证是一项重大的方法学挑战。

结果

为了在统计建模中有效地分析 PT-AT 关系,我们提出了一种极端类别判别(ECD)特征选择方法,该方法可以识别出最具判别力的变量子集(例如表达基因)。我们的方法包括配对的交叉归一化(CN)步骤,然后对每个变量进行多元调整的改进符号 Wilcoxon 检验。使用 27 例 AC 患者的 Affymetrix U133A 微阵列配对数据集,我们回顾了人类肺 AC 组织与正常肺组织中转录组的全局重编程,该重编程与约 2300 个基因有关,这些基因可以 100%准确地区分组织。对这些基因进行聚类分析得到了四个不同的基因群,我们将其分类为与(i)肺 AC 有丝分裂细胞周期中上调的基因、(ii)正常肺组织中沉默/抑制的基因、(iii)细胞通讯和细胞运动以及(iv)免疫系统特征相关的基因。与致突变、特定肺癌、AC 早期发展、肿瘤侵袭性和代谢途径改变以及癌细胞适应相关的基因在 AC-PT-AT 判别基因集中强烈富集。两种 AC 诊断生物标志物 SPP1 和 CENPA 已成功通过 RT-RCR 组织阵列进行验证。ECD 方法与几种替代方法进行了系统比较,结果证明其性能更好,并且通过将预测基因集与文献荟萃签名进行比较进行了验证。

结论

我们开发了一种方法,该方法基于对小样本量的配对样本的统计分析,从潜在生物标志物的高维数据空间中识别和选择高度判别性变量。与传统方法相比,该方法具有更好的选择性能,并且可以广泛应用于不同的应用。我们的方法揭示了至少 2300 个与人类肺上皮细胞和肺 AC 侵袭性的全局转录重编程相关的病理生物学必需基因。该基因集包括许多先前发表的 AC 生物标志物,反映了疾病的固有复杂性,并指定了肺 AC 中的致癌机制。SPP1、CENPA 和许多其他 PT-AT 判别基因可被视为肺 AC 的潜在诊断和预后生物标志物。

相似文献

1
A robust tool for discriminative analysis and feature selection in paired samples impacts the identification of the genes essential for reprogramming lung tissue to adenocarcinoma.一种强大的用于配对样本判别分析和特征选择的工具,影响了对将肺组织重编程为腺癌所必需的基因的识别。
BMC Genomics. 2011 Nov 30;12 Suppl 3(Suppl 3):S24. doi: 10.1186/1471-2164-12-S3-S24.
2
A novel strategy of integrated microarray analysis identifies CENPA, CDK1 and CDC20 as a cluster of diagnostic biomarkers in lung adenocarcinoma.一种综合微阵列分析的新策略确定 CENPA、CDK1 和 CDC20 为肺腺癌的一组诊断生物标志物。
Cancer Lett. 2018 Jul 1;425:43-53. doi: 10.1016/j.canlet.2018.03.043. Epub 2018 Mar 31.
3
Development and Validation of an Individualized Immune Prognostic Signature in Early-Stage Nonsquamous Non-Small Cell Lung Cancer.早期非鳞状非小细胞肺癌个体化免疫预后标志物的建立与验证。
JAMA Oncol. 2017 Nov 1;3(11):1529-1537. doi: 10.1001/jamaoncol.2017.1609.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes.联合分析两个微阵列基因表达数据集以选择肺腺癌标记基因。
BMC Bioinformatics. 2004 Jun 24;5:81. doi: 10.1186/1471-2105-5-81.
6
Gene expression analysis in clear cell renal cell carcinoma using gene set enrichment analysis for biostatistical management.基于基因集富集分析的 clear cell 肾细胞癌基因表达分析用于生物统计学管理。
BJU Int. 2011 Jul;108(2 Pt 2):E29-35. doi: 10.1111/j.1464-410X.2010.09794.x. Epub 2011 Mar 16.
7
Robust prognostic gene expression signatures in bladder cancer and lung adenocarcinoma depend on cell cycle related genes.膀胱癌和肺腺癌中强大的预后基因表达特征取决于细胞周期相关基因。
PLoS One. 2014 Jan 22;9(1):e85249. doi: 10.1371/journal.pone.0085249. eCollection 2014.
8
Pathway-based identification of a smoking associated 6-gene signature predictive of lung cancer risk and survival.基于通路的方法鉴定出与吸烟相关的 6 个基因签名,可预测肺癌风险和生存。
Artif Intell Med. 2012 Jun;55(2):97-105. doi: 10.1016/j.artmed.2012.01.001. Epub 2012 Feb 11.
9
Test on existence of histology subtype-specific prognostic signatures among early stage lung adenocarcinoma and squamous cell carcinoma patients using a Cox-model based filter.使用基于Cox模型的筛选方法对早期肺腺癌和肺鳞癌患者中组织学亚型特异性预后特征的存在情况进行检测。
Biol Direct. 2015 Apr 7;10:15. doi: 10.1186/s13062-015-0051-z.
10
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍

引用本文的文献

1
GRANT Motif Regulates CENP-A Incorporation and Restricts RNA Polymerase II Accessibility at Centromere.GRANT 基序调节着着丝粒蛋白 A 的掺入,并限制着 RNA 聚合酶 II 在着丝粒处的可及性。
Genes (Basel). 2022 Sep 22;13(10):1697. doi: 10.3390/genes13101697.
2
Hub genes and key pathways of non-small lung cancer identified using bioinformatics.使用生物信息学鉴定非小细胞肺癌的枢纽基因和关键通路。
Oncol Lett. 2018 Aug;16(2):2344-2354. doi: 10.3892/ol.2018.8882. Epub 2018 Jun 4.
3
Multiple signatures of a disease in potential biomarker space: Getting the signatures consensus and identification of novel biomarkers.

本文引用的文献

1
A novel SNP analysis method to detect copy number alterations with an unbiased reference signal directly from tumor samples.一种新型 SNP 分析方法,可直接从肿瘤样本中利用无偏参考信号检测拷贝数改变。
BMC Med Genomics. 2011 Jan 26;4:14. doi: 10.1186/1755-8794-4-14.
2
Gene expression-based classification of non-small cell lung carcinomas and survival prediction.基于基因表达的非小细胞肺癌分类和生存预测。
PLoS One. 2010 Apr 22;5(4):e10312. doi: 10.1371/journal.pone.0010312.
3
SNP array analysis in hematologic malignancies: avoiding false discoveries.
潜在生物标志物空间中疾病的多种特征:达成特征共识并识别新型生物标志物。
BMC Genomics. 2015;16 Suppl 7(Suppl 7):S2. doi: 10.1186/1471-2164-16-S7-S2. Epub 2015 Jun 11.
4
Analyzing large gene expression and methylation data profiles using StatBicRM: statistical biclustering-based rule mining.使用StatBicRM分析大型基因表达和甲基化数据概况:基于统计双聚类的规则挖掘
PLoS One. 2015 Apr 1;10(4):e0119448. doi: 10.1371/journal.pone.0119448. eCollection 2015.
5
How bioinformatics influences health informatics: usage of biomolecular sequences, expression profiles and automated microscopic image analyses for clinical needs and public health.生物信息学如何影响健康信息学:生物分子序列、表达谱和自动化显微镜图像分析在临床需求和公共卫生中的应用。
Health Inf Sci Syst. 2013 Jan 10;1:2. doi: 10.1186/2047-2501-1-2. eCollection 2013.
6
Lung cancer transcriptomes refined with laser capture microdissection.利用激光捕获显微切割技术优化的肺癌转录组
Am J Pathol. 2014 Nov;184(11):2868-84. doi: 10.1016/j.ajpath.2014.06.028. Epub 2014 Aug 14.
7
Secretomes are a potential source of molecular targets for cancer therapies and indicate that APOE is a candidate biomarker for lung adenocarcinoma metastasis.分泌组是癌症治疗分子靶点的潜在来源,并表明载脂蛋白E是肺腺癌转移的候选生物标志物。
Mol Biol Rep. 2014 Nov;41(11):7507-23. doi: 10.1007/s11033-014-3641-4. Epub 2014 Aug 7.
8
Analysis of global changes in gene expression induced by human polynucleotide phosphorylase (hPNPase(old-35)).人类多核苷酸磷酸酶(hPNPase(old-35))诱导的基因表达全局变化分析。
J Cell Physiol. 2014 Dec;229(12):1952-62. doi: 10.1002/jcp.24645.
9
InCoB celebrates its tenth anniversary as first joint conference with ISCB-Asia.InCoB 庆祝其成立十周年,这是它首次与 ISCB-Asia 联合举办会议。
BMC Genomics. 2011 Nov 30;12 Suppl 3(Suppl 3):S1. doi: 10.1186/1471-2164-12-S3-S1.
血液恶性肿瘤中的 SNP 阵列分析:避免假发现。
Blood. 2010 May 27;115(21):4157-61. doi: 10.1182/blood-2009-11-203182. Epub 2010 Mar 19.
4
Targeting metabolic transformation for cancer therapy.针对癌症治疗的代谢重编程。
Nat Rev Cancer. 2010 Apr;10(4):267-77. doi: 10.1038/nrc2817. Epub 2010 Mar 19.
5
Gene expression-based prognostic signatures in lung cancer: ready for clinical use?基于基因表达的肺癌预后标志物:是否准备好用于临床?
J Natl Cancer Inst. 2010 Apr 7;102(7):464-74. doi: 10.1093/jnci/djq025. Epub 2010 Mar 16.
6
CENPA a genomic marker for centromere activity and human diseases.着丝粒活动的基因组标记 CENPA 与人类疾病。
Curr Genomics. 2009 Aug;10(5):326-35. doi: 10.2174/138920209788920985.
7
Frequent genetic differences between matched primary and metastatic breast cancer provide an approach to identification of biomarkers for disease progression.匹配的原发性和转移性乳腺癌之间频繁的遗传差异为识别疾病进展的生物标志物提供了一种方法。
Eur J Hum Genet. 2010 May;18(5):560-8. doi: 10.1038/ejhg.2009.230. Epub 2010 Jan 6.
8
Human cancers converge at the HIF-2alpha oncogenic axis.人类癌症集中在 HIF-2alpha 致癌轴上。
Proc Natl Acad Sci U S A. 2009 Dec 15;106(50):21306-11. doi: 10.1073/pnas.0906432106. Epub 2009 Dec 2.
9
Feature selection for predicting tumor metastases in microarray experiments using paired design.使用配对设计在微阵列实验中预测肿瘤转移的特征选择
Cancer Inform. 2007 Mar 20;3:213-8.
10
Epidermal growth factor receptor mutations in plasma DNA samples predict tumor response in Chinese patients with stages IIIB to IV non-small-cell lung cancer.血浆DNA样本中的表皮生长因子受体突变可预测中国IIIB至IV期非小细胞肺癌患者的肿瘤反应。
J Clin Oncol. 2009 Jun 1;27(16):2653-9. doi: 10.1200/JCO.2008.17.3930. Epub 2009 May 4.