• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CrossNorm:一种用于癌症微阵列数据的新型标准化策略。

CrossNorm: a novel normalization strategy for microarray data in cancers.

作者信息

Cheng Lixin, Lo Leung-Yau, Tang Nelson L S, Wang Dong, Leung Kwong-Sak

机构信息

Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong.

Department of Chemical Pathology, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong.

出版信息

Sci Rep. 2016 Jan 6;6:18898. doi: 10.1038/srep18898.

DOI:10.1038/srep18898
PMID:26732145
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4702063/
Abstract

Normalization is essential to get rid of biases in microarray data for their accurate analysis. Existing normalization methods for microarray gene expression data commonly assume a similar global expression pattern among samples being studied. However, scenarios of global shifts in gene expressions are dominant in cancers, making the assumption invalid. To alleviate the problem, here we propose and develop a novel normalization strategy, Cross Normalization (CrossNorm), for microarray data with unbalanced transcript levels among samples. Conventional procedures, such as RMA and LOESS, arbitrarily flatten the difference between case and control groups leading to biased gene expression estimates. Noticeably, applying these methods under the strategy of CrossNorm, which makes use of the overall statistics of the original signals, the results showed significantly improved robustness and accuracy in estimating transcript level dynamics for a series of publicly available datasets, including titration experiment, simulated data, spike-in data and several real-life microarray datasets across various types of cancers. The results have important implications for the past and the future cancer studies based on microarray samples with non-negligible difference. Moreover, the strategy can also be applied to other sorts of high-throughput data as long as the experiments have global expression variations between conditions.

摘要

归一化对于消除微阵列数据中的偏差以进行准确分析至关重要。现有的微阵列基因表达数据归一化方法通常假定所研究的样本之间存在相似的全局表达模式。然而,基因表达全局变化的情况在癌症中占主导地位,这使得该假设无效。为了缓解这个问题,我们在此提出并开发了一种新的归一化策略——交叉归一化(CrossNorm),用于处理样本间转录本水平不平衡的微阵列数据。传统方法,如RMA和LOESS,会任意抹平病例组和对照组之间的差异,导致基因表达估计出现偏差。值得注意的是,在利用原始信号的总体统计信息的交叉归一化策略下应用这些方法时,对于一系列公开可用的数据集,包括滴定实验、模拟数据、掺入数据以及跨越各种癌症类型的几个实际微阵列数据集,结果显示在估计转录本水平动态方面具有显著提高的稳健性和准确性。这些结果对于过去和未来基于差异不可忽略的微阵列样本的癌症研究具有重要意义。此外,只要实验在不同条件之间存在全局表达变化,该策略也可应用于其他类型的高通量数据。

相似文献

1
CrossNorm: a novel normalization strategy for microarray data in cancers.CrossNorm:一种用于癌症微阵列数据的新型标准化策略。
Sci Rep. 2016 Jan 6;6:18898. doi: 10.1038/srep18898.
2
Deciphering global signal features of high-throughput array data from cancers.解析来自癌症的高通量阵列数据的全局信号特征。
Mol Biosyst. 2014 Jun;10(6):1549-56. doi: 10.1039/c4mb00084f. Epub 2014 Apr 3.
3
Global rank-invariant set normalization (GRSN) to reduce systematic distortions in microarray data.用于减少微阵列数据中系统偏差的全局秩不变集归一化(GRSN)。
BMC Bioinformatics. 2008 Dec 4;9:520. doi: 10.1186/1471-2105-9-520.
4
A novel normalization method for effective removal of systematic variation in microarray data.一种有效去除微阵列数据中系统变异的新型标准化方法。
Nucleic Acids Res. 2006 Mar 9;34(5):e38. doi: 10.1093/nar/gkl024. Print 2006.
5
The effects of normalization on the correlation structure of microarray data.标准化对微阵列数据相关结构的影响。
BMC Bioinformatics. 2005 May 16;6:120. doi: 10.1186/1471-2105-6-120.
6
Using generalized procrustes analysis (GPA) for normalization of cDNA microarray data.使用广义普罗克汝斯分析(GPA)对cDNA微阵列数据进行标准化处理。
BMC Bioinformatics. 2008 Jan 16;9:25. doi: 10.1186/1471-2105-9-25.
7
ICN: a normalization method for gene expression data considering the over-expression of informative genes.ICN:一种考虑信息基因过表达的基因表达数据归一化方法。
Mol Biosyst. 2016 Oct 20;12(10):3057-66. doi: 10.1039/c6mb00386a. Epub 2016 Jul 25.
8
A new outlier removal approach for cDNA microarray normalization.一种用于cDNA微阵列标准化的新离群值去除方法。
Biotechniques. 2009 Aug;47(2):691-2, 694-700. doi: 10.2144/000113195.
9
Optimized LOWESS normalization parameter selection for DNA microarray data.用于DNA微阵列数据的优化LOWESS归一化参数选择
BMC Bioinformatics. 2004 Dec 9;5:194. doi: 10.1186/1471-2105-5-194.
10
Optimal consistency in microRNA expression analysis using reference-gene-based normalization.使用基于参考基因标准化的方法在微小RNA表达分析中实现最佳一致性
Mol Biosyst. 2015 May;11(5):1235-40. doi: 10.1039/c4mb00711e.

引用本文的文献

1
Adaptive individualized gene pair signatures distinguishing melanoma and predicting response to immune checkpoint blockade.区分黑色素瘤并预测免疫检查点阻断反应的适应性个体化基因对特征
iScience. 2025 Aug 8;28(9):113329. doi: 10.1016/j.isci.2025.113329. eCollection 2025 Sep 19.
2
Normalization and Selecting Non-Differentially Expressed Genes Improve Machine Learning Modelling of Cross-Platform Transcriptomic Data.归一化和选择非差异表达基因可改善跨平台转录组数据的机器学习建模
Trans Artif Intell. 2025;1(1). doi: 10.53941/tai.2025.100005. Epub 2025 May 25.
3
Normalization and selecting non-differentially expressed genes improve machine learning modelling of cross-platform transcriptomic data.

本文引用的文献

1
Identification of real microRNA precursors with a pseudo structure status composition approach.采用伪结构状态组成方法鉴定真实的微小RNA前体。
PLoS One. 2015 Mar 30;10(3):e0121501. doi: 10.1371/journal.pone.0121501. eCollection 2015.
2
Individual-level analysis of differential expression of genes and pathways for personalized medicine.用于个性化医疗的基因和通路差异表达的个体水平分析。
Bioinformatics. 2015 Jan 1;31(1):62-8. doi: 10.1093/bioinformatics/btu522. Epub 2014 Aug 26.
3
Deciphering global signal features of high-throughput array data from cancers.
标准化和选择非差异表达基因可改善跨平台转录组数据的机器学习建模。
ArXiv. 2025 Jan 24:arXiv:2501.14248v1.
4
Less is more: relative rank is more informative than absolute abundance for compositional NGS data.少即是多:对于组成性NGS数据,相对排名比绝对丰度更具信息量。
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elae045.
5
Pairwise analysis of gene expression for oral squamous cell carcinoma via a large-scale transcriptome integration.通过大规模转录组整合对口腔鳞状细胞癌的基因表达进行成对分析。
J Cell Mol Med. 2024 Oct;28(20):e70153. doi: 10.1111/jcmm.70153.
6
Deep learning model to discriminate diverse infection types based on pairwise analysis of host gene expression.基于宿主基因表达成对分析的深度学习模型,用于区分不同感染类型。
iScience. 2024 May 7;27(6):109908. doi: 10.1016/j.isci.2024.109908. eCollection 2024 Jun 21.
7
Machine learning-based investigation of regulated cell death for predicting prognosis and immunotherapy response in glioma patients.基于机器学习的调控性细胞死亡研究预测胶质瘤患者的预后和免疫治疗反应。
Sci Rep. 2024 Feb 20;14(1):4173. doi: 10.1038/s41598-024-54643-3.
8
MrGPS: an m6A-related gene pair signature to predict the prognosis and immunological impact of glioma patients.MrGPS:一种与 m6A 相关的基因对特征,用于预测胶质瘤患者的预后和免疫影响。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad498.
9
MicroRNAs as Targets for Cancer Diagnosis: Interests and Limitations.作为癌症诊断靶点的微小RNA:研究意义与局限性
Adv Pharm Bull. 2023 Jul;13(3):435-445. doi: 10.34172/apb.2023.047. Epub 2022 Jul 2.
10
Co-expression module analysis reveals high expression homogeneity for both coding and non-coding genes in sepsis.共表达模块分析揭示了脓毒症中编码基因和非编码基因的高表达同质性。
BMC Genomics. 2023 Jul 24;24(1):418. doi: 10.1186/s12864-023-09460-9.
解析来自癌症的高通量阵列数据的全局信号特征。
Mol Biosyst. 2014 Jun;10(6):1549-56. doi: 10.1039/c4mb00084f. Epub 2014 Apr 3.
4
QChIPat: a quantitative method to identify distinct binding patterns for two biological ChIP-seq samples in different experimental conditions.QChIPat:一种用于鉴定两种不同实验条件下的两个生物 ChIP-seq 样本的独特结合模式的定量方法。
BMC Genomics. 2013;14 Suppl 8(Suppl 8):S3. doi: 10.1186/1471-2164-14-S8-S3. Epub 2013 Dec 9.
5
Global gene expression distribution in non-cancerous complex diseases.非癌性复杂疾病中的全局基因表达分布
Mol Biosyst. 2014 Apr;10(4):728-31. doi: 10.1039/c3mb70524b. Epub 2014 Jan 31.
6
Survey of MapReduce frame operation in bioinformatics.生物信息学中MapReduce框架操作的调查。
Brief Bioinform. 2014 Jul;15(4):637-47. doi: 10.1093/bib/bbs088. Epub 2013 Feb 7.
7
Revisiting global gene expression analysis.重新审视全球基因表达分析。
Cell. 2012 Oct 26;151(3):476-82. doi: 10.1016/j.cell.2012.10.012.
8
Transcriptional amplification in tumor cells with elevated c-Myc.肿瘤细胞中 c-Myc 升高导致的转录扩增。
Cell. 2012 Sep 28;151(1):56-67. doi: 10.1016/j.cell.2012.08.026.
9
PaGeFinder: quantitative identification of spatiotemporal pattern genes.PaGeFinder:时空模式基因的定量识别。
Bioinformatics. 2012 Jun 1;28(11):1544-5. doi: 10.1093/bioinformatics/bts169. Epub 2012 Apr 6.
10
Extensive up-regulation of gene expression in cancer: the normalised use of microarray data.癌症中基因表达的广泛上调:微阵列数据的标准化应用
Mol Biosyst. 2012 Mar;8(3):818-27. doi: 10.1039/c2mb05466c. Epub 2012 Jan 10.