• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在异质组织样本中进行生物标志物发现——采用计算机去混淆方法。

Biomarker discovery in heterogeneous tissue samples -taking the in-silico deconfounding approach.

机构信息

Department of Genetics and Biometry, Research Institute for Biology of Farm Animals, Wilhelm-Stahl Allee 2, D 18196 Dummerstorf, Germany.

出版信息

BMC Bioinformatics. 2010 Jan 14;11:27. doi: 10.1186/1471-2105-11-27.

DOI:10.1186/1471-2105-11-27
PMID:20070912
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3098067/
Abstract

BACKGROUND

For heterogeneous tissues, such as blood, measurements of gene expression are confounded by relative proportions of cell types involved. Conclusions have to rely on estimation of gene expression signals for homogeneous cell populations, e.g. by applying micro-dissection, fluorescence activated cell sorting, or in-silico deconfounding. We studied feasibility and validity of a non-negative matrix decomposition algorithm using experimental gene expression data for blood and sorted cells from the same donor samples. Our objective was to optimize the algorithm regarding detection of differentially expressed genes and to enable its use for classification in the difficult scenario of reversely regulated genes. This would be of importance for the identification of candidate biomarkers in heterogeneous tissues.

RESULTS

Experimental data and simulation studies involving noise parameters estimated from these data revealed that for valid detection of differential gene expression, quantile normalization and use of non-log data are optimal. We demonstrate the feasibility of predicting proportions of constituting cell types from gene expression data of single samples, as a prerequisite for a deconfounding-based classification approach.Classification cross-validation errors with and without using deconfounding results are reported as well as sample-size dependencies. Implementation of the algorithm, simulation and analysis scripts are available.

CONCLUSIONS

The deconfounding algorithm without decorrelation using quantile normalization on non-log data is proposed for biomarkers that are difficult to detect, and for cases where confounding by varying proportions of cell types is the suspected reason. In this case, a deconfounding ranking approach can be used as a powerful alternative to, or complement of, other statistical learning approaches to define candidate biomarkers for molecular diagnosis and prediction in biomedicine, in realistically noisy conditions and with moderate sample sizes.

摘要

背景

对于异质组织,如血液,基因表达的测量受到涉及的细胞类型相对比例的影响。结论必须依赖于对同质细胞群体的基因表达信号的估计,例如通过应用显微切割、荧光激活细胞分选或计算机去混淆。我们使用来自同一供体样本的血液和分选细胞的实验基因表达数据研究了非负矩阵分解算法的可行性和有效性。我们的目的是优化该算法,以检测差异表达基因,并使其能够用于在反向调节基因的困难情况下进行分类。这对于在异质组织中识别候选生物标志物非常重要。

结果

实验数据和涉及从这些数据估计的噪声参数的模拟研究表明,对于有效检测差异基因表达,分位数归一化和使用非对数数据是最优的。我们证明了从单个样本的基因表达数据预测构成细胞类型比例的可行性,这是去混淆分类方法的前提。报告了有无去混淆结果的分类交叉验证错误以及样本大小依赖性。该算法的实现、模拟和分析脚本都可用。

结论

提出了一种使用分位数归一化和非对数数据的非相关去混淆算法,用于难以检测的生物标志物,以及怀疑细胞类型比例变化是混杂因素的情况。在这种情况下,去混淆排序方法可以作为其他统计学习方法的替代方法或补充方法,用于在合理噪声条件下和中等样本量下定义生物医学分子诊断和预测的候选生物标志物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/e82369983c7e/1471-2105-11-27-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/296ae8c151bd/1471-2105-11-27-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/c1e6ea863cb6/1471-2105-11-27-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/a00687132196/1471-2105-11-27-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/ef3b9b1cda17/1471-2105-11-27-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/0c22e1e5d620/1471-2105-11-27-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/e71ce850320f/1471-2105-11-27-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/a874e243c5ee/1471-2105-11-27-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/c541afd1b47e/1471-2105-11-27-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/0b17a8ac4c45/1471-2105-11-27-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/e82369983c7e/1471-2105-11-27-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/296ae8c151bd/1471-2105-11-27-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/c1e6ea863cb6/1471-2105-11-27-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/a00687132196/1471-2105-11-27-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/ef3b9b1cda17/1471-2105-11-27-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/0c22e1e5d620/1471-2105-11-27-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/e71ce850320f/1471-2105-11-27-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/a874e243c5ee/1471-2105-11-27-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/c541afd1b47e/1471-2105-11-27-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/0b17a8ac4c45/1471-2105-11-27-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45ed/3098067/e82369983c7e/1471-2105-11-27-10.jpg

相似文献

1
Biomarker discovery in heterogeneous tissue samples -taking the in-silico deconfounding approach.在异质组织样本中进行生物标志物发现——采用计算机去混淆方法。
BMC Bioinformatics. 2010 Jan 14;11:27. doi: 10.1186/1471-2105-11-27.
2
In silico microdissection of microarray data from heterogeneous cell populations.对来自异质细胞群体的微阵列数据进行计算机模拟显微切割。
BMC Bioinformatics. 2005 Mar 14;6:54. doi: 10.1186/1471-2105-6-54.
3
A non-transformation method for identifying differentially expressed genes from cDNA microarrays.一种从cDNA微阵列中鉴定差异表达基因的非转化方法。
Yi Chuan Xue Bao. 2006 Jan;33(1):80-8. doi: 10.1016/S0379-4172(06)60012-7.
4
Emerging translational bioinformatics: knowledge-guided biomarker identification for cancer diagnostics.新兴的转化生物信息学:用于癌症诊断的知识引导生物标志物识别
Annu Int Conf IEEE Eng Med Biol Soc. 2009;2009:4162-5. doi: 10.1109/IEMBS.2009.5333937.
5
Semi-supervised Nonnegative Matrix Factorization for gene expression deconvolution: a case study.半监督非负矩阵分解在基因表达解卷积中的应用:案例研究。
Infect Genet Evol. 2012 Jul;12(5):913-21. doi: 10.1016/j.meegid.2011.08.014. Epub 2011 Sep 10.
6
Probabilistic analysis of gene expression measurements from heterogeneous tissues.从异质组织中测量基因表达的概率分析。
Bioinformatics. 2010 Oct 15;26(20):2571-7. doi: 10.1093/bioinformatics/btq406. Epub 2010 Jul 14.
7
Use of normalization methods for analysis of microarrays containing a high degree of gene effects.使用标准化方法分析含有高度基因效应的微阵列。
BMC Bioinformatics. 2008 Nov 28;9:505. doi: 10.1186/1471-2105-9-505.
8
Biomarker discovery based on BBHA and AdaboostM1 on microarray data for cancer classification.基于BBHA和AdaboostM1的微阵列数据用于癌症分类的生物标志物发现。
Annu Int Conf IEEE Eng Med Biol Soc. 2016 Aug;2016:3080-3083. doi: 10.1109/EMBC.2016.7591380.
9
Deconfounding microarray analysis - independent measurements of cell type proportions used in a regression model to resolve tissue heterogeneity bias.消除微阵列分析中的混杂因素——在回归模型中使用细胞类型比例的独立测量来解决组织异质性偏差。
Methods Inf Med. 2006;45(5):557-63.
10
AUCTSP: an improved biomarker gene pair class predictor.AUCTSP:一种改进的生物标志物基因对分类预测器。
BMC Bioinformatics. 2018 Jun 26;19(1):244. doi: 10.1186/s12859-018-2231-1.

引用本文的文献

1
Approaching the holistic transcriptome-convolution and deconvolution in transcriptomics.探索转录组学中的整体转录组卷积与反卷积
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf388.
2
Reference-free deconvolution of complex samples based on cross-cell-type differential analysis: Systematic evaluations with various feature selection options.基于跨细胞类型差异分析的复杂样本无参考去卷积:使用各种特征选择选项的系统评估
Front Genet. 2025 May 30;16:1570781. doi: 10.3389/fgene.2025.1570781. eCollection 2025.
3
An augmented GSNMF model for complete deconvolution of bulk RNA-seq data.

本文引用的文献

1
A HaemAtlas: characterizing gene expression in differentiated human blood cells.一个血液图谱:表征分化的人类血细胞中的基因表达。
Blood. 2009 May 7;113(19):e1-9. doi: 10.1182/blood-2008-06-162958. Epub 2009 Feb 19.
2
Novel strategies to identify biomarkers in tuberculosis.用于鉴定结核病生物标志物的新策略。
Biol Chem. 2008 May;389(5):487-95. doi: 10.1515/bc.2008.053.
3
GlobalANCOVA: exploration and assessment of gene group effects.全局协方差分析:基因组效应的探索与评估
用于批量RNA测序数据完全反卷积的增强型广义非负矩阵分解模型。
Math Biosci Eng. 2025 Mar 14;22(4):988-1018. doi: 10.3934/mbe.2025036.
4
STsisal: a reference-free deconvolution pipeline for spatial transcriptomics data.STsisal:一种用于空间转录组学数据的无参考去卷积流程。
Front Genet. 2025 Mar 3;16:1512435. doi: 10.3389/fgene.2025.1512435. eCollection 2025.
5
Alleviating batch effects in cell type deconvolution with SCCAF-D.使用SCCAF-D减轻细胞类型反卷积中的批次效应。
Nat Commun. 2024 Dec 30;15(1):10867. doi: 10.1038/s41467-024-55213-x.
6
Deconvolution from bulk gene expression by leveraging sample-wise and gene-wise similarities and single-cell RNA-Seq data.通过利用样本和基因之间的相似性以及单细胞 RNA-Seq 数据进行批量基因表达的反卷积。
BMC Genomics. 2024 Sep 18;25(1):875. doi: 10.1186/s12864-024-10728-x.
7
Assessing transcriptomic heterogeneity of single-cell RNASeq data by bulk-level gene expression data.通过批量水平基因表达数据评估单细胞RNA测序数据的转录组异质性。
BMC Bioinformatics. 2024 Jun 12;25(1):209. doi: 10.1186/s12859-024-05825-3.
8
CATD: a reproducible pipeline for selecting cell-type deconvolution methods across tissues.CATD:一种用于跨组织选择细胞类型反卷积方法的可重复流程。
Bioinform Adv. 2024 Mar 23;4(1):vbae048. doi: 10.1093/bioadv/vbae048. eCollection 2024.
9
Fourteen years of cellular deconvolution: methodology, applications, technical evaluation and outstanding challenges.十四年的细胞去卷积:方法学、应用、技术评估和突出挑战。
Nucleic Acids Res. 2024 May 22;52(9):4761-4783. doi: 10.1093/nar/gkae267.
10
GEOMETRIC STRUCTURE GUIDED MODEL AND ALGORITHMS FOR COMPLETE DECONVOLUTION OF GENE EXPRESSION DATA.用于基因表达数据完全反卷积的几何结构引导模型及算法
Found Data Sci. 2022 Sep;4(3):441-466. doi: 10.3934/fods.2022013.
Bioinformatics. 2008 Jan 1;24(1):78-85. doi: 10.1093/bioinformatics/btm531. Epub 2007 Nov 17.
4
A comparison of background correction methods for two-colour microarrays.双色微阵列背景校正方法的比较
Bioinformatics. 2007 Oct 15;23(20):2700-7. doi: 10.1093/bioinformatics/btm412. Epub 2007 Aug 25.
5
Deconfounding microarray analysis - independent measurements of cell type proportions used in a regression model to resolve tissue heterogeneity bias.消除微阵列分析中的混杂因素——在回归模型中使用细胞类型比例的独立测量来解决组织异质性偏差。
Methods Inf Med. 2006;45(5):557-63.
6
Classifying gene expression profiles from pairwise mRNA comparisons.通过成对的mRNA比较对基因表达谱进行分类。
Stat Appl Genet Mol Biol. 2004;3:Article19. doi: 10.2202/1544-6115.1071. Epub 2004 Aug 30.
7
Sample selection for microarray gene expression studies.用于微阵列基因表达研究的样本选择。
Methods Inf Med. 2005;44(3):461-7.
8
Automated tissue analysis--a bioinformatics perspective.自动化组织分析——生物信息学视角
Methods Inf Med. 2005;44(1):32-7.
9
In silico microdissection of microarray data from heterogeneous cell populations.对来自异质细胞群体的微阵列数据进行计算机模拟显微切割。
BMC Bioinformatics. 2005 Mar 14;6:54. doi: 10.1186/1471-2105-6-54.
10
Efficient two-sample designs for microarray experiments with biological replications.用于具有生物学重复的微阵列实验的高效双样本设计。
In Silico Biol. 2004;4(4):461-70.