• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用二次规划进行转录谱数据分析的最优去卷积及其在复杂临床血液样本中的应用。

Optimal deconvolution of transcriptional profiling data using quadratic programming with application to complex clinical blood samples.

机构信息

Biomarker Development, Novartis Institutes for BioMedical Research, Cambridge, Massachusetts, United States of America.

出版信息

PLoS One. 2011;6(11):e27156. doi: 10.1371/journal.pone.0027156. Epub 2011 Nov 16.

DOI:10.1371/journal.pone.0027156
PMID:22110609
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3217948/
Abstract

Large-scale molecular profiling technologies have assisted the identification of disease biomarkers and facilitated the basic understanding of cellular processes. However, samples collected from human subjects in clinical trials possess a level of complexity, arising from multiple cell types, that can obfuscate the analysis of data derived from them. Failure to identify, quantify, and incorporate sources of heterogeneity into an analysis can have widespread and detrimental effects on subsequent statistical studies.We describe an approach that builds upon a linear latent variable model, in which expression levels from mixed cell populations are modeled as the weighted average of expression from different cell types. We solve these equations using quadratic programming, which efficiently identifies the globally optimal solution while preserving non-negativity of the fraction of the cells. We applied our method to various existing platforms to estimate proportions of different pure cell or tissue types and gene expression profilings of distinct phenotypes, with a focus on complex samples collected in clinical trials. We tested our methods on several well controlled benchmark data sets with known mixing fractions of pure cell or tissue types and mRNA expression profiling data from samples collected in a clinical trial. Accurate agreement between predicted and actual mixing fractions was observed. In addition, our method was able to predict mixing fractions for more than ten species of circulating cells and to provide accurate estimates for relatively rare cell types (<10% total population). Furthermore, accurate changes in leukocyte trafficking associated with Fingolomid (FTY720) treatment were identified that were consistent with previous results generated by both cell counts and flow cytometry. These data suggest that our method can solve one of the open questions regarding the analysis of complex transcriptional data: namely, how to identify the optimal mixing fractions in a given experiment.

摘要

大规模分子分析技术有助于鉴定疾病生物标志物,并促进对细胞过程的基本理解。然而,临床试验中从人体样本中收集的样本具有一定程度的复杂性,来自多种细胞类型,这可能会使对从中获得的数据的分析变得复杂。未能识别、量化并将异质性来源纳入分析中会对随后的统计研究产生广泛而不利的影响。我们描述了一种基于线性潜在变量模型的方法,其中混合细胞群体的表达水平被建模为不同细胞类型表达的加权平均值。我们使用二次规划来求解这些方程,该方法有效地确定了全局最优解,同时保持细胞分数的非负性。我们将我们的方法应用于各种现有的平台,以估计不同纯细胞或组织类型的比例以及不同表型的基因表达谱,重点是在临床试验中收集的复杂样本。我们在几个具有已知纯细胞或组织类型混合分数的受控基准数据集和临床试验中收集的样本的 mRNA 表达谱数据上测试了我们的方法。观察到预测和实际混合分数之间的准确一致性。此外,我们的方法能够预测超过十种循环细胞的混合分数,并为相对罕见的细胞类型(<10%总群体)提供准确的估计。此外,还确定了与 Fingolomid(FTY720)治疗相关的白细胞迁移的准确变化,这与细胞计数和流式细胞术生成的先前结果一致。这些数据表明,我们的方法可以解决分析复杂转录数据的一个悬而未决的问题:即如何在给定实验中确定最佳混合分数。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49d8/3217948/5e35437df8e1/pone.0027156.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49d8/3217948/cc1ef3cac2db/pone.0027156.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49d8/3217948/61454fa9eddf/pone.0027156.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49d8/3217948/1536faafc4ef/pone.0027156.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49d8/3217948/c724d8e70fab/pone.0027156.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49d8/3217948/5e35437df8e1/pone.0027156.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49d8/3217948/cc1ef3cac2db/pone.0027156.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49d8/3217948/61454fa9eddf/pone.0027156.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49d8/3217948/1536faafc4ef/pone.0027156.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49d8/3217948/c724d8e70fab/pone.0027156.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49d8/3217948/5e35437df8e1/pone.0027156.g005.jpg

相似文献

1
Optimal deconvolution of transcriptional profiling data using quadratic programming with application to complex clinical blood samples.使用二次规划进行转录谱数据分析的最优去卷积及其在复杂临床血液样本中的应用。
PLoS One. 2011;6(11):e27156. doi: 10.1371/journal.pone.0027156. Epub 2011 Nov 16.
2
DeconRNASeq: a statistical framework for deconvolution of heterogeneous tissue samples based on mRNA-Seq data.DeconRNASeq:基于 mRNA-Seq 数据对异质组织样本进行去卷积的统计框架。
Bioinformatics. 2013 Apr 15;29(8):1083-5. doi: 10.1093/bioinformatics/btt090. Epub 2013 Feb 21.
3
Complete deconvolution of cellular mixtures based on linearity of transcriptional signatures.基于转录特征的线性关系实现细胞混合物的完全去卷积。
Nat Commun. 2019 May 17;10(1):2209. doi: 10.1038/s41467-019-09990-5.
4
DeCompress: tissue compartment deconvolution of targeted mRNA expression panels using compressed sensing.DeCompress:使用压缩感知对靶向 mRNA 表达谱进行组织区室去卷积。
Nucleic Acids Res. 2021 May 7;49(8):e48. doi: 10.1093/nar/gkab031.
5
Computational expression deconvolution in a complex mammalian organ.复杂哺乳动物器官中的计算性表达反卷积
BMC Bioinformatics. 2006 Jul 3;7:328. doi: 10.1186/1471-2105-7-328.
6
Exploration of the sputum methylome and omics deconvolution by quadratic programming in molecular profiling of asthma and COPD: the road to sputum omics 2.0.哮喘和 COPD 分子谱分析中通过二次规划进行痰甲基组学和组学去卷积的探索:迈向痰组学 2.0。
Respir Res. 2020 Oct 19;21(1):274. doi: 10.1186/s12931-020-01544-4.
7
Deconvolving the contributions of cell-type heterogeneity on cortical gene expression.解析细胞类型异质性对皮质基因表达的贡献。
PLoS Comput Biol. 2020 Aug 17;16(8):e1008120. doi: 10.1371/journal.pcbi.1008120. eCollection 2020 Aug.
8
In silico microdissection of microarray data from heterogeneous cell populations.对来自异质细胞群体的微阵列数据进行计算机模拟显微切割。
BMC Bioinformatics. 2005 Mar 14;6:54. doi: 10.1186/1471-2105-6-54.
9
Deblender: a semi-/unsupervised multi-operational computational method for complete deconvolution of expression data from heterogeneous samples.Deblender:一种半/无监督的多操作计算方法,用于从异质样本中完全反卷积表达数据。
BMC Bioinformatics. 2018 Nov 7;19(1):408. doi: 10.1186/s12859-018-2442-5.
10
Semi-supervised Nonnegative Matrix Factorization for gene expression deconvolution: a case study.半监督非负矩阵分解在基因表达解卷积中的应用:案例研究。
Infect Genet Evol. 2012 Jul;12(5):913-21. doi: 10.1016/j.meegid.2011.08.014. Epub 2011 Sep 10.

引用本文的文献

1
Approaching the holistic transcriptome-convolution and deconvolution in transcriptomics.探索转录组学中的整体转录组卷积与反卷积
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf388.
2
Reference-free deconvolution of complex samples based on cross-cell-type differential analysis: Systematic evaluations with various feature selection options.基于跨细胞类型差异分析的复杂样本无参考去卷积:使用各种特征选择选项的系统评估
Front Genet. 2025 May 30;16:1570781. doi: 10.3389/fgene.2025.1570781. eCollection 2025.
3
Unveiling the immune microenvironment of complex tissues and tumors in transcriptomics through a deconvolution approach.

本文引用的文献

1
Probabilistic analysis of gene expression measurements from heterogeneous tissues.从异质组织中测量基因表达的概率分析。
Bioinformatics. 2010 Oct 15;26(20):2571-7. doi: 10.1093/bioinformatics/btq406. Epub 2010 Jul 14.
2
Cell type-specific gene expression differences in complex tissues.复杂组织中的细胞类型特异性基因表达差异。
Nat Methods. 2010 Apr;7(4):287-9. doi: 10.1038/nmeth.1439. Epub 2010 Mar 7.
3
Statistical expression deconvolution from mixed tissue samples.统计表达去卷积从混合组织样本。
通过反卷积方法揭示转录组学中复杂组织和肿瘤的免疫微环境。
BMC Cancer. 2025 May 1;25(Suppl 1):733. doi: 10.1186/s12885-025-14089-w.
4
Deconer: An Evaluation Toolkit for Reference-based Deconvolution Methods Using Gene Expression Data.Deconer:一种使用基因表达数据的基于参考的反卷积方法评估工具包。
Genomics Proteomics Bioinformatics. 2025 Feb 18. doi: 10.1093/gpbjnl/qzaf009.
5
Deconvolution from bulk gene expression by leveraging sample-wise and gene-wise similarities and single-cell RNA-Seq data.通过利用样本和基因之间的相似性以及单细胞 RNA-Seq 数据进行批量基因表达的反卷积。
BMC Genomics. 2024 Sep 18;25(1):875. doi: 10.1186/s12864-024-10728-x.
6
CATD: a reproducible pipeline for selecting cell-type deconvolution methods across tissues.CATD:一种用于跨组织选择细胞类型反卷积方法的可重复流程。
Bioinform Adv. 2024 Mar 23;4(1):vbae048. doi: 10.1093/bioadv/vbae048. eCollection 2024.
7
Highly Accurate Estimation of Cell Type Abundance in Bulk Tissues Based on Single-Cell Reference and Domain Adaptive Matching.基于单细胞参考和领域自适应匹配的批量组织中细胞类型丰度的高精度估计。
Adv Sci (Weinh). 2024 Feb;11(7):e2306329. doi: 10.1002/advs.202306329. Epub 2023 Dec 10.
8
Systematic investigation of mitochondrial transfer between cancer cells and T cells at single-cell resolution.系统研究了单细胞分辨率下癌细胞和 T 细胞之间的线粒体转移。
Cancer Cell. 2023 Oct 9;41(10):1788-1802.e10. doi: 10.1016/j.ccell.2023.09.003.
9
Performance of tumour microenvironment deconvolution methods in breast cancer using single-cell simulated bulk mixtures.使用单细胞模拟的肿瘤微环境去卷积方法在乳腺癌中的性能。
Nat Commun. 2023 Sep 16;14(1):5758. doi: 10.1038/s41467-023-41385-5.
10
Deconvolution of bulk gene expression profiles reveals the association between immune cell polarization and the prognosis of hepatocellular carcinoma patients.对批量基因表达谱进行反卷积揭示了免疫细胞极化与肝癌患者预后之间的关联。
Cancer Med. 2023 Jul;12(14):15736-15760. doi: 10.1002/cam4.6197. Epub 2023 Jun 27.
Bioinformatics. 2010 Apr 15;26(8):1043-9. doi: 10.1093/bioinformatics/btq097. Epub 2010 Mar 4.
4
Biomarker discovery in heterogeneous tissue samples -taking the in-silico deconfounding approach.在异质组织样本中进行生物标志物发现——采用计算机去混淆方法。
BMC Bioinformatics. 2010 Jan 14;11:27. doi: 10.1186/1471-2105-11-27.
5
Functional analysis of OMICs data and small molecule compounds in an integrated "knowledge-based" platform.在一个集成的“基于知识”平台中对组学数据和小分子化合物进行功能分析。
Methods Mol Biol. 2009;563:177-96. doi: 10.1007/978-1-60761-175-2_10.
6
Deconvolution of blood microarray data identifies cellular activation patterns in systemic lupus erythematosus.血液微阵列数据的反卷积识别系统性红斑狼疮中的细胞活化模式。
PLoS One. 2009 Jul 1;4(7):e6098. doi: 10.1371/journal.pone.0006098.
7
ISOLATE: a computational strategy for identifying the primary origin of cancers using high-throughput sequencing.ISOLATE:一种使用高通量测序识别癌症主要起源的计算策略。
Bioinformatics. 2009 Nov 1;25(21):2882-9. doi: 10.1093/bioinformatics/btp378. Epub 2009 Jun 19.
8
Alternative isoform regulation in human tissue transcriptomes.人类组织转录组中的可变亚型调控
Nature. 2008 Nov 27;456(7221):470-6. doi: 10.1038/nature07509.
9
FTY720 therapy exerts differential effects on T cell subsets in multiple sclerosis.FTY720疗法对多发性硬化症中的T细胞亚群具有不同的作用。
Neurology. 2008 Oct 14;71(16):1261-7. doi: 10.1212/01.wnl.0000327609.57688.ea.
10
Rat toxicogenomic study reveals analytical consistency across microarray platforms.大鼠毒理基因组学研究揭示了不同微阵列平台间的分析一致性。
Nat Biotechnol. 2006 Sep;24(9):1162-9. doi: 10.1038/nbt1238.