• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CDSeqR:从批量组织中进行基因表达数据的快速完整去卷积。

CDSeqR: fast complete deconvolution for gene expression data from bulk tissues.

机构信息

Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, Research Triangle Park, Durham, NC, 27709, USA.

出版信息

BMC Bioinformatics. 2021 May 24;22(1):262. doi: 10.1186/s12859-021-04186-5.

DOI:10.1186/s12859-021-04186-5
PMID:34030626
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8142515/
Abstract

BACKGROUND

Biological tissues consist of heterogenous populations of cells. Because gene expression patterns from bulk tissue samples reflect the contributions from all cells in the tissue, understanding the contribution of individual cell types to the overall gene expression in the tissue is fundamentally important. We recently developed a computational method, CDSeq, that can simultaneously estimate both sample-specific cell-type proportions and cell-type-specific gene expression profiles using only bulk RNA-Seq counts from multiple samples. Here we present an R implementation of CDSeq (CDSeqR) with significant performance improvement over the original implementation in MATLAB and an added new function to aid cell type annotation. The R package would be of interest for the broader R community.

RESULT

We developed a novel strategy to substantially improve computational efficiency in both speed and memory usage. In addition, we designed and implemented a new function for annotating the CDSeq estimated cell types using single-cell RNA sequencing (scRNA-seq) data. This function allows users to readily interpret and visualize the CDSeq estimated cell types. In addition, this new function further allows the users to annotate CDSeq-estimated cell types using marker genes. We carried out additional validations of the CDSeqR software using synthetic, real cell mixtures, and real bulk RNA-seq data from the Cancer Genome Atlas (TCGA) and the Genotype-Tissue Expression (GTEx) project.

CONCLUSIONS

The existing bulk RNA-seq repositories, such as TCGA and GTEx, provide enormous resources for better understanding changes in transcriptomics and human diseases. They are also potentially useful for studying cell-cell interactions in the tissue microenvironment. Bulk level analyses neglect tissue heterogeneity, however, and hinder investigation of a cell-type-specific expression. The CDSeqR package may aid in silico dissection of bulk expression data, enabling researchers to recover cell-type-specific information.

摘要

背景

生物组织由异质细胞群体组成。由于来自组织的批量样本的基因表达模式反映了组织中所有细胞的贡献,因此了解单个细胞类型对组织中整体基因表达的贡献从根本上是重要的。我们最近开发了一种计算方法 CDSeq,它可以仅使用来自多个样本的批量 RNA-Seq 计数同时估计样本特异性细胞类型比例和细胞类型特异性基因表达谱。在这里,我们提出了 CDSeq 的 R 实现(CDSeqR),与原始的 MATLAB 实现相比,它具有显著的性能改进,并增加了一个新功能来辅助细胞类型注释。这个 R 包将引起更广泛的 R 社区的兴趣。

结果

我们开发了一种新策略,可以大大提高计算效率,包括速度和内存使用。此外,我们设计并实现了一个新功能,用于使用单细胞 RNA 测序 (scRNA-seq) 数据注释 CDSeq 估计的细胞类型。该功能允许用户轻松解释和可视化 CDSeq 估计的细胞类型。此外,这个新功能还允许用户使用标记基因注释 CDSeq 估计的细胞类型。我们使用合成的、真实的细胞混合物以及来自癌症基因组图谱 (TCGA) 和基因型组织表达 (GTEx) 项目的真实批量 RNA-seq 数据,对 CDSeqR 软件进行了额外的验证。

结论

现有的批量 RNA-seq 存储库,如 TCGA 和 GTEx,为更好地理解转录组学和人类疾病的变化提供了巨大的资源。它们也可能有助于研究组织微环境中的细胞间相互作用。然而,批量水平分析忽略了组织异质性,并阻碍了对细胞类型特异性表达的研究。CDSeqR 包可能有助于对批量表达数据进行计算分析,使研究人员能够恢复细胞类型特异性信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/8142515/f9e253aaedbe/12859_2021_4186_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/8142515/525d89a71510/12859_2021_4186_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/8142515/eb8918a8969d/12859_2021_4186_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/8142515/9953bfb462ce/12859_2021_4186_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/8142515/f5a5be84c08e/12859_2021_4186_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/8142515/f9e253aaedbe/12859_2021_4186_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/8142515/525d89a71510/12859_2021_4186_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/8142515/eb8918a8969d/12859_2021_4186_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/8142515/9953bfb462ce/12859_2021_4186_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/8142515/f5a5be84c08e/12859_2021_4186_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/8142515/f9e253aaedbe/12859_2021_4186_Fig5_HTML.jpg

相似文献

1
CDSeqR: fast complete deconvolution for gene expression data from bulk tissues.CDSeqR:从批量组织中进行基因表达数据的快速完整去卷积。
BMC Bioinformatics. 2021 May 24;22(1):262. doi: 10.1186/s12859-021-04186-5.
2
CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data.CDSeq:一种使用基因表达数据对异质样本进行全面剖析的全新去卷积方法。
PLoS Comput Biol. 2019 Dec 2;15(12):e1007510. doi: 10.1371/journal.pcbi.1007510. eCollection 2019 Dec.
3
Omnibus and robust deconvolution scheme for bulk RNA sequencing data integrating multiple single-cell reference sets and prior biological knowledge.用于批量 RNA 测序数据的整体且稳健的去卷积方案,该方案整合了多个单细胞参考集和先验生物学知识。
Bioinformatics. 2022 Sep 30;38(19):4530-4536. doi: 10.1093/bioinformatics/btac563.
4
Deconvolution from bulk gene expression by leveraging sample-wise and gene-wise similarities and single-cell RNA-Seq data.通过利用样本和基因之间的相似性以及单细胞 RNA-Seq 数据进行批量基因表达的反卷积。
BMC Genomics. 2024 Sep 18;25(1):875. doi: 10.1186/s12864-024-10728-x.
5
Transcriptome size matters for single-cell RNA-seq normalization and bulk deconvolution.转录组大小对单细胞RNA测序标准化和批量反卷积很重要。
Nat Commun. 2025 Feb 1;16(1):1246. doi: 10.1038/s41467-025-56623-1.
6
SimBu: bias-aware simulation of bulk RNA-seq data with variable cell-type composition.SimBu:具有可变细胞类型组成的批量 RNA-seq 数据的偏差感知模拟。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii141-ii147. doi: 10.1093/bioinformatics/btac499.
7
Single cell transcriptomics in human osteoarthritis synovium and in silico deconvoluted bulk RNA sequencing.人骨关节炎滑膜单细胞转录组学和计算机反卷积批量 RNA 测序。
Osteoarthritis Cartilage. 2022 Mar;30(3):475-480. doi: 10.1016/j.joca.2021.12.007. Epub 2021 Dec 29.
8
MuSiC2: cell-type deconvolution for multi-condition bulk RNA-seq data.MuSiC2:用于多条件批量 RNA-seq 数据的细胞类型去卷积。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac430.
9
Comprehensive analysis of scRNA-seq and bulk RNA-seq reveals the non-cardiomyocytes heterogeneity and novel cell populations in dilated cardiomyopathy.单细胞RNA测序和批量RNA测序的综合分析揭示了扩张型心肌病中非心肌细胞的异质性和新的细胞群。
J Transl Med. 2025 Jan 6;23(1):17. doi: 10.1186/s12967-024-05983-1.
10
DeSide: A unified deep learning approach for cellular deconvolution of tumor microenvironment.DeSide:一种用于肿瘤微环境细胞去卷积的统一深度学习方法。
Proc Natl Acad Sci U S A. 2024 Nov 12;121(46):e2407096121. doi: 10.1073/pnas.2407096121. Epub 2024 Nov 8.

引用本文的文献

1
Penalised regression improves imputation of cell-type specific expression using RNA-seq data from mixed cell populations compared to domain-specific methods.与特定领域方法相比,惩罚回归利用来自混合细胞群体的RNA测序数据改进了细胞类型特异性表达的插补。
PLoS Comput Biol. 2025 Mar 7;21(3):e1012859. doi: 10.1371/journal.pcbi.1012859. eCollection 2025 Mar.
2
Muscle fibroblasts and stem cells stimulate motor neurons in an age and exercise-dependent manner.肌肉成纤维细胞和干细胞以年龄和运动依赖的方式刺激运动神经元。
Aging Cell. 2025 Mar;24(3):e14413. doi: 10.1111/acel.14413. Epub 2024 Nov 18.
3
DECOMICS, a shiny application for unsupervised cell type deconvolution and biological interpretation of bulk omic data.

本文引用的文献

1
Benchmarking of cell type deconvolution pipelines for transcriptomics data.基于转录组数据的细胞类型去卷积分析流水线的基准测试
Nat Commun. 2020 Nov 6;11(1):5650. doi: 10.1038/s41467-020-19015-1.
2
The GTEx Consortium atlas of genetic regulatory effects across human tissues.GTEx 联盟人类组织遗传调控效应图谱
Science. 2020 Sep 11;369(6509):1318-1330. doi: 10.1126/science.aaz1776.
3
Systematic comparison of single-cell and single-nucleus RNA-sequencing methods.单细胞和单细胞核 RNA 测序方法的系统比较。
DECOMICS,一款用于批量组学数据的无监督细胞类型反卷积和生物学解释的出色应用程序。
Bioinform Adv. 2024 Sep 20;4(1):vbae136. doi: 10.1093/bioadv/vbae136. eCollection 2024.
4
Performance of computational algorithms to deconvolve heterogeneous bulk ovarian tumor tissue depends on experimental factors.计算算法在去卷积异质卵巢肿瘤组织方面的性能取决于实验因素。
Genome Biol. 2023 Oct 20;24(1):239. doi: 10.1186/s13059-023-03077-7.
5
Detecting the effect of genetic diversity on brain composition in an Alzheimer's disease mouse model.在阿尔茨海默病小鼠模型中检测基因多样性对大脑组成的影响。
bioRxiv. 2023 Feb 28:2023.02.27.530226. doi: 10.1101/2023.02.27.530226.
6
Neuron-Glia-Ratio-Like Approach Evidenced for Limited Variability and In-Aggregate Circadian Shifts in Cortical Cell-Specific Transcriptomes.在皮质细胞特异性转录组中,类似神经元与神经胶质细胞比例的方法证明了有限的变异性和总体昼夜节律变化。
J Mol Neurosci. 2023 Mar;73(2-3):159-170. doi: 10.1007/s12031-023-02103-4. Epub 2023 Feb 6.
7
One Step Nucleic Acid Amplification (OSNA) Lysate Samples Are Suitable to Establish a Transcriptional Metastatic Signature in Patients with Early Stage Hormone Receptors-Positive Breast Cancer.一步核酸扩增(OSNA)裂解物样本适用于建立早期激素受体阳性乳腺癌患者的转录转移特征。
Cancers (Basel). 2022 Nov 28;14(23):5855. doi: 10.3390/cancers14235855.
Nat Biotechnol. 2020 Jun;38(6):737-746. doi: 10.1038/s41587-020-0465-8. Epub 2020 Apr 6.
4
Accurate estimation of cell composition in bulk expression through robust integration of single-cell information.通过稳健整合单细胞信息,准确估计批量表达中的细胞组成。
Nat Commun. 2020 Apr 24;11(1):1971. doi: 10.1038/s41467-020-15816-6.
5
Construction of a human cell landscape at single-cell level.在单细胞水平构建人类细胞图谱。
Nature. 2020 May;581(7808):303-309. doi: 10.1038/s41586-020-2157-4. Epub 2020 Mar 25.
6
SCDC: bulk gene expression deconvolution by multiple single-cell RNA sequencing references.SCDC:通过多个单细胞 RNA 测序参考进行批量基因表达去卷积。
Brief Bioinform. 2021 Jan 18;22(1):416-427. doi: 10.1093/bib/bbz166.
7
CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data.CDSeq:一种使用基因表达数据对异质样本进行全面剖析的全新去卷积方法。
PLoS Comput Biol. 2019 Dec 2;15(12):e1007510. doi: 10.1371/journal.pcbi.1007510. eCollection 2019 Dec.
8
Harnessing innate immunity in cancer therapy.利用先天免疫进行癌症治疗。
Nature. 2019 Oct;574(7776):45-56. doi: 10.1038/s41586-019-1593-5. Epub 2019 Oct 2.
9
Comprehensive Integration of Single-Cell Data.单细胞数据的综合整合。
Cell. 2019 Jun 13;177(7):1888-1902.e21. doi: 10.1016/j.cell.2019.05.031. Epub 2019 Jun 6.
10
deconvSeq: deconvolution of cell mixture distribution in sequencing data.deconvSeq:测序数据中细胞混合物分布的反卷积。
Bioinformatics. 2019 Dec 15;35(24):5095-5102. doi: 10.1093/bioinformatics/btz444.