• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从组织样本的共表达模式中鉴定细胞类型特异性标记基因。

Identification of cell-type-specific marker genes from co-expression patterns in tissue samples.

作者信息

Qiu Yixuan, Wang Jiebiao, Lei Jing, Roeder Kathryn

机构信息

Department of Statistics and Data Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA.

Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA 15261, USA.

出版信息

Bioinformatics. 2021 Oct 11;37(19):3228-3234. doi: 10.1093/bioinformatics/btab257.

DOI:10.1093/bioinformatics/btab257
PMID:33904573
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8504631/
Abstract

MOTIVATION

Marker genes, defined as genes that are expressed primarily in a single-cell type, can be identified from the single-cell transcriptome; however, such data are not always available for the many uses of marker genes, such as deconvolution of bulk tissue. Marker genes for a cell type, however, are highly correlated in bulk data, because their expression levels depend primarily on the proportion of that cell type in the samples. Therefore, when many tissue samples are analyzed, it is possible to identify these marker genes from the correlation pattern.

RESULTS

To capitalize on this pattern, we develop a new algorithm to detect marker genes by combining published information about likely marker genes with bulk transcriptome data in the form of a semi-supervised algorithm. The algorithm then exploits the correlation structure of the bulk data to refine the published marker genes by adding or removing genes from the list.

AVAILABILITY AND IMPLEMENTATION

We implement this method as an R package markerpen, hosted on CRAN (https://CRAN.R-project.org/package=markerpen).

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

标记基因被定义为主要在单一细胞类型中表达的基因,可从单细胞转录组中识别出来;然而,对于标记基因的许多用途(如批量组织的反卷积),此类数据并非总是可用。然而,细胞类型的标记基因在批量数据中高度相关,因为它们的表达水平主要取决于样本中该细胞类型的比例。因此,当分析许多组织样本时,有可能从相关模式中识别出这些标记基因。

结果

为了利用这种模式,我们开发了一种新算法,通过将关于可能的标记基因的已发表信息与批量转录组数据以半监督算法的形式相结合来检测标记基因。然后,该算法利用批量数据的相关结构,通过从列表中添加或删除基因来优化已发表的标记基因。

可用性和实现方式

我们将此方法实现为一个R包markerpen,托管在CRAN上(https://CRAN.R-project.org/package=markerpen)。

补充信息

补充数据可在《生物信息学》在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8504631/f3f180b21a9b/btab257f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8504631/2a3f64a329ce/btab257f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8504631/ca0b9fe1b782/btab257f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8504631/e3c3380514bf/btab257f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8504631/f3f180b21a9b/btab257f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8504631/2a3f64a329ce/btab257f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8504631/ca0b9fe1b782/btab257f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8504631/e3c3380514bf/btab257f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8504631/f3f180b21a9b/btab257f4.jpg

相似文献

1
Identification of cell-type-specific marker genes from co-expression patterns in tissue samples.从组织样本的共表达模式中鉴定细胞类型特异性标记基因。
Bioinformatics. 2021 Oct 11;37(19):3228-3234. doi: 10.1093/bioinformatics/btab257.
2
selectBoost: a general algorithm to enhance the performance of variable selection methods.选择提升:一种增强变量选择方法性能的通用算法。
Bioinformatics. 2021 May 5;37(5):659-668. doi: 10.1093/bioinformatics/btaa855.
3
dtangle: accurate and robust cell type deconvolution.dtangle:准确且稳健的细胞类型去卷积。
Bioinformatics. 2019 Jun 1;35(12):2093-2099. doi: 10.1093/bioinformatics/bty926.
4
Semi-CAM: A semi-supervised deconvolution method for bulk transcriptomic data with partial marker gene information.Semi-CAM:一种具有部分标记基因信息的批量转录组数据的半监督解卷积方法。
Sci Rep. 2020 Mar 25;10(1):5434. doi: 10.1038/s41598-020-62330-2.
5
deTS: tissue-specific enrichment analysis to decode tissue specificity.组织特异性富集分析,用于解码组织特异性。
Bioinformatics. 2019 Oct 1;35(19):3842-3845. doi: 10.1093/bioinformatics/btz138.
6
SCIBER: a simple method for removing batch effects from single-cell RNA-sequencing data.SCIBER:一种从单细胞 RNA 测序数据中去除批次效应的简单方法。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac819.
7
swCAM: estimation of subtype-specific expressions in individual samples with unsupervised sample-wise deconvolution.swCAM:基于无监督样本分解的个体样本中亚型特异性表达的估计。
Bioinformatics. 2022 Feb 7;38(5):1403-1410. doi: 10.1093/bioinformatics/btab839.
8
debCAM: a bioconductor R package for fully unsupervised deconvolution of complex tissues.debCAM:一个用于复杂组织完全无监督去卷积的 Bioconductor R 包。
Bioinformatics. 2020 Jun 1;36(12):3927-3929. doi: 10.1093/bioinformatics/btaa205.
9
Using multiple measurements of tissue to estimate subject- and cell-type-specific gene expression.使用多种组织测量方法来估计个体和细胞类型特异性基因表达。
Bioinformatics. 2020 Feb 1;36(3):782-788. doi: 10.1093/bioinformatics/btz619.
10
powerEQTL: an R package and shiny application for sample size and power calculation of bulk tissue and single-cell eQTL analysis.powerEQTL:一个用于批量组织和单细胞 eQTL 分析样本量和功效计算的 R 包和 shiny 应用程序。
Bioinformatics. 2021 Nov 18;37(22):4269-4271. doi: 10.1093/bioinformatics/btab385.

引用本文的文献

1
Expanding canonical cortical cell type markers in the era of single-cell transcriptomics.在单细胞转录组学时代扩展经典皮质细胞类型标志物
bioRxiv. 2025 Aug 26:2025.08.26.672469. doi: 10.1101/2025.08.26.672469.
2
Discovery of optimal cell type classification marker genes from single cell RNA sequencing data.从单细胞RNA测序数据中发现最佳细胞类型分类标记基因。
BMC Methods. 2024;1. doi: 10.1186/s44330-024-00015-2. Epub 2024 Nov 4.
3
HIDE: hierarchical cell-type deconvolution.HIDE:分层细胞类型反卷积
Bioinformatics. 2025 Jul 1;41(Supplement_1):i207-i216. doi: 10.1093/bioinformatics/btaf179.
4
Ligand-Receptor Analysis of Brain Cell Type Marker Data Reveals Intricate Endothelial Interaction.脑细胞类型标记数据的配体-受体分析揭示了复杂的内皮细胞相互作用。
Ann Neurosci. 2025 Jun 21:09727531251343254. doi: 10.1177/09727531251343254.
5
scaLR: a low-resource deep neural network-based platform for single cell analysis and biomarker discovery.scaLR:一个基于低资源深度神经网络的单细胞分析和生物标志物发现平台。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf243.
6
ARTdeConv: adaptive regularized tri-factor non-negative matrix factorization for cell type deconvolution.ARTdeConv:用于细胞类型反卷积的自适应正则化三因子非负矩阵分解
NAR Genom Bioinform. 2025 Apr 26;7(2):lqaf046. doi: 10.1093/nargab/lqaf046. eCollection 2025 Jun.
7
mastR: an R package for automated identification of tissue-specific gene signatures in multi-group differential expression analysis.mastR:一个用于在多组差异表达分析中自动识别组织特异性基因特征的R包。
Bioinformatics. 2025 Mar 4;41(3). doi: 10.1093/bioinformatics/btaf114.
8
Advancements in the Application of scRNA-Seq in Breast Research: A Review.单细胞RNA测序在乳腺研究中的应用进展:综述
Int J Mol Sci. 2024 Dec 22;25(24):13706. doi: 10.3390/ijms252413706.
9
Robust estimation of cancer and immune cell-type proportions from bulk tumor ATAC-Seq data.从 bulk tumor ATAC-Seq 数据中稳健估计癌症和免疫细胞类型比例。
Elife. 2024 Oct 9;13:RP94833. doi: 10.7554/eLife.94833.
10
Tissue specific tumor-gene link prediction through sampling based GNN using a heterogeneous network.基于基于采样的 GNN 的异构网络进行组织特异性肿瘤基因关联预测。
Med Biol Eng Comput. 2024 Aug;62(8):2499-2510. doi: 10.1007/s11517-024-03087-y. Epub 2024 Apr 18.