• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

StarTracer 是一种在单细胞 RNA-Seq 分析中用于精确标记基因识别的加速方法。

starTracer is an accelerated approach for precise marker gene identification in single-cell RNA-Seq analysis.

机构信息

Brain Research Center, Zhongnan Hospital, Second Clinical School, Wuhan University, Wuhan, China.

Graduate School of Biostudies, Kyoto University, Kyoto, Japan.

出版信息

Commun Biol. 2024 Sep 13;7(1):1128. doi: 10.1038/s42003-024-06790-6.

DOI:10.1038/s42003-024-06790-6
PMID:39266658
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11393126/
Abstract

Revealing the heterogeneity among tissues is the greatest advantage of single-cell-sequencing. Marker genes not only act as the key to correctly identify cell types, but also the bio-markers for cell-status under certain experimental imputations. Current analysis methods such as Seurat and Monocle employ algorithms which compares one cluster to all the rest and select markers according to statistical tests. This pattern brings redundant calculations and thus, results in low calculation efficiency, specificity and accuracy. To address these issues, we introduce starTracer, a novel algorithm designed to enhance the efficiency, specificity and accuracy of marker gene identification in single-cell RNA-seq data analysis. starTracer operates as an independent pipeline, which exhibits great flexibility by accepting multiple input file types. The primary output is a marker matrix, where genes are sorted by the potential to function as markers, with those exhibiting the greatest potential positioned at the top. The speed improvement ranges by 2 ~ 3 orders of magnitude compared to Seurat, as observed across three independent datasets with lower false positive rate as observed in a simulated testing dataset with ground-truth. It's worth noting that starTracer exhibits increasing speed improvement with larger data volumes. It also excels in identifying markers in smaller clusters. These advantages solidify starTracer as an important tool for single-cell RNA-seq data, merging robust accuracy with exceptional speed.

摘要

揭示组织间的异质性是单细胞测序的最大优势。标记基因不仅是正确识别细胞类型的关键,也是特定实验推断下细胞状态的生物标志物。当前的分析方法,如 Seurat 和 Monocle,采用的算法是将一个簇与所有其他簇进行比较,并根据统计检验选择标记基因。这种模式带来了冗余的计算,从而导致计算效率、特异性和准确性降低。为了解决这些问题,我们引入了 starTracer,这是一种专门设计的算法,用于提高单细胞 RNA-seq 数据分析中标记基因识别的效率、特异性和准确性。starTracer 作为一个独立的流水线运行,通过接受多种输入文件类型,表现出极大的灵活性。主要输出是一个标记基因矩阵,其中基因按作为标记的潜力排序,具有最大潜力的基因排在顶部。与 Seurat 相比,速度提高了 2 到 3 个数量级,在三个独立的数据集和一个具有真实标记的模拟测试数据集上都观察到了更低的假阳性率。值得注意的是,starTracer 随着数据量的增加,速度提升的幅度也越来越大。它在识别较小簇的标记基因方面也表现出色。这些优势使 starTracer 成为单细胞 RNA-seq 数据的重要工具,它具有稳健的准确性和出色的速度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/1b809ca9fa45/42003_2024_6790_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/650d507a8fde/42003_2024_6790_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/317309f34a7e/42003_2024_6790_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/0395f5f36333/42003_2024_6790_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/170e0184a630/42003_2024_6790_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/d421e308fb90/42003_2024_6790_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/9e9d55b6452b/42003_2024_6790_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/1b809ca9fa45/42003_2024_6790_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/650d507a8fde/42003_2024_6790_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/317309f34a7e/42003_2024_6790_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/0395f5f36333/42003_2024_6790_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/170e0184a630/42003_2024_6790_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/d421e308fb90/42003_2024_6790_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/9e9d55b6452b/42003_2024_6790_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd00/11393126/1b809ca9fa45/42003_2024_6790_Fig7_HTML.jpg

相似文献

1
starTracer is an accelerated approach for precise marker gene identification in single-cell RNA-Seq analysis.StarTracer 是一种在单细胞 RNA-Seq 分析中用于精确标记基因识别的加速方法。
Commun Biol. 2024 Sep 13;7(1):1128. doi: 10.1038/s42003-024-06790-6.
2
Improving replicability in single-cell RNA-Seq cell type discovery with Dune.利用 Dune 提高单细胞 RNA-Seq 细胞类型发现的可重复性。
BMC Bioinformatics. 2024 May 24;25(1):198. doi: 10.1186/s12859-024-05814-6.
3
XgCPred: Cell type classification using XGBoost-CNN integration and exploiting gene expression imaging in single-cell RNAseq data.XgCPred:基于 XGBoost-CNN 集成和单细胞 RNAseq 数据中基因表达成像的细胞类型分类。
Comput Biol Med. 2024 Oct;181:109066. doi: 10.1016/j.compbiomed.2024.109066. Epub 2024 Aug 24.
4
Using RNentropy to Detect Significant Variation in Gene Expression Across Multiple RNA-Seq or Single-Cell RNA-Seq Samples.使用 RNentropy 检测多个 RNA-Seq 或单细胞 RNA-Seq 样本中基因表达的显著变化。
Methods Mol Biol. 2021;2284:77-96. doi: 10.1007/978-1-0716-1307-8_6.
5
Identifying cell states in single-cell RNA-seq data at statistically maximal resolution.以统计学上最大分辨率识别单细胞 RNA-seq 数据中的细胞状态。
PLoS Comput Biol. 2024 Jul 12;20(7):e1012224. doi: 10.1371/journal.pcbi.1012224. eCollection 2024 Jul.
6
scCTS: identifying the cell type-specific marker genes from population-level single-cell RNA-seq.scCTS:从群体水平的单细胞 RNA-seq 中识别细胞类型特异性标记基因。
Genome Biol. 2024 Oct 14;25(1):269. doi: 10.1186/s13059-024-03410-8.
7
Polled Digital Cell Sorter (p-DCS): Automatic identification of hematological cell types from single cell RNA-sequencing clusters.流式数字细胞分选仪(p-DCS):从单细胞 RNA 测序簇中自动识别血细胞类型。
BMC Bioinformatics. 2019 Jul 1;20(1):369. doi: 10.1186/s12859-019-2951-x.
8
scRNMF: An imputation method for single-cell RNA-seq data by robust and non-negative matrix factorization.scRNMF:一种基于鲁棒非负矩阵分解的单细胞 RNA-seq 数据插补方法。
PLoS Comput Biol. 2024 Aug 8;20(8):e1012339. doi: 10.1371/journal.pcbi.1012339. eCollection 2024 Aug.
9
Hierarchical marker genes selection in scRNA-seq analysis.单细胞RNA测序分析中的分层标记基因选择
PLoS Comput Biol. 2024 Dec 12;20(12):e1012643. doi: 10.1371/journal.pcbi.1012643. eCollection 2024 Dec.
10
scDFN: enhancing single-cell RNA-seq clustering with deep fusion networks.scDFN:利用深度融合网络增强单细胞 RNA-seq 聚类
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae486.

引用本文的文献

1
Single-cell RNA sequencing technology was employed to construct a risk prediction model for genes associated with pyroptosis and ferroptosis in lung adenocarcinoma.采用单细胞RNA测序技术构建肺腺癌中与细胞焦亡和铁死亡相关基因的风险预测模型。
Respir Res. 2025 Jul 18;26(1):249. doi: 10.1186/s12931-025-03323-5.
2
Analysis of microarray and single-cell RNA-seq identifies gene co-expression, cell-cell communication, and tumor environment associated with metabolite interconversion enzyme in prostate cancer.微阵列和单细胞RNA测序分析确定了与前列腺癌中代谢物相互转化酶相关的基因共表达、细胞间通讯和肿瘤环境。
Discov Oncol. 2025 Feb 13;16(1):177. doi: 10.1007/s12672-025-01926-4.

本文引用的文献

1
A comparison of marker gene selection methods for single-cell RNA sequencing data.单细胞 RNA 测序数据中标记基因选择方法的比较。
Genome Biol. 2024 Feb 26;25(1):56. doi: 10.1186/s13059-024-03183-0.
2
A marker gene-based method for identifying the cell-type of origin from single-cell RNA sequencing data.一种基于标记基因的方法,用于从单细胞RNA测序数据中识别细胞起源类型。
MethodsX. 2023 Apr 25;10:102196. doi: 10.1016/j.mex.2023.102196. eCollection 2023.
3
Spatially resolved single-cell translatomics at molecular resolution.分子分辨率下的空间分辨单细胞转导组学。
Science. 2023 Jun 30;380(6652):eadd3067. doi: 10.1126/science.add3067.
4
Proteogenomics of diffuse gliomas reveal molecular subtypes associated with specific therapeutic targets and immune-evasion mechanisms.弥漫性神经胶质瘤的蛋白质基因组学研究揭示了与特定治疗靶点和免疫逃逸机制相关的分子亚型。
Nat Commun. 2023 Jan 31;14(1):505. doi: 10.1038/s41467-023-36005-1.
5
Human prefrontal cortex gene regulatory dynamics from gestation to adulthood at single-cell resolution.人类前额皮质从妊娠到成年的单细胞分辨率基因调控动态。
Cell. 2022 Nov 10;185(23):4428-4447.e28. doi: 10.1016/j.cell.2022.09.039. Epub 2022 Oct 31.
6
CellMarker 2.0: an updated database of manually curated cell markers in human/mouse and web tools based on scRNA-seq data.CellMarker 2.0:一个更新的数据库,包含基于 scRNA-seq 数据的人类/小鼠细胞标志物的人工注释和网络工具。
Nucleic Acids Res. 2023 Jan 6;51(D1):D870-D876. doi: 10.1093/nar/gkac947.
7
Single-nucleus profiling of human dilated and hypertrophic cardiomyopathy.人类扩张型和肥厚型心肌病的单细胞分析。
Nature. 2022 Aug;608(7921):174-180. doi: 10.1038/s41586-022-04817-8. Epub 2022 Jun 22.
8
hECA: The cell-centric assembly of a cell atlas.人类细胞图谱计划(hECA):以细胞为中心构建细胞图谱。
iScience. 2022 Apr 28;25(5):104318. doi: 10.1016/j.isci.2022.104318. eCollection 2022 May 20.
9
UCell: Robust and scalable single-cell gene signature scoring.UCell:强大且可扩展的单细胞基因特征评分
Comput Struct Biotechnol J. 2021 Jun 30;19:3796-3798. doi: 10.1016/j.csbj.2021.06.043. eCollection 2021.
10
Integrated analysis of multimodal single-cell data.多模态单细胞数据的综合分析。
Cell. 2021 Jun 24;184(13):3573-3587.e29. doi: 10.1016/j.cell.2021.04.048. Epub 2021 May 31.