• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PhenoComb:一种用于评估高维单细胞数据集中复杂表型的发现工具。

PhenoComb: a discovery tool to assess complex phenotypes in high-dimensional single-cell datasets.

作者信息

Burke Paulo E P, Strange Ann, Monk Emily, Thompson Brian, Amato Carol M, Woods David M

机构信息

Division of Medical Oncology, Department of Medicine, University of Colorado School of Medicine, Aurora, CO 80045, USA.

出版信息

Bioinform Adv. 2022 Aug 3;2(1):vbac052. doi: 10.1093/bioadv/vbac052. eCollection 2022.

DOI:10.1093/bioadv/vbac052
PMID:36699375
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9710698/
Abstract

MOTIVATION

High-dimensional cytometry assays can simultaneously measure dozens of markers, enabling the investigation of complex phenotypes. However, as manual gating relies on previous biological knowledge, few marker combinations are often assessed. This results in complex phenotypes with the potential for biological relevance being overlooked. Here, we present PhenoComb, an R package that allows agnostic exploration of phenotypes by assessing all combinations of markers. PhenoComb uses signal intensity thresholds to assign markers to discrete states (e.g. negative, low, high) and then counts the number of cells per sample from all possible marker combinations in a memory-safe manner. Time and disk space are the only constraints on the number of markers evaluated. PhenoComb also provides several approaches to perform statistical comparisons, evaluate the relevance of phenotypes and assess the independence of identified phenotypes. PhenoComb allows users to guide analysis by adjusting several function arguments, such as identifying parent populations of interest, filtering of low-frequency populations and defining a maximum complexity of phenotypes to evaluate. We have designed PhenoComb to be compatible with a local computer or server-based use.

RESULTS

In testing of PhenoComb's performance on synthetic datasets, computation on 16 markers was completed in the scale of minutes and up to 26 markers in hours. We applied PhenoComb to two publicly available datasets: an HIV flow cytometry dataset (12 markers and 421 samples) and the COVIDome CyTOF dataset (40 markers and 99 samples). In the HIV dataset, PhenoComb identified immune phenotypes associated with HIV seroconversion, including those highlighted in the original publication. In the COVID dataset, we identified several immune phenotypes with altered frequencies in infected individuals relative to healthy individuals. Collectively, PhenoComb represents a powerful discovery tool for agnostically assessing high-dimensional single-cell data.

AVAILABILITY AND IMPLEMENTATION

The PhenoComb R package can be downloaded from https://github.com/SciOmicsLab/PhenoComb.

SUPPLEMENTARY INFORMATION

Supplementary data are available at online.

摘要

动机

高维细胞计数分析能够同时测量数十种标志物,从而对复杂表型进行研究。然而,由于手动设门依赖于先前的生物学知识,常常只能评估少数标志物组合。这就导致具有潜在生物学相关性的复杂表型被忽视。在此,我们展示了PhenoComb,这是一个R软件包,可通过评估标志物的所有组合对表型进行无偏倚探索。PhenoComb使用信号强度阈值将标志物分配到离散状态(例如阴性、低、高),然后以内存安全的方式计算每个样本中所有可能标志物组合的细胞数量。时间和磁盘空间是评估标志物数量的唯一限制因素。PhenoComb还提供了几种进行统计比较、评估表型相关性以及评估所鉴定表型独立性的方法。PhenoComb允许用户通过调整几个函数参数来指导分析,例如识别感兴趣的亲本群体、过滤低频群体以及定义要评估的表型的最大复杂性。我们将PhenoComb设计为与本地计算机或基于服务器的使用兼容。

结果

在对合成数据集测试PhenoComb的性能时,对16个标志物的计算在数分钟内完成,对多达26个标志物的计算在数小时内完成。我们将PhenoComb应用于两个公开可用的数据集:一个HIV流式细胞术数据集(12个标志物和421个样本)以及COVIDome CyTOF数据集(40个标志物和99个样本)。在HIV数据集中,PhenoComb鉴定出了与HIV血清转化相关的免疫表型,包括原始出版物中突出显示的那些。在COVID数据集中,我们鉴定出了几个在感染个体中相对于健康个体频率发生改变的免疫表型。总体而言,PhenoComb是一种用于无偏倚评估高维单细胞数据的强大发现工具。

可用性与实现方式

PhenoComb R软件包可从https://github.com/SciOmicsLab/PhenoComb下载。

补充信息

补充数据可在网上获取。

相似文献

1
PhenoComb: a discovery tool to assess complex phenotypes in high-dimensional single-cell datasets.PhenoComb:一种用于评估高维单细胞数据集中复杂表型的发现工具。
Bioinform Adv. 2022 Aug 3;2(1):vbac052. doi: 10.1093/bioadv/vbac052. eCollection 2022.
2
CyTOFmerge: integrating mass cytometry data across multiple panels.CyTOFmerge:跨多个面板整合液质联用数据。
Bioinformatics. 2019 Oct 15;35(20):4063-4071. doi: 10.1093/bioinformatics/btz180.
3
Gating mass cytometry data by deep learning.通过深度学习对门控质谱流式细胞术数据进行分类。
Bioinformatics. 2017 Nov 1;33(21):3423-3430. doi: 10.1093/bioinformatics/btx448.
4
Comparison of clustering methods for high-dimensional single-cell flow and mass cytometry data.高维单细胞流式细胞术和质谱流式细胞术数据聚类方法的比较
Cytometry A. 2016 Dec;89(12):1084-1096. doi: 10.1002/cyto.a.23030. Epub 2016 Dec 19.
5
scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets.scGate:基于标记的异质单细胞 RNA-seq 数据集细胞类型的纯化。
Bioinformatics. 2022 Apr 28;38(9):2642-2644. doi: 10.1093/bioinformatics/btac141.
6
A computational approach for phenotypic comparisons of cell populations in high-dimensional cytometry data.一种用于高维流式细胞术数据中细胞群体表型比较的计算方法。
Methods. 2018 Jan 1;132:66-75. doi: 10.1016/j.ymeth.2017.09.005. Epub 2017 Sep 14.
7
SPADEVizR: an R package for visualization, analysis and integration of SPADE results.SPADEVizR:一个用于SPADE结果可视化、分析和整合的R包。
Bioinformatics. 2017 Mar 1;33(5):779-781. doi: 10.1093/bioinformatics/btw708.
8
SCHNEL: scalable clustering of high dimensional single-cell data.SCHNEL:高维单细胞数据的可扩展聚类。
Bioinformatics. 2020 Dec 30;36(Suppl_2):i849-i856. doi: 10.1093/bioinformatics/btaa816.
9
flowLearn: fast and precise identification and quality checking of cell populations in flow cytometry.flowLearn:流式细胞术中细胞群的快速准确识别和质量检查。
Bioinformatics. 2018 Jul 1;34(13):2245-2253. doi: 10.1093/bioinformatics/bty082.
10
CyAnno: a semi-automated approach for cell type annotation of mass cytometry datasets.CyAnno:一种用于质谱细胞数据集细胞类型注释的半自动方法。
Bioinformatics. 2021 Nov 18;37(22):4164-4171. doi: 10.1093/bioinformatics/btab409.

本文引用的文献

1
The COVIDome Explorer researcher portal.COVIDome Explorer 研究人员门户。
Cell Rep. 2021 Aug 17;36(7):109527. doi: 10.1016/j.celrep.2021.109527. Epub 2021 Jul 28.
2
Nivolumab and ipilimumab are associated with distinct immune landscape changes and response-associated immunophenotypes.纳武利尤单抗和伊匹单抗与不同的免疫景观变化和与应答相关的免疫表型相关。
JCI Insight. 2020 Jun 4;5(11):137066. doi: 10.1172/jci.insight.137066.
3
Dimensionality reduction for visualizing single-cell data using UMAP.使用UMAP进行单细胞数据可视化的降维方法。
Nat Biotechnol. 2018 Dec 3. doi: 10.1038/nbt.4314.
4
Data-Driven Phenotypic Dissection of AML Reveals Progenitor-like Cells that Correlate with Prognosis.急性髓系白血病的数据驱动表型剖析揭示了与预后相关的祖细胞样细胞。
Cell. 2015 Jul 2;162(1):184-97. doi: 10.1016/j.cell.2015.05.047. Epub 2015 Jun 18.
5
Lineage relationship of effector and memory T cells.效应和记忆 T 细胞的谱系关系。
Curr Opin Immunol. 2013 Oct;25(5):556-63. doi: 10.1016/j.coi.2013.09.003. Epub 2013 Oct 20.
6
CD38 and CD157: a long journey from activation markers to multifunctional molecules.CD38 和 CD157:从激活标志物到多功能分子的漫长旅程。
Cytometry B Clin Cytom. 2013 Jul-Aug;84(4):207-17. doi: 10.1002/cyto.b.21092. Epub 2013 Apr 10.
7
Early immunologic correlates of HIV protection can be identified from computational analysis of complex multivariate T-cell flow cytometry assays.从复杂的多变量 T 细胞流式细胞术分析的计算分析中,可以确定与 HIV 保护相关的早期免疫学指标。
Bioinformatics. 2012 Apr 1;28(7):1009-16. doi: 10.1093/bioinformatics/bts082. Epub 2012 Feb 29.
8
Memories that last forever: strategies for optimizing vaccine T-cell memory.长久的记忆:优化疫苗 T 细胞记忆的策略。
Blood. 2010 Mar 4;115(9):1678-89. doi: 10.1182/blood-2009-06-227546. Epub 2009 Nov 10.
9
A chromatic explosion: the development and future of multiparameter flow cytometry.一场色彩的爆发:多参数流式细胞术的发展与未来
Immunology. 2008 Dec;125(4):441-9. doi: 10.1111/j.1365-2567.2008.02989.x.
10
Increasing age at HIV seroconversion from 18 to 40 years is associated with favorable virologic and immunologic responses to HAART.从18岁到40岁,HIV血清转化年龄的增加与高效抗逆转录病毒疗法(HAART)良好的病毒学和免疫学反应相关。
J Acquir Immune Defic Syndr. 2008 Sep 1;49(1):40-7. doi: 10.1097/QAI.0b013e31817bec05.