• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于在大型高维流式细胞术数据集中自动识别稀有细胞群体的SWIFT可扩展聚类,第1部分:算法设计

SWIFT-scalable clustering for automated identification of rare cell populations in large, high-dimensional flow cytometry datasets, part 1: algorithm design.

作者信息

Naim Iftekhar, Datta Suprakash, Rebhahn Jonathan, Cavenaugh James S, Mosmann Tim R, Sharma Gaurav

机构信息

Department of Computer Science, University of Rochester, Rochester, New York.

出版信息

Cytometry A. 2014 May;85(5):408-21. doi: 10.1002/cyto.a.22446. Epub 2014 Feb 14.

DOI:10.1002/cyto.a.22446
PMID:24677621
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4238829/
Abstract

We present a model-based clustering method, SWIFT (Scalable Weighted Iterative Flow-clustering Technique), for digesting high-dimensional large-sized datasets obtained via modern flow cytometry into more compact representations that are well-suited for further automated or manual analysis. Key attributes of the method include the following: (a) the analysis is conducted in the multidimensional space retaining the semantics of the data, (b) an iterative weighted sampling procedure is utilized to maintain modest computational complexity and to retain discrimination of extremely small subpopulations (hundreds of cells from datasets containing tens of millions), and (c) a splitting and merging procedure is incorporated in the algorithm to preserve distinguishability between biologically distinct populations, while still providing a significant compaction relative to the original data. This article presents a detailed algorithmic description of SWIFT, outlining the application-driven motivations for the different design choices, a discussion of computational complexity of the different steps, and results obtained with SWIFT for synthetic data and relatively simple experimental data that allow validation of the desirable attributes. A companion paper (Part 2) highlights the use of SWIFT, in combination with additional computational tools, for more challenging biological problems.

摘要

我们提出了一种基于模型的聚类方法——SWIFT(可扩展加权迭代流聚类技术),用于将通过现代流式细胞术获得的高维大型数据集提炼成更紧凑的表示形式,以便于进一步进行自动化或手动分析。该方法的关键特性包括:(a) 在保留数据语义的多维空间中进行分析;(b) 采用迭代加权采样程序,以保持适度的计算复杂度,并保留对极小亚群(来自包含数千万个细胞的数据集的数百个细胞)的区分能力;(c) 算法中纳入了分裂和合并程序,以保持生物学上不同群体之间的可区分性,同时相对于原始数据仍能实现显著的压缩。本文详细介绍了SWIFT的算法描述,概述了不同设计选择的应用驱动动机,讨论了不同步骤的计算复杂度,以及使用SWIFT处理合成数据和相对简单的实验数据所获得的结果,这些数据能够验证该方法的理想特性。一篇配套论文(第2部分)重点介绍了将SWIFT与其他计算工具相结合,用于解决更具挑战性的生物学问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/143b/4238829/a0cde27b064d/cyto0085-0408-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/143b/4238829/744459f56290/cyto0085-0408-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/143b/4238829/026c386112ef/cyto0085-0408-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/143b/4238829/28832385d380/cyto0085-0408-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/143b/4238829/7aa329c1a727/cyto0085-0408-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/143b/4238829/a0cde27b064d/cyto0085-0408-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/143b/4238829/744459f56290/cyto0085-0408-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/143b/4238829/026c386112ef/cyto0085-0408-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/143b/4238829/28832385d380/cyto0085-0408-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/143b/4238829/7aa329c1a727/cyto0085-0408-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/143b/4238829/a0cde27b064d/cyto0085-0408-f5.jpg

相似文献

1
SWIFT-scalable clustering for automated identification of rare cell populations in large, high-dimensional flow cytometry datasets, part 1: algorithm design.用于在大型高维流式细胞术数据集中自动识别稀有细胞群体的SWIFT可扩展聚类,第1部分:算法设计
Cytometry A. 2014 May;85(5):408-21. doi: 10.1002/cyto.a.22446. Epub 2014 Feb 14.
2
SWIFT-scalable clustering for automated identification of rare cell populations in large, high-dimensional flow cytometry datasets, part 2: biological evaluation.用于在大型高维流式细胞术数据集中自动识别稀有细胞群体的SWIFT可扩展聚类,第2部分:生物学评估。
Cytometry A. 2014 May;85(5):422-33. doi: 10.1002/cyto.a.22445. Epub 2014 Feb 14.
3
Competitive SWIFT cluster templates enhance detection of aging changes.竞争型 SWIFT 簇模板增强了衰老变化的检测。
Cytometry A. 2016 Jan;89(1):59-70. doi: 10.1002/cyto.a.22740. Epub 2015 Oct 6.
4
immunoClust--An automated analysis pipeline for the identification of immunophenotypic signatures in high-dimensional cytometric datasets.免疫聚类——一种用于在高维细胞数据集识别免疫表型特征的自动化分析流程。
Cytometry A. 2015 Jul;87(7):603-15. doi: 10.1002/cyto.a.22626. Epub 2015 Apr 7.
5
Automated gating of flow cytometry data via robust model-based clustering.通过基于稳健模型的聚类实现流式细胞术数据的自动门控。
Cytometry A. 2008 Apr;73(4):321-32. doi: 10.1002/cyto.a.20531.
6
Comparison of clustering methods for high-dimensional single-cell flow and mass cytometry data.高维单细胞流式细胞术和质谱流式细胞术数据聚类方法的比较
Cytometry A. 2016 Dec;89(12):1084-1096. doi: 10.1002/cyto.a.23030. Epub 2016 Dec 19.
7
Data reduction for spectral clustering to analyze high throughput flow cytometry data.用于分析高通量流式细胞术数据的谱聚类数据约简。
BMC Bioinformatics. 2010 Jul 28;11:403. doi: 10.1186/1471-2105-11-403.
8
Scalable clustering algorithms for continuous environmental flow cytometry.可扩展的连续环境流式细胞术聚类算法。
Bioinformatics. 2016 Feb 1;32(3):417-23. doi: 10.1093/bioinformatics/btv594. Epub 2015 Oct 17.
9
Misty Mountain clustering: application to fast unsupervised flow cytometry gating.迷雾山脉聚类:在快速无监督流式细胞术门控中的应用。
BMC Bioinformatics. 2010 Oct 9;11:502. doi: 10.1186/1471-2105-11-502.
10
Assessment of Automated Flow Cytometry Data Analysis Tools within Cell and Gene Therapy Manufacturing.评估细胞和基因治疗产品生产过程中的自动化流式细胞术数据分析工具
Int J Mol Sci. 2022 Mar 17;23(6):3224. doi: 10.3390/ijms23063224.

引用本文的文献

1
SWIFT clustering analysis of intracellular cytokine staining flow cytometry data of the HVTN 105 vaccine trial reveals high frequencies of HIV-specific CD4+ T cell responses and associations with humoral responses.SWIFT 聚类分析 HVTN 105 疫苗试验的细胞内细胞因子染色流式细胞术数据显示,HIV 特异性 CD4+ T 细胞反应频率较高,并与体液反应相关联。
Front Immunol. 2024 Jun 6;15:1347926. doi: 10.3389/fimmu.2024.1347926. eCollection 2024.
2
Measurable (Minimal) Residual Disease in Myelodysplastic Neoplasms (MDS): Current State and Perspectives.骨髓增生异常综合征(MDS)中的可测量(微小)残留病:现状与展望
Cancers (Basel). 2024 Apr 15;16(8):1503. doi: 10.3390/cancers16081503.
3

本文引用的文献

1
Hierarchical modeling for rare event detection and cell subset alignment across flow cytometry samples.用于流式细胞术样本中稀有事件检测和细胞亚群对齐的层次建模。
PLoS Comput Biol. 2013;9(7):e1003130. doi: 10.1371/journal.pcbi.1003130. Epub 2013 Jul 11.
2
flowPeaks: a fast unsupervised clustering for flow cytometry data via K-means and density peak finding.flowPeaks:一种基于 K-means 和密度峰值发现的流式细胞术数据快速无监督聚类方法。
Bioinformatics. 2012 Aug 1;28(15):2052-8. doi: 10.1093/bioinformatics/bts300. Epub 2012 May 17.
3
Extracting a cellular hierarchy from high-dimensional cytometry data with SPADE.
Rarity: discovering rare cell populations from single-cell imaging data.
稀有性:从单细胞成像数据中发现稀有细胞群体。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad750.
4
A cell-level discriminative neural network model for diagnosis of blood cancers.一种用于血液癌症诊断的细胞水平判别神经网络模型。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad585.
5
Rapid reduction in Staphylococcus aureus in atopic dermatitis subjects following dupilumab treatment.达必妥治疗特应性皮炎患者后金黄色葡萄球菌迅速减少。
J Allergy Clin Immunol. 2023 Nov;152(5):1179-1195. doi: 10.1016/j.jaci.2023.05.026. Epub 2023 Jun 13.
6
Multiomics technologies for comprehensive tumor microenvironment analysis in triple-negative breast cancer under neoadjuvant chemotherapy.新辅助化疗下三阴性乳腺癌综合肿瘤微环境分析的多组学技术
Front Oncol. 2023 May 22;13:1131259. doi: 10.3389/fonc.2023.1131259. eCollection 2023.
7
INFLECT: an R-package for cytometry cluster evaluation using marker modality.INFLECT:一个使用标记模式评估流式细胞术聚类的 R 包。
BMC Bioinformatics. 2022 Nov 16;23(1):487. doi: 10.1186/s12859-022-05018-w.
8
Assessment of Automated Flow Cytometry Data Analysis Tools within Cell and Gene Therapy Manufacturing.评估细胞和基因治疗产品生产过程中的自动化流式细胞术数据分析工具
Int J Mol Sci. 2022 Mar 17;23(6):3224. doi: 10.3390/ijms23063224.
9
UMAP Based Anomaly Detection for Minimal Residual Disease Quantification within Acute Myeloid Leukemia.基于UMAP的急性髓系白血病微小残留病定量异常检测
Cancers (Basel). 2022 Feb 11;14(4):898. doi: 10.3390/cancers14040898.
10
CYBERTRACK2.0: zero-inflated model-based cell clustering and population tracking method for longitudinal mass cytometry data.CYBERTRACK2.0:基于零膨胀模型的细胞聚类和群体跟踪方法,用于纵向质谱流式细胞术数据。
Bioinformatics. 2021 Jul 12;37(11):1632-1634. doi: 10.1093/bioinformatics/btaa873.
使用 SPADE 从高维细胞测定数据中提取细胞层次结构。
Nat Biotechnol. 2011 Oct 2;29(10):886-91. doi: 10.1038/nbt.1991.
4
Rapid cell population identification in flow cytometry data.流式细胞术数据中快速的细胞群体识别。
Cytometry A. 2011 Jan;79(1):6-13. doi: 10.1002/cyto.a.21007.
5
Combining Mixture Components for Clustering.组合混合成分用于聚类。
J Comput Graph Stat. 2010 Jun 1;9(2):332-353. doi: 10.1198/jcgs.2010.08111.
6
Elucidation of seventeen human peripheral blood B-cell subsets and quantification of the tetanus response using a density-based method for the automated identification of cell populations in multidimensional flow cytometry data.运用基于密度的方法阐明十七个人外周血 B 细胞亚群,并对破伤风应答进行定量分析,该方法可用于多维流式细胞术数据中细胞群体的自动识别。
Cytometry B Clin Cytom. 2010;78 Suppl 1(Suppl 1):S69-82. doi: 10.1002/cyto.b.20554.
7
Data reduction for spectral clustering to analyze high throughput flow cytometry data.用于分析高通量流式细胞术数据的谱聚类数据约简。
BMC Bioinformatics. 2010 Jul 28;11:403. doi: 10.1186/1471-2105-11-403.
8
The curvHDR method for gating flow cytometry samples. curvHDR 方法用于门控流式细胞术样本。
BMC Bioinformatics. 2010 Jan 22;11:44. doi: 10.1186/1471-2105-11-44.
9
Merging mixture components for cell population identification in flow cytometry.在流式细胞术中合并混合组分以进行细胞群体鉴定。
Adv Bioinformatics. 2009;2009:247646. doi: 10.1155/2009/247646. Epub 2009 Nov 12.
10
Automated high-dimensional flow cytometric data analysis.自动化高维流式细胞术数据分析。
Proc Natl Acad Sci U S A. 2009 May 26;106(21):8519-24. doi: 10.1073/pnas.0903028106. Epub 2009 May 14.