• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Using single-cell cytometry to illustrate integrated multi-perspective evaluation of clustering algorithms using Pareto fronts.

作者信息

Putri Givanna H, Koprinska Irena, Ashhurst Thomas M, King Nicholas J C, Read Mark N

机构信息

School of Computer Science, The University of Sydney, Sydney, 2006, Australia.

Charles Perkins Centre, The University of Sydney, Sydney, 2006, Australia.

出版信息

Bioinformatics. 2021 Jan 28. doi: 10.1093/bioinformatics/btab038.

DOI:10.1093/bioinformatics/btab038
PMID:33508103
Abstract

MOTIVATION

Many 'automated gating' algorithms now exist to cluster cytometry and single cell sequencing data into discrete populations. Comparative algorithm evaluations on benchmark datasets rely either on a single performance metric, or a few metrics considered independently of one another. However, single metrics emphasise different aspects of clustering performance and do not rank clustering solutions in the same order. This underlies the lack of consensus between comparative studies regarding optimal clustering algorithms and undermines the translatability of results onto other non-benchmark datasets.

RESULTS

We propose the Pareto fronts framework as an integrative evaluation protocol, wherein individual metrics are instead leveraged as complementary perspectives. Judged superior are algorithms that provide the best trade-off between the multiple metrics considered simultaneously. This yields a more comprehensive and complete view of clustering performance. Moreover, by broadly and systematically sampling algorithm parameter values using the Latin Hypercube sampling method, our evaluation protocol minimises (un)fortunate parameter value selections as confounding factors. Furthermore, it reveals how meticulously each algorithm must be tuned in order to obtain good results, vital knowledge for users with novel data. We exemplify the protocol by conducting a comparative study between three clustering algorithms (ChronoClust, FlowSOM and Phenograph) using four common performance metrics applied across four cytometry benchmark datasets. To our knowledge, this is the first time Pareto fronts have been used to evaluate the performance of clustering algorithms in any application domain.

AVAILABILITY

Implementation of our Pareto front methodology and all scripts to reproduce this article are available at https://github.com/ghar1821/ParetoBench.

摘要

相似文献

1
Using single-cell cytometry to illustrate integrated multi-perspective evaluation of clustering algorithms using Pareto fronts.
Bioinformatics. 2021 Jan 28. doi: 10.1093/bioinformatics/btab038.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
A Clustering-Based Adaptive Evolutionary Algorithm for Multiobjective Optimization With Irregular Pareto Fronts.一种基于聚类的多目标优化自适应进化算法,用于处理不规则帕累托前沿。
IEEE Trans Cybern. 2019 Jul;49(7):2758-2770. doi: 10.1109/TCYB.2018.2834466. Epub 2018 Jun 5.
4
Comparison of clustering methods for high-dimensional single-cell flow and mass cytometry data.高维单细胞流式细胞术和质谱流式细胞术数据聚类方法的比较
Cytometry A. 2016 Dec;89(12):1084-1096. doi: 10.1002/cyto.a.23030. Epub 2016 Dec 19.
5
Combining Pareto-optimal clusters using supervised learning for identifying co-expressed genes.使用监督学习组合帕累托最优聚类以识别共表达基因。
BMC Bioinformatics. 2009 Jan 20;10:27. doi: 10.1186/1471-2105-10-27.
6
An improved differential evolution algorithm for multi-modal multi-objective optimization.一种用于多模态多目标优化的改进差分进化算法。
PeerJ Comput Sci. 2024 Mar 14;10:e1839. doi: 10.7717/peerj-cs.1839. eCollection 2024.
7
Impact of similarity metrics on single-cell RNA-seq data clustering.相似度度量对单细胞 RNA-seq 数据聚类的影响。
Brief Bioinform. 2019 Nov 27;20(6):2316-2326. doi: 10.1093/bib/bby076.
8
scEFSC: Accurate single-cell RNA-seq data analysis via ensemble consensus clustering based on multiple feature selections.scEFSC:基于多重特征选择的集成共识聚类实现准确的单细胞RNA测序数据分析
Comput Struct Biotechnol J. 2022 Apr 27;20:2181-2197. doi: 10.1016/j.csbj.2022.04.023. eCollection 2022.
9
Ensembling of Gene Clusters Utilizing Deep Learning and Protein-Protein Interaction Information.利用深度学习和蛋白质-蛋白质相互作用信息进行基因簇组装。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Nov-Dec;17(6):2005-2016. doi: 10.1109/TCBB.2019.2918523. Epub 2020 Dec 8.
10
Cluster stability in the analysis of mass cytometry data.质谱流式细胞术数据分析中的聚类稳定性
Cytometry A. 2017 Jan;91(1):73-84. doi: 10.1002/cyto.a.23001. Epub 2016 Oct 18.