• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HiCat:一种用于细胞类型注释的半监督方法。

HiCat: a semi-supervised approach for cell type annotation.

作者信息

Bi Chang, Bai Kailun, Zhang Xuekui

机构信息

Department of Mathematics and Statistics, University of Victoria, 3800 Finnerty Road, Victoria, BC V8P 5C2, Canada.

出版信息

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf428.

DOI:10.1093/bib/bbaf428
PMID:40833274
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12365967/
Abstract

Existing cell type annotation methods face significant hurdles: supervised approaches often fail to differentiate between novel cell types not present in reference data, while unsupervised techniques can suffer from cluster impurity and difficulties in robustly distinguishing multiple distinct unknown cell populations. This critical gap motivated the development of HiCat, a semi-supervised pipeline specifically designed to overcome these limitations. HiCat is a semi-supervised pipeline that integrates both approaches, leveraging reference (labeled) and query (unlabeled) genomic data to simultaneously enhance annotation accuracy for known cell types and improve the discovery and differentiation of novel ones. HiCat follows a structured pipeline: (1) removing batch effects and generate a low-dimensional embedding; (2) nonlinear dimensionality reduction for capturing key patterns; (3) unsupervised clustering for proposing novel cell type candidates; (4) merging multi-resolution features from previous steps into a condensed feature space; (5) training a classifier on reference data for supervised annotation; and (6) resolving inconsistencies between supervised predictions and unsupervised clusters to finalize annotations, particularly for unseen types. Performance was evaluated across 10 public genomic datasets and perform a case study on a molecular cell atlas of the human lung. HiCat demonstrated superior performance in both known cell type classification and novel cell type identification. In benchmark evaluations, HiCat consistently outperformed existing methods, critically excelling in identifying and distinguishing multiple novel cell types. HiCat presents a robust framework for scRNA-seq cell annotation, improving classification accuracy and novel type identification. In addition, it provides a scalable and transferable solution for biomedical research, directly addressing key challenges in automated cell annotation.

摘要

现有的细胞类型注释方法面临重大障碍

监督方法往往无法区分参考数据中不存在的新型细胞类型,而无监督技术可能会受到聚类不纯的影响,并且难以可靠地区分多个不同的未知细胞群体。这一关键差距促使了HiCat的开发,HiCat是一种专门设计用于克服这些限制的半监督流程。HiCat是一种半监督流程,它整合了两种方法,利用参考(标记)和查询(未标记)基因组数据,同时提高已知细胞类型的注释准确性,并改善新型细胞类型的发现和区分。HiCat遵循一个结构化流程:(1)消除批次效应并生成低维嵌入;(2)进行非线性降维以捕获关键模式;(3)进行无监督聚类以提出新型细胞类型候选;(4)将前几步的多分辨率特征合并到一个压缩特征空间;(5)在参考数据上训练分类器以进行监督注释;(6)解决监督预测和无监督聚类之间的不一致以最终确定注释,特别是对于未见类型。在10个公共基因组数据集上评估了性能,并对人类肺的分子细胞图谱进行了案例研究。HiCat在已知细胞类型分类和新型细胞类型识别方面均表现出卓越性能。在基准评估中,HiCat始终优于现有方法,在识别和区分多种新型细胞类型方面表现出色。HiCat为scRNA-seq细胞注释提供了一个强大的框架,提高了分类准确性和新型类型识别能力。此外,它为生物医学研究提供了一种可扩展且可转移的解决方案,直接解决了自动细胞注释中的关键挑战。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/616183350680/bbaf428f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/bc4f6f420f80/bbaf428f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/0571c48cb8c5/bbaf428f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/acc081a3c568/bbaf428f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/9b90a8f01827/bbaf428f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/5672aaab92c5/bbaf428f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/3a3e96abc949/bbaf428f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/616183350680/bbaf428f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/bc4f6f420f80/bbaf428f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/0571c48cb8c5/bbaf428f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/acc081a3c568/bbaf428f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/9b90a8f01827/bbaf428f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/5672aaab92c5/bbaf428f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/3a3e96abc949/bbaf428f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c99/12365967/616183350680/bbaf428f7.jpg

相似文献

1
HiCat: a semi-supervised approach for cell type annotation.HiCat:一种用于细胞类型注释的半监督方法。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf428.
2
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
3
An Unsupervised Learning Algorithm for the Automatic Classification of Coronary Artery Lesions.一种用于冠状动脉病变自动分类的无监督学习算法。
Cureus. 2025 Jul 24;17(7):e88638. doi: 10.7759/cureus.88638. eCollection 2025 Jul.
4
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
5
ScInfeR: an efficient method for annotating cell types and sub-types in single-cell RNA-seq, ATAC-seq, and spatial omics.ScInfeR:一种用于在单细胞RNA测序、ATAC测序和空间组学中注释细胞类型和亚型的有效方法。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf253.
6
A segment anything model-guided and match-based semi-supervised segmentation framework for medical imaging.一种用于医学成像的基于段式分割模型引导和匹配的半监督分割框架。
Med Phys. 2025 Mar 29. doi: 10.1002/mp.17785.
7
Annotating neurophysiologic data at scale with optimized human input.通过优化的人工输入大规模标注神经生理数据。
J Neural Eng. 2025 Jul 3;22(4). doi: 10.1088/1741-2552/ade402.
8
Semi-supervised semantic segmentation of cell nuclei with diffusion model and collaborative learning.基于扩散模型和协同学习的细胞核半监督语义分割
J Med Imaging (Bellingham). 2025 Nov;12(6):061403. doi: 10.1117/1.JMI.12.6.061403. Epub 2025 Mar 20.
9
Ensemble machine learning-based pre-trained annotation approach for scRNA-seq data using gradient boosting with genetic optimizer.基于集成机器学习的预训练注释方法,用于使用带有遗传优化器的梯度提升的单细胞RNA测序数据。
BMC Bioinformatics. 2025 Jul 1;26(1):166. doi: 10.1186/s12859-025-06151-y.
10
Semi-Supervised Learning Allows for Improved Segmentation With Reduced Annotations of Brain Metastases Using Multicenter MRI Data.半监督学习可利用多中心MRI数据,通过减少脑转移瘤的标注来改进分割。
J Magn Reson Imaging. 2025 Jun;61(6):2469-2479. doi: 10.1002/jmri.29686. Epub 2025 Jan 10.

本文引用的文献

1
Interleukin-7-based identification of liver lymphatic endothelial cells reveals their unique structural features.基于白细胞介素-7对肝淋巴管内皮细胞的鉴定揭示了它们独特的结构特征。
JHEP Rep. 2024 Mar 18;6(7):101069. doi: 10.1016/j.jhepr.2024.101069. eCollection 2024 Jul.
2
Mast cells: a novel therapeutic avenue for cardiovascular diseases?肥大细胞:心血管疾病的新治疗途径?
Cardiovasc Res. 2024 May 29;120(7):681-698. doi: 10.1093/cvr/cvae066.
3
scDOT: enhancing single-cell RNA-Seq data annotation and uncovering novel cell types through multi-reference integration.
scDOT:通过多参考整合增强单细胞RNA测序数据注释并揭示新型细胞类型
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae072.
4
Mast cell marker gene signature: prognosis and immunotherapy response prediction in lung adenocarcinoma through integrated scRNA-seq and bulk RNA-seq.肥大细胞标志物基因特征:通过整合 scRNA-seq 和 bulk RNA-seq 预测肺腺癌的预后和免疫治疗反应。
Front Immunol. 2023 May 15;14:1189520. doi: 10.3389/fimmu.2023.1189520. eCollection 2023.
5
Dictionary learning for integrative, multimodal and scalable single-cell analysis.基于字典学习的综合、多模态和可扩展的单细胞分析。
Nat Biotechnol. 2024 Feb;42(2):293-304. doi: 10.1038/s41587-023-01767-y. Epub 2023 May 25.
6
scAnnotate: an automated cell-type annotation tool for single-cell RNA-sequencing data.scAnnotate:一种用于单细胞RNA测序数据的自动细胞类型注释工具。
Bioinform Adv. 2023 Mar 13;3(1):vbad030. doi: 10.1093/bioadv/vbad030. eCollection 2023.
7
scSemiGAN: a single-cell semi-supervised annotation and dimensionality reduction framework based on generative adversarial network.scSemiGAN:基于生成对抗网络的单细胞半监督注释和降维框架。
Bioinformatics. 2022 Nov 15;38(22):5042-5048. doi: 10.1093/bioinformatics/btac652.
8
Mapping and Validation of scRNA-Seq-Derived Cell-Cell Communication Networks in the Tumor Microenvironment.单细胞 RNA 测序衍生的肿瘤微环境细胞间通讯网络的绘制和验证。
Front Immunol. 2022 Apr 28;13:885267. doi: 10.3389/fimmu.2022.885267. eCollection 2022.
9
Single-cell RNA sequencing technologies and applications: A brief overview.单细胞 RNA 测序技术及应用:简述。
Clin Transl Med. 2022 Mar;12(3):e694. doi: 10.1002/ctm2.694.
10
scMAGIC: accurately annotating single cells using two rounds of reference-based classification.scMAGIC:使用两轮基于参考的分类方法准确注释单细胞。
Nucleic Acids Res. 2022 May 6;50(8):e43. doi: 10.1093/nar/gkab1275.