Suppr超能文献

用于单细胞RNA测序分析中聚类和细胞类型分类的神经网络迭代迁移学习

Iterative transfer learning with neural network for clustering and cell type classification in single-cell RNA-seq analysis.

作者信息

Hu Jian, Li Xiangjie, Hu Gang, Lyu Yafei, Susztak Katalin, Li Mingyao

机构信息

Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA.

State Key Laboratory of Cardiovascular Disease, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100037, China.

出版信息

Nat Mach Intell. 2020 Oct;2(10):607-618. doi: 10.1038/s42256-020-00233-7. Epub 2020 Oct 5.

Abstract

Clustering and cell type classification are important steps in single-cell RNA-seq (scRNA-seq) analysis. As more and more scRNA-seq data are becoming available, supervised cell type classification methods that utilize external well-annotated source data start to gain popularity over unsupervised clustering algorithms. However, the performance of existing supervised methods is highly dependent on source data quality, and they often have limited accuracy to classify cell types that are missing in the source data. To overcome these limitations, we developed ItClust, a transfer learning algorithm that borrows idea from supervised cell type classification algorithms, but also leverages information in target data to ensure sensitivity in classifying cells that are only present in the target data. Through extensive evaluations using data from different species and tissues generated with diverse scRNA-seq protocols, we show that ItClust significantly improves clustering and cell type classification accuracy over popular unsupervised clustering and supervised cell type classification algorithms.

摘要

聚类和细胞类型分类是单细胞RNA测序(scRNA-seq)分析中的重要步骤。随着越来越多的scRNA-seq数据可用,利用外部注释良好的源数据的监督细胞类型分类方法开始比无监督聚类算法更受欢迎。然而,现有监督方法的性能高度依赖于源数据质量,并且它们在对源数据中缺失的细胞类型进行分类时准确性往往有限。为了克服这些限制,我们开发了ItClust,这是一种迁移学习算法,它借鉴了监督细胞类型分类算法的思想,但也利用目标数据中的信息来确保对仅存在于目标数据中的细胞进行分类时的敏感性。通过使用来自不同物种和组织、采用不同scRNA-seq方案生成的数据进行广泛评估,我们表明ItClust比流行的无监督聚类和监督细胞类型分类算法显著提高了聚类和细胞类型分类的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb8/8009055/8424a097f6b1/nihms-1623444-f0007.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验