Suppr超能文献

scLKME:一种基于地标从单细胞数据生成多细胞样本嵌入的方法。

scLKME: A Landmark-based Approach for Generating Multi-cellular Sample Embeddings from Single-cell Data.

作者信息

Yi Haidong, Stanley Natalie

机构信息

Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill NC 27599, USA.

Computational Medicine Program, University of North Carolina at Chapel Hill, Chapel Hill NC 27599, USA.

出版信息

bioRxiv. 2023 Nov 15:2023.11.13.566846. doi: 10.1101/2023.11.13.566846.

Abstract

Single-cell technologies enable high-dimensional profiling of individual cells, therefore offering profound insights into subtle variation between specialized cell-types. However, translating the multitude of nuanced cellular profiles into meaningful per-sample representations is challenging due to heterogeneous cellular composition across individual profiled samples. To compute informative per-sample representations, we developed scLKME, a novel approach that uses a landmark-based kernel mean embedding method to convert multi-sample single-cell data into compact per-sample embeddings. Treating each sample as a distribution over cells, scLKME identifies landmarks across samples and maps these distributions into a reproducing kernel Hilbert space. Overall, scLKME outperforms state-of-the-art techniques in robustness, efficiency, accuracy, and practical usefulness of sample embeddings. Its application on a CyTOF dataset profiling immune responses in preterm birth highlighted its capacity to accurately identify patient-specific variations correlating with gestational age, suggesting broad applicability to multi-sample single-cell datasets with complex experimental designs. scLKME is available as an open-sourced python package at https://github.com/CompCy-lab/scLKME.

摘要

单细胞技术能够对单个细胞进行高维分析,从而深入洞察特化细胞类型之间的细微差异。然而,由于各个被分析样本的细胞组成存在异质性,将众多细微的细胞分析结果转化为有意义的每个样本的表征具有挑战性。为了计算有信息量的每个样本的表征,我们开发了scLKME,这是一种新颖的方法,它使用基于地标点的核均值嵌入方法将多样本单细胞数据转换为紧凑的每个样本的嵌入。将每个样本视为细胞上的分布,scLKME识别样本间的地标点,并将这些分布映射到再生核希尔伯特空间。总体而言,scLKME在样本嵌入的稳健性、效率、准确性和实际实用性方面优于现有技术。它在一个分析早产免疫反应的CyTOF数据集上的应用突出了其准确识别与胎龄相关的患者特异性差异的能力,表明其广泛适用于具有复杂实验设计的多样本单细胞数据集。scLKME作为一个开源的Python包可在https://github.com/CompCy-lab/scLKME获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67aa/10680595/fc0723c97f3e/nihpp-2023.11.13.566846v1-f0004.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验