Bueckle Andreas, Herr Bruce W, Chen Lu, Bolin Daniel, Qaurooni Danial, Ginda Michael, Jain Yashvardhan, Puig-Barbe Aleix, Ardlie Kristin, Wang Fusheng, Börner Katy
Department of Intelligent Systems Engineering, Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN, 47408, USA.
Department of Computer Science and Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, 11794, USA.
bioRxiv. 2025 Aug 20:2025.08.14.670406. doi: 10.1101/2025.08.14.670406.
The human body contains ~27-36 trillion cells of up to 10,000 cell types (CTs) within a volume of ~62-120 liters (males) and 52-89 liters (females). The Human Reference Atlas (HRA) v2.3 provides a quantitative 3D framework of CTs across 73 reference organs and 1,283 3D anatomical structures (ASs). The HRA Cell Type Population (HRApop) effort quantifies CTs per AS using high-quality single-cell (sc) data processed through scalable, reproducible workflows and cell type annotation (CTann) tools. HRApop v1.0 includes reference CT populations for 73 ASs (112 when sex-specific) using 662 datasets spatially registered to 230 locations across 17 organs (31 when sex-specific). For 558 sc-transcriptomics datasets (11,042,750 cells), CTs and biomarker expression were computed using Azimuth, CellTypist, and popV. To test generalizability, 104 sc-proteomics datasets (16,576,863 cells) were integrated. In total, HRApop includes 27,619,613 cells. HRApop can be used to predict (1) CT populations for 3D volumes in the human body and (2) the spatial origin of a tissue block, given a CT population. Data and code are at cns-iu.github.io/hra-cell-type-populations-supporting-information.
人体在约62 - 120升(男性)和52 - 89升(女性)的体积内包含约27 - 36万亿个细胞,多达10000种细胞类型(CTs)。人类参考图谱(HRA)v2.3提供了跨越73个参考器官和1283个三维解剖结构(ASs)的细胞类型定量三维框架。HRA细胞类型群体(HRApop)工作通过可扩展、可重复的工作流程和细胞类型注释(CTann)工具,使用高质量的单细胞(sc)数据对每个解剖结构的细胞类型进行定量。HRApop v1.0包括73个解剖结构(性别特异性时为112个)的参考细胞类型群体,使用了662个在空间上注册到17个器官(性别特异性时为31个)的230个位置的数据集。对于558个sc转录组学数据集(11042750个细胞),使用Azimuth、CellTypist和popV计算细胞类型和生物标志物表达。为了测试通用性,整合了104个sc蛋白质组学数据集(16576863个细胞)。HRApop总共包括27619613个细胞。HRApop可用于预测(1)人体三维体积中的细胞类型群体,以及(2)给定细胞类型群体时组织块的空间起源。数据和代码位于cns - iu.github.io/hra - cell - type - populations - supporting - information。