Gondal Mahnoor N, Shah Saad Ur Rehman, Chinnaiyan Arul M, Cieslik Marcin
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States.
Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI, United States.
Front Bioinform. 2024 Jul 8;4:1417428. doi: 10.3389/fbinf.2024.1417428. eCollection 2024.
Rapid advancements in high-throughput single-cell RNA-seq (scRNA-seq) technologies and experimental protocols have led to the generation of vast amounts of transcriptomic data that populates several online databases and repositories. Here, we systematically examined large-scale scRNA-seq databases, categorizing them based on their scope and purpose such as general, tissue-specific databases, disease-specific databases, cancer-focused databases, and cell type-focused databases. Next, we discuss the technical and methodological challenges associated with curating large-scale scRNA-seq databases, along with current computational solutions. We argue that understanding scRNA-seq databases, including their limitations and assumptions, is crucial for effectively utilizing this data to make robust discoveries and identify novel biological insights. Such platforms can help bridge the gap between computational and wet lab scientists through user-friendly web-based interfaces needed for democratizing access to single-cell data. These platforms would facilitate interdisciplinary research, enabling researchers from various disciplines to collaborate effectively. This review underscores the importance of leveraging computational approaches to unravel the complexities of single-cell data and offers a promising direction for future research in the field.
高通量单细胞RNA测序(scRNA-seq)技术和实验方案的快速发展,已产生了大量转录组数据,并被存入多个在线数据库和资源库。在此,我们系统地研究了大规模scRNA-seq数据库,并根据其范围和用途进行分类,如通用数据库、组织特异性数据库、疾病特异性数据库、癌症聚焦数据库和细胞类型聚焦数据库。接下来,我们讨论了整理大规模scRNA-seq数据库所面临的技术和方法挑战,以及当前的计算解决方案。我们认为,了解scRNA-seq数据库,包括其局限性和假设,对于有效利用这些数据做出可靠发现并识别新的生物学见解至关重要。此类平台可以通过用户友好的基于网络的界面,帮助弥合计算科学家和湿实验室科学家之间的差距,从而实现单细胞数据的普及访问。这些平台将促进跨学科研究,使来自不同学科的研究人员能够有效合作。本综述强调了利用计算方法来揭示单细胞数据复杂性的重要性,并为该领域未来的研究提供了一个有前景的方向。