• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

XgCPred:基于 XGBoost-CNN 集成和单细胞 RNAseq 数据中基因表达成像的细胞类型分类。

XgCPred: Cell type classification using XGBoost-CNN integration and exploiting gene expression imaging in single-cell RNAseq data.

机构信息

Hijjawi Faculty for Engineering Technology, Biomedical Systems and Informatics Engineering Department, Yarmouk University, Irbid, 21163, Jordan.

Hijjawi Faculty for Engineering Technology, Biomedical Systems and Informatics Engineering Department, Yarmouk University, Irbid, 21163, Jordan.

出版信息

Comput Biol Med. 2024 Oct;181:109066. doi: 10.1016/j.compbiomed.2024.109066. Epub 2024 Aug 24.

DOI:10.1016/j.compbiomed.2024.109066
PMID:39180857
Abstract

BACKGROUND

The advent of single-cell RNA sequencing (scRNA-seq) has revolutionized transcriptomic research by enabling the exploration of gene expression at an individual cell level. This advancement sheds light on how cells differentiate and evolve over time. Effectively classification cell types within scRNA-seq datasets are essential for understanding the intricate cell compositions within tissues and elucidating the origins of various diseases. Challenges persist in the field, emphasizing the need for precise categorization across diverse datasets, addressing aggregated cell data, and managing the complexity of high-dimensional data spaces.

METHODOLOGY

XgCPred is a novel approach combining XGBoost with convolutional neural networks (CNNs) to provide cell type classification with better accuracy in single-cell RNA-seq data. This combo reveals how well CNNs can detect spatial hierarchy in gene expression images and how XGBoost performs with large volumes of data. XgCPred utilizes an imaging representation of gene expression that is based on the hierarchical organization of genes found in the KEGG BRITE database.

RESULTS

Rigorous testing of XgCPred across multiple scRNA-seq datasets, each presenting unique challenges such as varying cell counts, gene expression diversity, and cellular heterogeneity, has demonstrated its superiority compared to earlier methods. The algorithm shows remarkable accuracy and precision in cell type annotation, achieving near-perfect classification scores in some cases. These results underscore its capability to effectively manage data variability.

CONCLUSIONS

XgCPred distinguishes itself through its dependable and accurate cell type classification across a range of scRNA-seq datasets. Its effectiveness stems from sophisticated data handling and its ability to adapt to the complexities inherent in scRNA-seq data. XgCPred delivers reliable cell annotations essential for further biological analysis and research, marking a significant advancement in genomic studies. With scRNA-seq datasets growing in size and complexity, XgCPred offers a scalable and potent solution for cell type identification, potentially enhancing our understanding of cellular biology and aiding in the precise detection of diseases. XgCPred is a useful tool in genomic research and tailored therapy because it solves current constraints on computing efficiency and generalizability.

摘要

背景

单细胞 RNA 测序(scRNA-seq)的出现彻底改变了转录组学研究,使人们能够在单个细胞水平上探索基因表达。这一进展揭示了细胞是如何随着时间的推移而分化和进化的。在 scRNA-seq 数据集中有效地对细胞类型进行分类对于理解组织内复杂的细胞组成以及阐明各种疾病的起源至关重要。该领域仍然存在挑战,强调需要在不同数据集之间进行精确分类,处理聚合细胞数据,并管理高维数据空间的复杂性。

方法

XgCPred 是一种将 XGBoost 与卷积神经网络(CNNs)相结合的新方法,用于提供单细胞 RNA-seq 数据中具有更高准确性的细胞类型分类。该组合揭示了 CNN 如何检测基因表达图像中的空间层次结构,以及 XGBoost 如何处理大量数据。XgCPred 利用基于 KEGG BRITE 数据库中发现的基因层次结构的基因表达图像表示。

结果

在多个 scRNA-seq 数据集上对 XgCPred 进行了严格的测试,每个数据集都提出了独特的挑战,如不同的细胞计数、基因表达多样性和细胞异质性,结果表明它优于早期的方法。该算法在细胞类型注释方面表现出了出色的准确性和精度,在某些情况下达到了近乎完美的分类分数。这些结果强调了它有效管理数据变异性的能力。

结论

XgCPred 通过在一系列 scRNA-seq 数据集中进行可靠和准确的细胞类型分类而脱颖而出。其有效性源于复杂的数据处理和对 scRNA-seq 数据固有复杂性的适应能力。XgCPred 提供了可靠的细胞注释,这对于进一步的生物学分析和研究至关重要,标志着基因组研究的重大进展。随着 scRNA-seq 数据集的规模和复杂性的增加,XgCPred 为细胞类型识别提供了一个可扩展且强大的解决方案,可能增强我们对细胞生物学的理解,并有助于精确检测疾病。XgCPred 是基因组研究和个体化治疗的有用工具,因为它解决了计算效率和通用性方面的当前限制。

相似文献

1
XgCPred: Cell type classification using XGBoost-CNN integration and exploiting gene expression imaging in single-cell RNAseq data.XgCPred:基于 XGBoost-CNN 集成和单细胞 RNAseq 数据中基因表达成像的细胞类型分类。
Comput Biol Med. 2024 Oct;181:109066. doi: 10.1016/j.compbiomed.2024.109066. Epub 2024 Aug 24.
2
scPLAN: a hierarchical computational framework for single transcriptomics data annotation, integration and cell-type label refinement.scPLAN:一种用于单细胞转录组学数据注释、整合和细胞类型标签细化的分层计算框架。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae305.
3
A Comprehensive Survey of Statistical Approaches for Differential Expression Analysis in Single-Cell RNA Sequencing Studies.单细胞 RNA 测序研究中差异表达分析的统计方法综合综述。
Genes (Basel). 2021 Dec 2;12(12):1947. doi: 10.3390/genes12121947.
4
Scanorama: integrating large and diverse single-cell transcriptomic datasets.Scanorama:整合大型和多样化的单细胞转录组数据集。
Nat Protoc. 2024 Aug;19(8):2283-2297. doi: 10.1038/s41596-024-00991-3. Epub 2024 Jun 6.
5
scGGAN: single-cell RNA-seq imputation by graph-based generative adversarial network.scGGAN:基于图的生成对抗网络的单细胞RNA测序数据插补
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad040.
6
Latent cellular analysis robustly reveals subtle diversity in large-scale single-cell RNA-seq data.潜伏细胞分析能稳健地揭示大规模单细胞 RNA-seq 数据中的细微多样性。
Nucleic Acids Res. 2019 Dec 16;47(22):e143. doi: 10.1093/nar/gkz826.
7
scSwinFormer: A Transformer-Based Cell-Type Annotation Method for scRNA-Seq Data Using Smooth Gene Embedding and Global Features.scSwinFormer:一种基于 Transformer 的单细胞 RNA-Seq 数据细胞类型注释方法,使用平滑基因嵌入和全局特征。
J Chem Inf Model. 2024 Aug 26;64(16):6316-6323. doi: 10.1021/acs.jcim.4c00616. Epub 2024 Aug 5.
8
Clustering scRNA-seq data with the cross-view collaborative information fusion strategy.使用跨视图协同信息融合策略对 scRNA-seq 数据进行聚类。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae511.
9
Identifying cell states in single-cell RNA-seq data at statistically maximal resolution.以统计学上最大分辨率识别单细胞 RNA-seq 数据中的细胞状态。
PLoS Comput Biol. 2024 Jul 12;20(7):e1012224. doi: 10.1371/journal.pcbi.1012224. eCollection 2024 Jul.
10
Single-cell analysis via manifold fitting: A framework for RNA clustering and beyond.单细胞分析通过流形拟合:RNA 聚类及其他。
Proc Natl Acad Sci U S A. 2024 Sep 10;121(37):e2400002121. doi: 10.1073/pnas.2400002121. Epub 2024 Sep 3.

引用本文的文献

1
Intelligent traffic congestion forecasting using BiLSTM and adaptive secretary bird optimizer for sustainable urban transportation.基于双向长短期记忆网络(BiLSTM)和自适应秘书鸟优化器的智能交通拥堵预测用于可持续城市交通
Sci Rep. 2025 May 26;15(1):18423. doi: 10.1038/s41598-025-02933-9.
2
Accelerating antimicrobial peptide design: Leveraging deep learning for rapid discovery.加速抗菌肽设计:利用深度学习实现快速发现
PLoS One. 2024 Dec 20;19(12):e0315477. doi: 10.1371/journal.pone.0315477. eCollection 2024.