• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多分辨率分类回归用于可解释的细胞类型注释。

Multiresolution categorical regression for interpretable cell-type annotation.

机构信息

School of Statistics, University of Minnesota, Minneapolis, Minnesota, USA.

Department of Biostatistics, University of Washington, Seattle, Washington, USA.

出版信息

Biometrics. 2023 Dec;79(4):3485-3496. doi: 10.1111/biom.13926. Epub 2023 Oct 5.

DOI:10.1111/biom.13926
PMID:37798600
Abstract

In many categorical response regression applications, the response categories admit a multiresolution structure. That is, subsets of the response categories may naturally be combined into coarser response categories. In such applications, practitioners are often interested in estimating the resolution at which a predictor affects the response category probabilities. In this paper, we propose a method for fitting the multinomial logistic regression model in high dimensions that addresses this problem in a unified and data-driven way. Our method allows practitioners to identify which predictors distinguish between coarse categories but not fine categories, which predictors distinguish between fine categories, and which predictors are irrelevant. For model fitting, we propose a scalable algorithm that can be applied when the coarse categories are defined by either overlapping or nonoverlapping sets of fine categories. Statistical properties of our method reveal that it can take advantage of this multiresolution structure in a way existing estimators cannot. We use our method to model cell-type probabilities as a function of a cell's gene expression profile (i.e., cell-type annotation). Our fitted model provides novel biological insights which may be useful for future automated and manual cell-type annotation methodology.

摘要

在许多分类响应回归应用中,响应类别具有多分辨率结构。也就是说,响应类别的子集可以自然地组合成更粗糙的响应类别。在这种应用中,从业者通常有兴趣估计预测器影响响应类别概率的分辨率。在本文中,我们提出了一种用于拟合多项逻辑回归模型的方法,该方法以统一和数据驱动的方式解决了这个问题。我们的方法允许从业者识别哪些预测器区分粗类别但不区分细类别,哪些预测器区分细类别,以及哪些预测器是不相关的。对于模型拟合,我们提出了一种可扩展的算法,当粗类别由细类别的重叠或非重叠集合定义时,可以应用该算法。我们方法的统计性质表明,它可以以现有估计器无法做到的方式利用这种多分辨率结构。我们使用我们的方法来模拟细胞类型的概率作为细胞表达谱(即细胞类型注释)的函数。我们拟合的模型提供了新的生物学见解,这可能对未来的自动和手动细胞类型注释方法有用。

相似文献

1
Multiresolution categorical regression for interpretable cell-type annotation.多分辨率分类回归用于可解释的细胞类型注释。
Biometrics. 2023 Dec;79(4):3485-3496. doi: 10.1111/biom.13926. Epub 2023 Oct 5.
2
scAnno: a deconvolution strategy-based automatic cell type annotation tool for single-cell RNA-sequencing data sets.scAnno:一种基于去卷积策略的单细胞 RNA 测序数据集自动细胞类型注释工具。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad179.
3
A nonparametric multiple imputation approach for missing categorical data.一种针对缺失分类数据的非参数多重填补方法。
BMC Med Res Methodol. 2017 Jun 6;17(1):87. doi: 10.1186/s12874-017-0360-2.
4
The multiscale coarse-graining method. VIII. Multiresolution hierarchical basis functions and basis function selection in the construction of coarse-grained force fields.多尺度粗粒化方法。VIII. 粗粒化力场构建中的多分辨层次基函数和基函数选择。
J Chem Phys. 2012 May 21;136(19):194113. doi: 10.1063/1.4705384.
5
scGAD: a new task and end-to-end framework for generalized cell type annotation and discovery.scGAD:用于广义细胞类型注释和发现的新任务和端到端框架。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad045.
6
An improved zebrafish transcriptome annotation for sensitive and comprehensive detection of cell type-specific genes.改进的斑马鱼转录组注释,用于敏感和全面检测细胞类型特异性基因。
Elife. 2020 Aug 24;9:e55792. doi: 10.7554/eLife.55792.
7
Transcriptome profile of a bovine respiratory disease pathogen: Mannheimia haemolytica PHL213.牛呼吸道疾病病原体的转录组图谱:溶血曼海姆菌 PHL213。
BMC Bioinformatics. 2012;13 Suppl 15(Suppl 15):S4. doi: 10.1186/1471-2105-13-S15-S4. Epub 2012 Sep 11.
8
A Multi-way Multi-task Learning Approach for Multinomial Logistic Regression*. An Application in Joint Prediction of Appointment Miss-opportunities across Multiple Clinics.一种用于多项式逻辑回归的多路多任务学习方法*。在多个诊所预约错失机会联合预测中的应用。
Methods Inf Med. 2017 Aug 11;56(4):294-307. doi: 10.3414/ME16-01-0112. Epub 2017 Jun 7.
9
Automatic Cell Type Annotation Using Marker Genes for Single-Cell RNA Sequencing Data.基于标记基因的单细胞 RNA 测序数据自动细胞类型注释。
Biomolecules. 2022 Oct 21;12(10):1539. doi: 10.3390/biom12101539.
10
Improving autocoding performance of rare categories in injury classification: Is more training data or filtering the solution?提高伤害分类中罕见类别的自动编码性能:更多的训练数据还是过滤是解决方案?
Accid Anal Prev. 2018 Jan;110:115-127. doi: 10.1016/j.aap.2017.10.020. Epub 2017 Nov 8.