• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MIMIC:一种用于鉴定细胞分选的细胞类型特异性标记组合的优化方法。

MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting.

机构信息

Department of Mathematics, Huazhong University of Science and Technology, Beijing 100190, China.

Department of Genetics and Biochemistry, Clemson University, Beijing 100190, China.

出版信息

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab235.

DOI:10.1093/bib/bbab235
PMID:34180954
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8575015/
Abstract

Multi-omics data allow us to select a small set of informative markers for the discrimination of specific cell types and study of cellular heterogeneity. However, it is often challenging to choose an optimal marker panel from the high-dimensional molecular profiles for a large amount of cell types. Here, we propose a method called Mixed Integer programming Model to Identify Cell type-specific marker panel (MIMIC). MIMIC maintains the hierarchical topology among different cell types and simultaneously maximizes the specificity of a fixed number of selected markers. MIMIC was benchmarked on the mouse ENCODE RNA-seq dataset, with 29 diverse tissues, for 43 surface markers (SMs) and 1345 transcription factors (TFs). MIMIC could select biologically meaningful markers and is robust for different accuracy criteria. It shows advantages over the standard single gene-based approaches and widely used dimensional reduction methods, such as multidimensional scaling and t-SNE, both in accuracy and in biological interpretation. Furthermore, the combination of SMs and TFs achieves better specificity than SMs or TFs alone. Applying MIMIC to a large collection of 641 RNA-seq samples covering 231 cell types identifies a panel of TFs and SMs that reveal the modularity of cell type association networks. Finally, the scalability of MIMIC is demonstrated by selecting enhancer markers from mouse ENCODE data. MIMIC is freely available at https://github.com/MengZou1/MIMIC.

摘要

多组学数据使我们能够选择一小部分信息丰富的标记物,用于区分特定的细胞类型和研究细胞异质性。然而,从大量细胞类型的高维分子谱中选择最佳的标记物组合通常具有挑战性。在这里,我们提出了一种称为混合整数规划模型来识别细胞类型特异性标记物组合(MIMIC)的方法。MIMIC 保持了不同细胞类型之间的层次拓扑结构,同时最大化了固定数量选定标记物的特异性。MIMIC 在小鼠 ENCODE RNA-seq 数据集上进行了基准测试,该数据集包含 29 种不同的组织,用于 43 个表面标记物(SMs)和 1345 个转录因子(TFs)。MIMIC 可以选择具有生物学意义的标记物,并且对于不同的准确性标准具有鲁棒性。它在准确性和生物学解释方面都优于标准的基于单个基因的方法和广泛使用的降维方法,如多维缩放和 t-SNE。此外,SMs 和 TFs 的组合比单独使用 SMs 或 TFs 具有更好的特异性。将 MIMIC 应用于包含 231 种细胞类型的 641 个 RNA-seq 样本的大型集合中,鉴定出一组 TFs 和 SMs,这些标记物揭示了细胞类型关联网络的模块化。最后,通过从小鼠 ENCODE 数据中选择增强子标记物,证明了 MIMIC 的可扩展性。MIMIC 可在 https://github.com/MengZou1/MIMIC 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/327d4d8a4e8e/bbab235f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/3740ff57f6e9/bbab235f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/4614dcf36df4/bbab235f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/0363d7672aa1/bbab235f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/447557841416/bbab235f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/fd1f4904e275/bbab235f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/d8ca24e18b1d/bbab235f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/327d4d8a4e8e/bbab235f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/3740ff57f6e9/bbab235f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/4614dcf36df4/bbab235f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/0363d7672aa1/bbab235f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/447557841416/bbab235f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/fd1f4904e275/bbab235f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/d8ca24e18b1d/bbab235f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3f1/8575015/327d4d8a4e8e/bbab235f7.jpg

相似文献

1
MIMIC: an optimization method to identify cell type-specific marker panel for cell sorting.MIMIC:一种用于鉴定细胞分选的细胞类型特异性标记组合的优化方法。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab235.
2
MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data.MAGIC:一种利用 ENCODE 数据预测转录因子和协同因子驱动基因集的工具。
PLoS Comput Biol. 2020 Apr 6;16(4):e1007800. doi: 10.1371/journal.pcbi.1007800. eCollection 2020 Apr.
3
Polled Digital Cell Sorter (p-DCS): Automatic identification of hematological cell types from single cell RNA-sequencing clusters.流式数字细胞分选仪(p-DCS):从单细胞 RNA 测序簇中自动识别血细胞类型。
BMC Bioinformatics. 2019 Jul 1;20(1):369. doi: 10.1186/s12859-019-2951-x.
4
Mathematical Programming for Modeling Expression of a Gene Using Gurobi Optimizer to Identify Its Transcriptional Regulators.使用 Gurobi 优化器对基因表达进行数学规划,以识别其转录调控因子。
Methods Mol Biol. 2021;2328:99-113. doi: 10.1007/978-1-0716-1534-8_6.
5
scTIM: seeking cell-type-indicative marker from single cell RNA-seq data by consensus optimization.scTIM:通过共识优化从单细胞 RNA-seq 数据中寻找细胞类型指示性标记物。
Bioinformatics. 2020 Apr 15;36(8):2474-2485. doi: 10.1093/bioinformatics/btz936.
6
scSorter: assigning cells to known cell types according to marker genes.scSorter:根据标记基因将细胞分配到已知细胞类型中。
Genome Biol. 2021 Feb 22;22(1):69. doi: 10.1186/s13059-021-02281-7.
7
mTFkb: a knowledgebase for fundamental annotation of mouse transcription factors.mTFkb:一个用于注释小鼠转录因子的基础知识的知识库。
Sci Rep. 2017 Jun 8;7(1):3022. doi: 10.1038/s41598-017-02404-w.
8
DeePathology: Deep Multi-Task Learning for Inferring Molecular Pathology from Cancer Transcriptome.DeePathology:从癌症转录组推断分子病理学的深度多任务学习。
Sci Rep. 2019 Nov 11;9(1):16526. doi: 10.1038/s41598-019-52937-5.
9
TF-Marker: a comprehensive manually curated database for transcription factors and related markers in specific cell and tissue types in human.TF-Marker:一个全面的、经过人工精心整理的数据库,包含人类特定细胞和组织类型中转录因子及其相关标志物的信息。
Nucleic Acids Res. 2022 Jan 7;50(D1):D402-D412. doi: 10.1093/nar/gkab1114.
10
A computational approach for phenotypic comparisons of cell populations in high-dimensional cytometry data.一种用于高维流式细胞术数据中细胞群体表型比较的计算方法。
Methods. 2018 Jan 1;132:66-75. doi: 10.1016/j.ymeth.2017.09.005. Epub 2017 Sep 14.

引用本文的文献

1
Multiomics with Evolutionary Computation to Identify Molecular and Module Biomarkers for Early Diagnosis and Treatment of Complex Disease.结合多组学与进化计算以识别复杂疾病早期诊断和治疗的分子及模块生物标志物。
Genes (Basel). 2025 Feb 20;16(3):244. doi: 10.3390/genes16030244.

本文引用的文献

1
A Quantitative Proteome Map of the Human Body.人体定量蛋白质组图谱。
Cell. 2020 Oct 1;183(1):269-283.e19. doi: 10.1016/j.cell.2020.08.036. Epub 2020 Sep 10.
2
Integrative analysis of single-cell genomics data by coupled nonnegative matrix factorizations.基于耦合非负矩阵分解的单细胞基因组学数据综合分析。
Proc Natl Acad Sci U S A. 2018 Jul 24;115(30):7723-7728. doi: 10.1073/pnas.1805681115. Epub 2018 Jul 9.
3
Modeling gene regulation from paired expression and chromatin accessibility data.基于表达和染色质可及性数据的基因调控建模。
Proc Natl Acad Sci U S A. 2017 Jun 20;114(25):E4914-E4923. doi: 10.1073/pnas.1704553114. Epub 2017 Jun 2.
4
Models of global gene expression define major domains of cell type and tissue identity.全球基因表达模型定义了细胞类型和组织特性的主要领域。
Nucleic Acids Res. 2017 Mar 17;45(5):2354-2367. doi: 10.1093/nar/gkx054.
5
A systematic method to identify modulation of transcriptional regulation via chromatin activity reveals regulatory network during mESC differentiation.一种通过染色质活性鉴定转录调控调节的系统方法揭示了小鼠胚胎干细胞分化过程中的调控网络。
Sci Rep. 2016 Mar 7;6:22656. doi: 10.1038/srep22656.
6
A Systematic Approach to Identify Candidate Transcription Factors that Control Cell Identity.一种系统的方法来确定控制细胞身份的候选转录因子。
Stem Cell Reports. 2015 Nov 10;5(5):763-775. doi: 10.1016/j.stemcr.2015.09.016.
7
Sharing and Specificity of Co-expression Networks across 35 Human Tissues.35种人体组织中共表达网络的共享性与特异性
PLoS Comput Biol. 2015 May 13;11(5):e1004220. doi: 10.1371/journal.pcbi.1004220. eCollection 2015 May.
8
Robust enumeration of cell subsets from tissue expression profiles.从组织表达谱中可靠地枚举细胞亚群。
Nat Methods. 2015 May;12(5):453-7. doi: 10.1038/nmeth.3337. Epub 2015 Mar 30.
9
Genomic analysis of LPS-stimulated myeloid cells identifies a common pro-inflammatory response but divergent IL-10 anti-inflammatory responses.脂多糖刺激的髓样细胞的基因组分析确定了一种共同的促炎反应,但白细胞介素-10抗炎反应存在差异。
Sci Rep. 2015 Mar 13;5:9100. doi: 10.1038/srep09100.
10
CD3G gene defects in familial autoimmune thyroiditis.家族性自身免疫性甲状腺炎中的CD3G基因缺陷
Scand J Immunol. 2014 Nov;80(5):354-61. doi: 10.1111/sji.12200.