• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HiCImpute:一种用于识别结构零点和增强单细胞 Hi-C 数据的贝叶斯分层模型。

HiCImpute: A Bayesian hierarchical model for identifying structural zeros and enhancing single cell Hi-C data.

机构信息

Interdisciplinary Ph.D. Program in Biostatistics, Ohio State University, Columbus, Ohio, United State of America.

Department of Molecular Medicine, University of Texas Health Science Center, San Antonio, Texas, United State of America.

出版信息

PLoS Comput Biol. 2022 Jun 13;18(6):e1010129. doi: 10.1371/journal.pcbi.1010129. eCollection 2022 Jun.

DOI:10.1371/journal.pcbi.1010129
PMID:35696429
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9232133/
Abstract

Single cell Hi-C techniques enable one to study cell to cell variability in chromatin interactions. However, single cell Hi-C (scHi-C) data suffer severely from sparsity, that is, the existence of excess zeros due to insufficient sequencing depth. Complicating the matter further is the fact that not all zeros are created equal: some are due to loci truly not interacting because of the underlying biological mechanism (structural zeros); others are indeed due to insufficient sequencing depth (sampling zeros or dropouts), especially for loci that interact infrequently. Differentiating between structural zeros and dropouts is important since correct inference would improve downstream analyses such as clustering and discovery of subtypes. Nevertheless, distinguishing between these two types of zeros has received little attention in the single cell Hi-C literature, where the issue of sparsity has been addressed mainly as a data quality improvement problem. To fill this gap, in this paper, we propose HiCImpute, a Bayesian hierarchical model that goes beyond data quality improvement by also identifying observed zeros that are in fact structural zeros. HiCImpute takes spatial dependencies of scHi-C 2D data structure into account while also borrowing information from similar single cells and bulk data, when such are available. Through an extensive set of analyses of synthetic and real data, we demonstrate the ability of HiCImpute for identifying structural zeros with high sensitivity, and for accurate imputation of dropout values. Downstream analyses using data improved from HiCImpute yielded much more accurate clustering of cell types compared to using observed data or data improved by several comparison methods. Most significantly, HiCImpute-improved data have led to the identification of subtypes within each of the excitatory neuronal cells of L4 and L5 in the prefrontal cortex.

摘要

单细胞 Hi-C 技术使人们能够研究染色质相互作用中的细胞间可变性。然而,单细胞 Hi-C(scHi-C)数据严重稀疏,即由于测序深度不足而存在过多的零值。使问题更加复杂的是,并非所有的零值都是平等产生的:有些是由于潜在的生物学机制导致的真实不存在相互作用的区域(结构零值);另一些确实是由于测序深度不足(采样零值或缺失值)造成的,尤其是对于那些很少相互作用的区域。区分结构零值和缺失值很重要,因为正确的推断可以改善下游分析,如聚类和发现亚型。然而,在单细胞 Hi-C 文献中,很少关注区分这两种零值的问题,其中稀疏性问题主要作为数据质量改进问题来解决。为了填补这一空白,在本文中,我们提出了 HiCImpute,这是一种贝叶斯层次模型,通过识别实际上是结构零值的观察零值,超越了数据质量改进。HiCImpute 在考虑 scHi-C 2D 数据结构的空间依赖性的同时,还利用了相似的单细胞和批量数据的信息(如果有的话)。通过对合成和真实数据的广泛分析,我们证明了 HiCImpute 具有高灵敏度识别结构零值的能力,以及准确推断缺失值的能力。使用 HiCImpute 改进后的数据进行下游分析,与使用观察数据或几种比较方法改进后的数据相比,细胞类型的聚类更加准确。最重要的是,HiCImpute 改进后的数据导致了在大脑前额叶皮层 L4 和 L5 的兴奋性神经元细胞中识别出亚型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7886/9232133/f66c39b0b24d/pcbi.1010129.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7886/9232133/119d98dfc8a2/pcbi.1010129.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7886/9232133/02f87f7201dc/pcbi.1010129.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7886/9232133/f6d8f9e5335c/pcbi.1010129.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7886/9232133/f66c39b0b24d/pcbi.1010129.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7886/9232133/119d98dfc8a2/pcbi.1010129.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7886/9232133/02f87f7201dc/pcbi.1010129.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7886/9232133/f6d8f9e5335c/pcbi.1010129.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7886/9232133/f66c39b0b24d/pcbi.1010129.g004.jpg

相似文献

1
HiCImpute: A Bayesian hierarchical model for identifying structural zeros and enhancing single cell Hi-C data.HiCImpute:一种用于识别结构零点和增强单细胞 Hi-C 数据的贝叶斯分层模型。
PLoS Comput Biol. 2022 Jun 13;18(6):e1010129. doi: 10.1371/journal.pcbi.1010129. eCollection 2022 Jun.
2
Are dropout imputation methods for scRNA-seq effective for scHi-C data?单细胞 RNA 测序(scRNA-seq)的缺失值插补方法对 scHi-C 数据有效吗?
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa289.
3
scHi-CSim: a flexible simulator that generates high-fidelity single-cell Hi-C data for benchmarking.scHi-CSim:一种灵活的模拟器,可生成用于基准测试的高保真单细胞 Hi-C 数据。
J Mol Cell Biol. 2023 Jun 1;15(1). doi: 10.1093/jmcb/mjad003.
4
Bayesian Estimation of Three-Dimensional Chromosomal Structure from Single-Cell Hi-C Data.基于单细胞Hi-C数据的三维染色体结构的贝叶斯估计
J Comput Biol. 2019 Nov;26(11):1191-1202. doi: 10.1089/cmb.2019.0100. Epub 2019 Jun 18.
5
A posterior probability based Bayesian method for single-cell RNA-seq data imputation.基于后验概率的贝叶斯单细胞 RNA-seq 数据插补方法。
Methods. 2023 Aug;216:21-38. doi: 10.1016/j.ymeth.2023.06.004. Epub 2023 Jun 12.
6
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
7
Sparsity-Penalized Stacked Denoising Autoencoders for Imputing Single-Cell RNA-Seq Data.基于稀疏惩罚堆叠去噪自动编码器的单细胞 RNA-Seq 数据插补。
Genes (Basel). 2020 May 11;11(5):532. doi: 10.3390/genes11050532.
8
SinCWIm: An imputation method for single-cell RNA sequence dropouts using weighted alternating least squares.SinCWIm:一种基于加权交替最小二乘法的单细胞 RNA 序列缺失数据插补方法。
Comput Biol Med. 2024 Mar;171:108225. doi: 10.1016/j.compbiomed.2024.108225. Epub 2024 Feb 27.
9
CDSImpute: An ensemble similarity imputation method for single-cell RNA sequence dropouts.CDSImpute:一种用于单细胞 RNA 序列缺失的集成相似性插补方法。
Comput Biol Med. 2022 Jul;146:105658. doi: 10.1016/j.compbiomed.2022.105658. Epub 2022 May 21.
10
Subgraph extraction and graph representation learning for single cell Hi-C imputation and clustering.单细胞 Hi-C 插补和聚类的子图提取和图表示学习。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad379.

引用本文的文献

1
Can Random Walking on a Hi-C Contact Matrix Lead to Data Quality Improvement? An Assessment.在Hi-C接触矩阵上进行随机游走能否提高数据质量?一项评估。
bioRxiv. 2025 Jun 17:2025.06.11.659235. doi: 10.1101/2025.06.11.659235.
2
Topologically associating domains of chromatin on single-cell Hi-C data: a survey of bioinformatic tools and applications in the light of artificial intelligence.基于单细胞Hi-C数据的染色质拓扑相关结构域:人工智能视角下生物信息学工具及应用综述
Front Genet. 2025 Jul 1;16:1602234. doi: 10.3389/fgene.2025.1602234. eCollection 2025.
3
Enhancing Single-Cell and Bulk Hi-C Data Using a Generative Transformer Model.

本文引用的文献

1
Are dropout imputation methods for scRNA-seq effective for scHi-C data?单细胞 RNA 测序(scRNA-seq)的缺失值插补方法对 scHi-C 数据有效吗?
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa289.
2
DeepHiC: A generative adversarial network for enhancing Hi-C data resolution.DeepHiC:一种用于提高 Hi-C 数据分辨率的生成对抗网络。
PLoS Comput Biol. 2020 Feb 21;16(2):e1007287. doi: 10.1371/journal.pcbi.1007287. eCollection 2020 Feb.
3
Sci-Hi-C: A single-cell Hi-C method for mapping 3D genome organization in large number of single cells.
使用生成式变压器模型增强单细胞和批量Hi-C数据
Biology (Basel). 2025 Mar 12;14(3):288. doi: 10.3390/biology14030288.
4
Single-Cell Hi-C Technologies and Computational Data Analysis.单细胞Hi-C技术与计算数据分析
Adv Sci (Weinh). 2025 Mar;12(9):e2412232. doi: 10.1002/advs.202412232. Epub 2025 Jan 30.
5
Single-cell omics: experimental workflow, data analyses and applications.单细胞组学:实验工作流程、数据分析及应用
Sci China Life Sci. 2025 Jan;68(1):5-102. doi: 10.1007/s11427-023-2561-0. Epub 2024 Jul 23.
6
[Advances in methods and applications of single-cell Hi-C data analysis].[单细胞Hi-C数据分析的方法与应用进展]
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2023 Oct 25;40(5):1033-1039. doi: 10.7507/1001-5515.202303046.
Sci-Hi-C:一种在大量单细胞中绘制 3D 基因组结构的单细胞 Hi-C 方法。
Methods. 2020 Jan 1;170:61-68. doi: 10.1016/j.ymeth.2019.09.012. Epub 2019 Sep 16.
4
Simultaneous profiling of 3D genome structure and DNA methylation in single human cells.在单个人类细胞中同时分析 3D 基因组结构和 DNA 甲基化。
Nat Methods. 2019 Oct;16(10):999-1006. doi: 10.1038/s41592-019-0547-z. Epub 2019 Sep 9.
5
Conserved cell types with divergent features in human versus mouse cortex.人类与小鼠大脑皮层中具有不同特征的保守细胞类型。
Nature. 2019 Sep;573(7772):61-68. doi: 10.1038/s41586-019-1506-7. Epub 2019 Aug 21.
6
The single-cell sequencing: new developments and medical applications.单细胞测序:新进展与医学应用
Cell Biosci. 2019 Jun 26;9:53. doi: 10.1186/s13578-019-0314-y. eCollection 2019.
7
A Single-Cell Transcriptomic Atlas of Human Neocortical Development during Mid-gestation.人类皮质中期发育的单细胞转录组图谱。
Neuron. 2019 Sep 4;103(5):785-801.e8. doi: 10.1016/j.neuron.2019.06.011. Epub 2019 Jul 11.
8
Robust single-cell Hi-C clustering by convolution- and random-walk-based imputation.基于卷积和随机游走的推断进行稳健的单细胞 Hi-C 聚类。
Proc Natl Acad Sci U S A. 2019 Jul 9;116(28):14011-14018. doi: 10.1073/pnas.1901423116. Epub 2019 Jun 24.
9
Bayesian Estimation of Three-Dimensional Chromosomal Structure from Single-Cell Hi-C Data.基于单细胞Hi-C数据的三维染色体结构的贝叶斯估计
J Comput Biol. 2019 Nov;26(11):1191-1202. doi: 10.1089/cmb.2019.0100. Epub 2019 Jun 18.
10
SCL: a lattice-based approach to infer 3D chromosome structures from single-cell Hi-C data.SCL:一种基于格点的方法,用于从单细胞 Hi-C 数据推断 3D 染色体结构。
Bioinformatics. 2019 Oct 15;35(20):3981-3988. doi: 10.1093/bioinformatics/btz181.