• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于持续谱单纯复形的机器学习在细胞分化中的染色体结构分析。

Persistent spectral simplicial complex-based machine learning for chromosomal structural analysis in cellular differentiation.

机构信息

Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing, China 100124.

Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371.

出版信息

Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac168.

DOI:10.1093/bib/bbac168
PMID:35536545
Abstract

The three-dimensional (3D) chromosomal structure plays an essential role in all DNA-templated processes, including gene transcription, DNA replication and other cellular processes. Although developing chromosome conformation capture (3C) methods, such as Hi-C, which can generate chromosomal contact data characterized genome-wide chromosomal structural properties, understanding 3D genomic nature-based on Hi-C data remains lacking. Here, we propose a persistent spectral simplicial complex (PerSpectSC) model to describe Hi-C data for the first time. Specifically, a filtration process is introduced to generate a series of nested simplicial complexes at different scales. For each of these simplicial complexes, its spectral information can be calculated from the corresponding Hodge Laplacian matrix. PerSpectSC model describes the persistence and variation of the spectral information of the nested simplicial complexes during the filtration process. Different from all previous models, our PerSpectSC-based features provide a quantitative global-scale characterization of chromosome structures and topology. Our descriptors can successfully classify cell types and also cellular differentiation stages for all the 24 types of chromosomes simultaneously. In particular, persistent minimum best characterizes cell types and Dim (1) persistent multiplicity best characterizes cellular differentiation. These results demonstrate the great potential of our PerSpectSC-based models in polymeric data analysis.

摘要

三维(3D)染色体结构在所有基于 DNA 的过程中都起着至关重要的作用,包括基因转录、DNA 复制和其他细胞过程。尽管开发了染色体构象捕获(3C)方法,如 Hi-C,可以生成具有全基因组染色体结构特征的染色体接触数据,但基于 Hi-C 数据理解 3D 基因组性质仍然存在不足。在这里,我们首次提出了持久谱单纯复形(PerSpectSC)模型来描述 Hi-C 数据。具体来说,引入了过滤过程来生成一系列嵌套的单纯复形,这些复形在不同的尺度上。对于每个单纯复形,可以从相应的 Hodge Laplacian 矩阵计算其谱信息。PerSpectSC 模型描述了嵌套单纯复形在过滤过程中谱信息的持久性和变化。与之前的所有模型不同,我们基于 PerSpectSC 的特征提供了对染色体结构和拓扑的定量全局特征描述。我们的描述符可以成功地对所有 24 种染色体的细胞类型和细胞分化阶段进行分类。特别是,持久最小特征可以很好地区分细胞类型,而一维持久多重性特征可以很好地区分细胞分化。这些结果表明,我们基于 PerSpectSC 的模型在聚合物数据分析中具有巨大的潜力。

相似文献

1
Persistent spectral simplicial complex-based machine learning for chromosomal structural analysis in cellular differentiation.基于持续谱单纯复形的机器学习在细胞分化中的染色体结构分析。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac168.
2
Hodge theory-based biomolecular data analysis.基于 Hodge 理论的生物分子数据分析。
Sci Rep. 2022 Jun 11;12(1):9699. doi: 10.1038/s41598-022-12877-z.
3
Persistent Homology for RNA Data Analysis.RNA 数据分析中的持久同调。
Methods Mol Biol. 2023;2627:211-229. doi: 10.1007/978-1-0716-2974-1_12.
4
Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction.基于持久谱超图的机器学习(PSH-ML)用于蛋白质-配体结合亲和力预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab127.
5
Forman persistent Ricci curvature (FPRC)-based machine learning models for protein-ligand binding affinity prediction.基于 Forman 持续 Ricci 曲率 (FPRC) 的蛋白质-配体结合亲和力预测机器学习模型。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab136.
6
Persistent Path-Spectral (PPS) Based Machine Learning for Protein-Ligand Binding Affinity Prediction.基于持续路径谱(PPS)的机器学习的蛋白质-配体结合亲和力预测。
J Chem Inf Model. 2023 Feb 13;63(3):1066-1075. doi: 10.1021/acs.jcim.2c01251. Epub 2023 Jan 16.
7
Spectral detection of simplicial communities via Hodge Laplacians.通过霍奇拉普拉斯算子对单纯复形群落进行光谱检测。
Phys Rev E. 2021 Dec;104(6-1):064303. doi: 10.1103/PhysRevE.104.064303.
8
Persistent spectral based ensemble learning (PerSpect-EL) for protein-protein binding affinity prediction.基于持续谱的集成学习(PerSpect-EL)在蛋白质-蛋白质结合亲和力预测中的应用。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac024.
9
Weighted simplicial complexes and their representation power of higher-order network data and topology.加权单纯复形及其对高阶网络数据和拓扑的表示能力。
Phys Rev E. 2022 Sep;106(3-1):034319. doi: 10.1103/PhysRevE.106.034319.
10
GSDB: a database of 3D chromosome and genome structures reconstructed from Hi-C data.GSDB:一个从 Hi-C 数据中重建的 3D 染色体和基因组结构数据库。
BMC Mol Cell Biol. 2020 Aug 5;21(1):60. doi: 10.1186/s12860-020-00304-y.

引用本文的文献

1
Spatially aligned graph transfer learning for characterizing spatial regulatory heterogeneity.用于表征空间调控异质性的空间对齐图转移学习
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf021.
2
Genome structural dynamics: insights from Gaussian network analysis of Hi-C data.基因组结构动力学:Hi-C 数据的高斯网络分析的启示。
Brief Funct Genomics. 2024 Sep 27;23(5):525-537. doi: 10.1093/bfgp/elae014.
3
Deep neural network learning biological condition information refines gene-expression-based cell subtypes.深度学习网络学习生物状态信息可改善基于基因表达的细胞亚型分类。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad512.
4
Predicting drug-protein interactions by preserving the graph information of multi source data.通过保留多源数据的图信息来预测药物-蛋白质相互作用。
BMC Bioinformatics. 2024 Jan 4;25(1):10. doi: 10.1186/s12859-023-05620-6.
5
An application of topological data analysis in predicting sumoylation sites.拓扑数据分析在预测 SUMO 化位点中的应用。
PeerJ. 2023 Oct 12;11:e16204. doi: 10.7717/peerj.16204. eCollection 2023.
6
Machine learning for RNA 2D structure prediction benchmarked on experimental data.基于实验数据的 RNA 2D 结构预测的机器学习基准测试
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad153.
7
Hierarchical cell-type identifier accurately distinguishes immune-cell subtypes enabling precise profiling of tissue microenvironment with single-cell RNA-sequencing.层次化细胞类型标识符能够准确地区分免疫细胞亚型,从而能够通过单细胞 RNA 测序对组织微环境进行精确分析。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad006.