• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HiCLift:一种用于在基因组组装之间转换染色质相互作用数据的快速高效工具。

HiCLift: a fast and efficient tool for converting chromatin interaction data between genome assemblies.

机构信息

Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, United States.

Robert H. Lurie Comprehensive Cancer Center of Northwestern University, Chicago, IL 60611, United States.

出版信息

Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad389.

DOI:10.1093/bioinformatics/btad389
PMID:37335863
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10313346/
Abstract

MOTIVATION

With the continuous effort to improve the quality of human reference genome and the generation of more and more personal genomes, the conversion of genomic coordinates between genome assemblies is critical in many integrative and comparative studies. While tools have been developed for such task for linear genome signals such as ChIP-Seq, no tool exists to convert genome assemblies for chromatin interaction data, despite the importance of three-dimensional genome organization in gene regulation and disease.

RESULTS

Here, we present HiCLift, a fast and efficient tool that can convert the genomic coordinates of chromatin contacts such as Hi-C and Micro-C from one assembly to another, including the latest T2T-CHM13 genome. Comparing with the strategy of directly remapping raw reads to a different genome, HiCLift runs on average 42 times faster (hours vs. days), while outputs nearly identical contact matrices. More importantly, as HiCLift does not need to remap the raw reads, it can directly convert human patient sample data, where the raw sequencing reads are sometimes hard to acquire or not available.

AVAILABILITY AND IMPLEMENTATION

HiCLift is publicly available at https://github.com/XiaoTaoWang/HiCLift.

摘要

动机

随着不断努力提高人类参考基因组的质量和产生越来越多的个人基因组,在许多综合和比较研究中,基因组组装之间的基因组坐标转换至关重要。虽然已经为线性基因组信号(如 ChIP-Seq)开发了此类任务的工具,但没有用于转换染色质相互作用数据的基因组组装的工具,尽管三维基因组组织在基因调控和疾病中非常重要。

结果

在这里,我们提出了 HiCLift,这是一种快速高效的工具,可以将 Hi-C 和 Micro-C 等染色质接触的基因组坐标从一个组装转换到另一个组装,包括最新的 T2T-CHM13 基因组。与直接将原始读数重新映射到不同基因组的策略相比,HiCLift 的平均运行速度快 42 倍(小时与天相比),同时输出几乎相同的接触矩阵。更重要的是,由于 HiCLift 不需要重新映射原始读数,因此可以直接转换人类患者样本数据,其中原始测序读数有时难以获取或不可用。

可用性和实现

HiCLift 可在 https://github.com/XiaoTaoWang/HiCLift 上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e664/10313346/7a9efebf2357/btad389f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e664/10313346/7a9efebf2357/btad389f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e664/10313346/7a9efebf2357/btad389f1.jpg

相似文献

1
HiCLift: a fast and efficient tool for converting chromatin interaction data between genome assemblies.HiCLift:一种用于在基因组组装之间转换染色质相互作用数据的快速高效工具。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad389.
2
HiCLift: A fast and efficient tool for converting chromatin interaction data between genome assemblies.HiCLift:一种在基因组组装之间转换染色质相互作用数据的快速高效工具。
bioRxiv. 2023 Jan 20:2023.01.17.524475. doi: 10.1101/2023.01.17.524475.
3
FastRemap: a tool for quickly remapping reads between genome assemblies.FastRemap:一种快速在基因组组装之间重新映射读取的工具。
Bioinformatics. 2022 Sep 30;38(19):4633-4635. doi: 10.1093/bioinformatics/btac554.
4
SQUAT: a Sequencing Quality Assessment Tool for data quality assessments of genome assemblies.SQUAT:用于基因组组装数据质量评估的测序质量评估工具。
BMC Genomics. 2019 Apr 18;19(Suppl 9):238. doi: 10.1186/s12864-019-5445-3.
5
ARCS: scaffolding genome drafts with linked reads.ARCS:使用链接读取构建基因组草图。
Bioinformatics. 2018 Mar 1;34(5):725-731. doi: 10.1093/bioinformatics/btx675.
6
CrossMap: a versatile tool for coordinate conversion between genome assemblies.CrossMap:一种用于基因组组装之间坐标转换的通用工具。
Bioinformatics. 2014 Apr 1;30(7):1006-7. doi: 10.1093/bioinformatics/btt730. Epub 2013 Dec 18.
7
ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers.ARKS:基于链接读取子的人类基因组草图染色体级 scaffolding。
BMC Bioinformatics. 2018 Jun 20;19(1):234. doi: 10.1186/s12859-018-2243-x.
8
GAPPadder: a sensitive approach for closing gaps on draft genomes with short sequence reads.GAPPadder:一种使用短序列读长来闭合草图基因组缺口的灵敏方法。
BMC Genomics. 2019 Jun 6;20(Suppl 5):426. doi: 10.1186/s12864-019-5703-4.
9
A spectral algorithm for fast de novo layout of uncorrected long nanopore reads.一种用于快速从头设计未经校正的长纳米孔读段的谱算法。
Bioinformatics. 2017 Oct 15;33(20):3188-3194. doi: 10.1093/bioinformatics/btx370.
10
LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly.LR_Gapcloser:一种基于平铺路径的缺口闭合器,它使用长读长来完成基因组组装。
Gigascience. 2019 Jan 1;8(1):giy157. doi: 10.1093/gigascience/giy157.

引用本文的文献

1
Single-cell transcriptomics of ventral forebrain progenitors identifies Evf2 enhancer lncRNA-enhancer gene guidance through direct RNA binding and RNP recruitment domains.腹侧前脑祖细胞的单细胞转录组学通过直接RNA结合和RNP募集结构域鉴定了Evf2增强子lncRNA-增强子基因导向。
Nat Commun. 2025 Jul 26;16(1):6902. doi: 10.1038/s41467-025-62205-y.
2
Genome-wide chromosome architecture prediction reveals biophysical principles underlying gene structure.全基因组染色体结构预测揭示了基因结构背后的生物物理原理。
Cell Genom. 2024 Dec 11;4(12):100698. doi: 10.1016/j.xgen.2024.100698. Epub 2024 Nov 25.

本文引用的文献

1
Long-range phasing of dynamic, tissue-specific and allele-specific regulatory elements.动态、组织特异性和等位基因特异性调控元件的长程相位。
Nat Genet. 2022 Oct;54(10):1504-1513. doi: 10.1038/s41588-022-01188-8. Epub 2022 Oct 4.
2
EagleC: A deep-learning framework for detecting a full range of structural variations from bulk and single-cell contact maps.EagleC:一种用于从批量和单细胞接触图谱中检测全范围结构变异的深度学习框架。
Sci Adv. 2022 Jun 17;8(24):eabn9215. doi: 10.1126/sciadv.abn9215. Epub 2022 Jun 15.
3
The 4D Nucleome Data Portal as a resource for searching and visualizing curated nucleomics data.
4D 核组学数据门户,用作搜索和可视化已策核组学数据的资源。
Nat Commun. 2022 May 2;13(1):2365. doi: 10.1038/s41467-022-29697-4.
4
The complete sequence of a human genome.人类基因组的完整序列。
Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.
5
Multiplex chromatin interactions with single-molecule precision.多聚体染色质相互作用的单分子精度研究
Nature. 2019 Feb;566(7745):558-562. doi: 10.1038/s41586-019-0949-1. Epub 2019 Feb 18.
6
Higher-Order Inter-chromosomal Hubs Shape 3D Genome Organization in the Nucleus.高级染色体间枢纽塑造细胞核内的三维基因组结构。
Cell. 2018 Jul 26;174(3):744-757.e24. doi: 10.1016/j.cell.2018.05.024. Epub 2018 Jun 7.
7
HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient.HiCRep:使用分层调整相关系数评估 Hi-C 数据的可重复性。
Genome Res. 2017 Nov;27(11):1939-1949. doi: 10.1101/gr.220640.117. Epub 2017 Aug 30.
8
HiChIP: efficient and sensitive analysis of protein-directed genome architecture.HiChIP:蛋白质导向的基因组结构的高效灵敏分析
Nat Methods. 2016 Nov;13(11):919-922. doi: 10.1038/nmeth.3999. Epub 2016 Sep 19.
9
Mapping Nucleosome Resolution Chromosome Folding in Yeast by Micro-C.利用Micro-C技术绘制酵母中核小体分辨率的染色体折叠图谱。
Cell. 2015 Jul 2;162(1):108-19. doi: 10.1016/j.cell.2015.05.048. Epub 2015 Jun 25.
10
CrossMap: a versatile tool for coordinate conversion between genome assemblies.CrossMap:一种用于基因组组装之间坐标转换的通用工具。
Bioinformatics. 2014 Apr 1;30(7):1006-7. doi: 10.1093/bioinformatics/btt730. Epub 2013 Dec 18.