• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CHOP:群体图中的单倍型感知路径索引。

CHOP: haplotype-aware path indexing in population graphs.

机构信息

Delft Bioinformatics Lab, Delft University of Technology, Van Mourik Broekmanweg 6, Delft, 2628 XE, The Netherlands.

Department of Clinical Genetics, VU University Medical Center, Van der Boechorststraat 7, Amsterdam, 1081 BT, The Netherlands.

出版信息

Genome Biol. 2020 Mar 11;21(1):65. doi: 10.1186/s13059-020-01963-y.

DOI:10.1186/s13059-020-01963-y
PMID:32160922
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7066762/
Abstract

The practical use of graph-based reference genomes depends on the ability to align reads to them. Performing substring queries to paths through these graphs lies at the core of this task. The combination of increasing pattern length and encoded variations inevitably leads to a combinatorial explosion of the search space. Instead of heuristic filtering or pruning steps to reduce the complexity, we propose CHOP, a method that constrains the search space by exploiting haplotype information, bounding the search space to the number of haplotypes so that a combinatorial explosion is prevented. We show that CHOP can be applied to large and complex datasets, by applying it on a graph-based representation of the human genome encoding all 80 million variants reported by the 1000 Genomes Project.

摘要

基于图的参考基因组的实际应用取决于将读取内容与它们对齐的能力。执行这些图中的路径的子字符串查询是此任务的核心。模式长度的增加和编码变化的组合不可避免地导致搜索空间的组合爆炸。我们没有采用启发式过滤或剪枝步骤来降低复杂度,而是提出了 CHOP,这是一种通过利用单倍型信息来约束搜索空间的方法,将搜索空间限制在单倍型的数量内,从而防止组合爆炸。我们通过将其应用于基于图形的人类基因组表示,该表示编码了 1000 基因组计划报告的所有 8000 万个变体,证明了 CHOP 可以应用于大型和复杂的数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/0112b55495b6/13059_2020_1963_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/8dc46f637e77/13059_2020_1963_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/2ddb6847d4a5/13059_2020_1963_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/9e428dcf58c5/13059_2020_1963_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/b7d735c973fd/13059_2020_1963_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/877e54b896ca/13059_2020_1963_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/272affe116f2/13059_2020_1963_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/b367abb93ea8/13059_2020_1963_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/dade1b89c683/13059_2020_1963_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/0112b55495b6/13059_2020_1963_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/8dc46f637e77/13059_2020_1963_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/2ddb6847d4a5/13059_2020_1963_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/9e428dcf58c5/13059_2020_1963_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/b7d735c973fd/13059_2020_1963_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/877e54b896ca/13059_2020_1963_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/272affe116f2/13059_2020_1963_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/b367abb93ea8/13059_2020_1963_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/dade1b89c683/13059_2020_1963_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb18/7066762/0112b55495b6/13059_2020_1963_Fig9_HTML.jpg

相似文献

1
CHOP: haplotype-aware path indexing in population graphs.CHOP:群体图中的单倍型感知路径索引。
Genome Biol. 2020 Mar 11;21(1):65. doi: 10.1186/s13059-020-01963-y.
2
Haplotype-aware sequence alignment to pangenome graphs.基于单倍型感知的序列比对到泛基因组图谱。
Genome Res. 2024 Oct 11;34(9):1265-1275. doi: 10.1101/gr.279143.124.
3
DNA sequences alignment method using sparse index on pan-genome graph.基于泛基因组图的稀疏索引的 DNA 序列比对方法。
J Bioinform Comput Biol. 2024 Aug;22(4):2450019. doi: 10.1142/S0219720024500197. Epub 2024 Aug 31.
4
Assessing graph-based read mappers against a baseline approach highlights strengths and weaknesses of current methods.评估基于图的读映射器相对于基线方法,可以突出当前方法的优缺点。
BMC Genomics. 2020 Apr 6;21(1):282. doi: 10.1186/s12864-020-6685-y.
5
Indexing Graphs for Path Queries with Applications in Genome Research.用于路径查询的图索引及其在基因组研究中的应用
IEEE/ACM Trans Comput Biol Bioinform. 2014 Mar-Apr;11(2):375-88. doi: 10.1109/TCBB.2013.2297101.
6
Haplotype-aware graph indexes.单体型感知图索引。
Bioinformatics. 2020 Jan 15;36(2):400-407. doi: 10.1093/bioinformatics/btz575.
7
A graph extension of the positional Burrows-Wheeler transform and its applications.位置布罗算法变换的图形扩展及其应用
Algorithms Mol Biol. 2017 Jul 11;12:18. doi: 10.1186/s13015-017-0109-9. eCollection 2017.
8
A graph-based algorithm for estimating clonal haplotypes of tumor sample from sequencing data.基于图的算法,用于从测序数据估计肿瘤样本的克隆单倍型。
BMC Med Genomics. 2019 Jan 31;12(Suppl 1):27. doi: 10.1186/s12920-018-0457-4.
9
GRAFIMO: Variant and haplotype aware motif scanning on pangenome graphs.基于泛基因组图的变体和单倍型感知 motif 扫描
PLoS Comput Biol. 2021 Sep 27;17(9):e1009444. doi: 10.1371/journal.pcbi.1009444. eCollection 2021 Sep.
10
BrownieAligner: accurate alignment of Illumina sequencing data to de Bruijn graphs.BrownieAligner:Illumina 测序数据到 de Bruijn 图的精确比对。
BMC Bioinformatics. 2018 Sep 4;19(1):311. doi: 10.1186/s12859-018-2319-7.

引用本文的文献

1
Plant graph-based pangenomics: techniques, applications, and challenges.基于植物图谱的泛基因组学:技术、应用与挑战。
aBIOTECH. 2025 Mar 28;6(2):361-376. doi: 10.1007/s42994-025-00206-7. eCollection 2025 Jun.
2
A survey of sequence-to-graph mapping algorithms in the pangenome era.泛基因组时代序列到图谱映射算法综述。
Genome Biol. 2025 May 22;26(1):138. doi: 10.1186/s13059-025-03606-6.
3
Haplotype-aware sequence alignment to pangenome graphs.基于单倍型感知的序列比对到泛基因组图谱。

本文引用的文献

1
Fully-sensitive seed finding in sequence graphs using a hybrid index.使用混合索引在序列图中进行完全敏感的种子发现。
Bioinformatics. 2019 Jul 15;35(14):i81-i89. doi: 10.1093/bioinformatics/btz341.
2
Haplotype-aware graph indexes.单体型感知图索引。
Bioinformatics. 2020 Jan 15;36(2):400-407. doi: 10.1093/bioinformatics/btz575.
3
Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype.基于图的基因组比对和基因分型与 HISAT2 和 HISAT-genotype。
Genome Res. 2024 Oct 11;34(9):1265-1275. doi: 10.1101/gr.279143.124.
4
Co-linear chaining on pangenome graphs.泛基因组图谱上的共线性连锁
Algorithms Mol Biol. 2024 Jan 27;19(1):4. doi: 10.1186/s13015-024-00250-w.
5
From the reference human genome to human pangenome: Premise, promise and challenge.从参考人类基因组到人类泛基因组:前提、前景与挑战。
Front Genet. 2022 Nov 10;13:1042550. doi: 10.3389/fgene.2022.1042550. eCollection 2022.
6
The Human Pangenome Project: a global resource to map genomic diversity.人类泛基因组计划:绘制基因组多样性图谱的全球资源。
Nature. 2022 Apr;604(7906):437-446. doi: 10.1038/s41586-022-04601-8. Epub 2022 Apr 20.
7
A variant selection framework for genome graphs.基因组图的变体选择框架。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i460-i467. doi: 10.1093/bioinformatics/btab302.
8
Reference flow: reducing reference bias using multiple population genomes.参考文献流向:利用多个群体基因组减少参考文献偏差。
Genome Biol. 2021 Jan 4;22(1):8. doi: 10.1186/s13059-020-02229-3.
9
GraphAligner: rapid and versatile sequence-to-graph alignment.GraphAligner:快速且通用的序列到图的对齐方法。
Genome Biol. 2020 Sep 24;21(1):253. doi: 10.1186/s13059-020-02157-2.
10
Pangenome Graphs.泛基因组图谱。
Annu Rev Genomics Hum Genet. 2020 Aug 31;21:139-162. doi: 10.1146/annurev-genom-120219-080406. Epub 2020 May 26.
Nat Biotechnol. 2019 Aug;37(8):907-915. doi: 10.1038/s41587-019-0201-4. Epub 2019 Aug 2.
4
Bit-parallel sequence-to-graph alignment.位并行序列到图的对齐。
Bioinformatics. 2019 Oct 1;35(19):3599-3607. doi: 10.1093/bioinformatics/btz162.
5
FORGe: prioritizing variants for graph genomes.FORGe:对图基因组中的变体进行优先级排序。
Genome Biol. 2018 Dec 17;19(1):220. doi: 10.1186/s13059-018-1595-x.
6
Variation graph toolkit improves read mapping by representing genetic variation in the reference.变异图谱工具包通过表示参考中的遗传变异来提高读映射质量。
Nat Biotechnol. 2018 Oct;36(9):875-879. doi: 10.1038/nbt.4227. Epub 2018 Aug 20.
7
Graphtyper enables population-scale genotyping using pangenome graphs.Graphtyper 可使用泛基因组图谱进行人群规模的基因分型。
Nat Genet. 2017 Nov;49(11):1654-1660. doi: 10.1038/ng.3964. Epub 2017 Sep 25.
8
A graph extension of the positional Burrows-Wheeler transform and its applications.位置布罗算法变换的图形扩展及其应用
Algorithms Mol Biol. 2017 Jul 11;12:18. doi: 10.1186/s13015-017-0109-9. eCollection 2017.
9
A hybrid cloud read aligner based on MinHash and kmer voting that preserves privacy.一种基于 MinHash 和 kmer 投票的混合云读取对齐器,可保护隐私。
Nat Commun. 2017 May 16;8:15311. doi: 10.1038/ncomms15311.
10
Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.对GRCh38和从头单倍体基因组组装的评估证明了参考组装的持久质量。
Genome Res. 2017 May;27(5):849-864. doi: 10.1101/gr.213611.116. Epub 2017 Apr 10.