• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

计算泛基因组学:现状、前景与挑战。

Computational pan-genomics: status, promises and challenges.

出版信息

Brief Bioinform. 2018 Jan 1;19(1):118-135. doi: 10.1093/bib/bbw089.

DOI:10.1093/bib/bbw089
PMID:27769991
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5862344/
Abstract

Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different computational methods and paradigms are needed. We will witness the rapid extension of computational pan-genomics, a new sub-area of research in computational biology. In this article, we generalize existing definitions and understand a pan-genome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations as graphs. We outline how this and other challenges from different application domains translate into common computational problems, point out relevant bioinformatics techniques and identify open problems in computer science. With this review, we aim to increase awareness that a joint approach to computational pan-genomics can help address many of the problems currently faced in various domains.

摘要

许多学科,从人类遗传学和肿瘤学到植物育种、微生物学和病毒学,通常都面临着分析数量迅速增加的基因组的挑战。就智人而言,在未来几年内,测序的基因组数量将接近数十万。仅仅扩大现有的生物信息学管道将不足以充分利用这些丰富基因组数据集的全部潜力。相反,需要新颖的、性质不同的计算方法和范例。我们将见证计算泛基因组学的快速扩展,这是计算生物学中的一个新的研究领域。在本文中,我们推广了现有的定义,并将泛基因组理解为任何要联合分析或用作参考的基因组序列集合。我们研究了已经存在的构建和使用泛基因组的方法,讨论了未来技术和方法的潜在好处,并从上述生物学学科的角度回顾了开放的挑战。作为计算范例转变的一个突出例子,我们特别强调了从将参考基因组表示为字符串到表示为图的转变。我们概述了这种转变以及来自不同应用领域的其他挑战如何转化为常见的计算问题,指出相关的生物信息学技术并确定计算机科学中的开放问题。通过这篇综述,我们旨在提高人们的认识,即计算泛基因组学的联合方法可以帮助解决目前在各个领域面临的许多问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18ee/5862344/81bd094f1036/bbw089f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18ee/5862344/3437c173b1ab/bbw089f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18ee/5862344/29994824829c/bbw089f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18ee/5862344/81bd094f1036/bbw089f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18ee/5862344/3437c173b1ab/bbw089f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18ee/5862344/29994824829c/bbw089f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18ee/5862344/81bd094f1036/bbw089f3.jpg

相似文献

1
Computational pan-genomics: status, promises and challenges.计算泛基因组学:现状、前景与挑战。
Brief Bioinform. 2018 Jan 1;19(1):118-135. doi: 10.1093/bib/bbw089.
2
3
PanTools: representation, storage and exploration of pan-genomic data.泛基因组工具:泛基因组数据的表示、存储与探索
Bioinformatics. 2016 Sep 1;32(17):i487-i493. doi: 10.1093/bioinformatics/btw455.
4
Pan-Genome Storage and Analysis Techniques.泛基因组存储与分析技术
Methods Mol Biol. 2018;1704:29-53. doi: 10.1007/978-1-4939-7463-4_2.
5
seq-seq-pan: building a computational pan-genome data structure on whole genome alignment.seq-seq-pan:在全基因组比对的基础上构建计算泛基因组数据结构。
BMC Genomics. 2018 Jan 15;19(1):47. doi: 10.1186/s12864-017-4401-3.
6
Simplitigs as an efficient and scalable representation of de Bruijn graphs.Simplitigs 作为一种高效且可扩展的 de Bruijn 图表示方法。
Genome Biol. 2021 Apr 6;22(1):96. doi: 10.1186/s13059-021-02297-z.
7
Plant pan-genomics: recent advances, new challenges, and roads ahead.植物泛基因组学:最新进展、新挑战和未来展望。
J Genet Genomics. 2022 Sep;49(9):833-846. doi: 10.1016/j.jgg.2022.06.004. Epub 2022 Jun 21.
8
Building pan-genome infrastructures for crop plants and their use in association genetics.构建作物的泛基因组基础设施及其在关联遗传学中的应用。
DNA Res. 2021 Jan 19;28(1). doi: 10.1093/dnares/dsaa030.
9
Pan-Tetris: an interactive visualisation for Pan-genomes.泛基因组的Pan-Tetris交互式可视化工具
BMC Bioinformatics. 2015;16 Suppl 11(Suppl 11):S3. doi: 10.1186/1471-2105-16-S11-S3. Epub 2015 Aug 13.
10
Ten years of pan-genome analyses.泛基因组分析十年
Curr Opin Microbiol. 2015 Feb;23:148-54. doi: 10.1016/j.mib.2014.11.016. Epub 2014 Dec 5.

引用本文的文献

1
Gene-level analysis of core carbohydrate metabolism across the Enterobacteriaceae pan-genome.肠杆菌科泛基因组中核心碳水化合物代谢的基因水平分析。
Commun Biol. 2025 Aug 18;8(1):1241. doi: 10.1038/s42003-025-08640-5.
2
b-move: faster lossless approximate pattern matching in a run-length compressed index.b移动:在游程长度压缩索引中实现更快的无损近似模式匹配。
Algorithms Mol Biol. 2025 Aug 12;20(1):15. doi: 10.1186/s13015-025-00281-x.
3
Phased genome assemblies and pangenome graphs of human populations of Japan and Saudi Arabia.日本和沙特阿拉伯人群的阶段性基因组组装和泛基因组图谱。

本文引用的文献

1
Quasispecies Theory in Virology.病毒学中的准种理论
J Virol. 2002 Jan 1;76(1):463-465. doi: 10.1128/JVI.76.1.463-465.2002.
2
PanTools: representation, storage and exploration of pan-genomic data.泛基因组工具:泛基因组数据的表示、存储与探索
Bioinformatics. 2016 Sep 1;32(17):i487-i493. doi: 10.1093/bioinformatics/btw455.
3
Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown.基于 HISAT、StringTie 和 Ballgown 的 RNA-seq 实验的转录本水平表达分析。
Sci Data. 2025 Aug 12;12(1):1316. doi: 10.1038/s41597-025-05652-y.
4
Structural variation in 1,019 diverse humans based on long-read sequencing.基于长读长测序的1019名不同个体的结构变异
Nature. 2025 Jul 23. doi: 10.1038/s41586-025-09290-7.
5
Exploiting uniqueness: seed-chain-extend alignment on elastic founder graphs.利用独特性:弹性奠基者图上的种子-链-扩展比对
Bioinformatics. 2025 Jul 1;41(Supplement_1):i265-i274. doi: 10.1093/bioinformatics/btaf225.
6
Cryptic variation fuels plant phenotypic change through hierarchical epistasis.隐秘变异通过层级上位性推动植物表型变化。
Nature. 2025 Jul 9. doi: 10.1038/s41586-025-09243-0.
7
Lossless Pangenome Indexing Using Tag Arrays.使用标签数组的无损全基因组索引
bioRxiv. 2025 May 15:2025.05.12.653561. doi: 10.1101/2025.05.12.653561.
8
A survey of sequence-to-graph mapping algorithms in the pangenome era.泛基因组时代序列到图谱映射算法综述。
Genome Biol. 2025 May 22;26(1):138. doi: 10.1186/s13059-025-03606-6.
9
AlfaPang: alignment free algorithm for pangenome graph construction.AlfaPang:用于构建泛基因组图的无比对算法。
Algorithms Mol Biol. 2025 May 15;20(1):7. doi: 10.1186/s13015-025-00277-7.
10
Reference-free identification and pangenome analysis of accessory chromosomes in a major fungal plant pathogen.一种主要真菌植物病原体中附属染色体的无参考鉴定与泛基因组分析
NAR Genom Bioinform. 2025 Apr 2;7(2):lqaf034. doi: 10.1093/nargab/lqaf034. eCollection 2025 Jun.
Nat Protoc. 2016 Sep;11(9):1650-67. doi: 10.1038/nprot.2016.095. Epub 2016 Aug 11.
4
Analysis of protein-coding genetic variation in 60,706 humans.对60706名人类的蛋白质编码基因变异进行分析。
Nature. 2016 Aug 18;536(7616):285-91. doi: 10.1038/nature19057.
5
Genome-wide association analyses identify new risk variants and the genetic architecture of amyotrophic lateral sclerosis.全基因组关联分析确定了肌萎缩侧索硬化症的新风险变异和遗传结构。
Nat Genet. 2016 Sep;48(9):1043-8. doi: 10.1038/ng.3622. Epub 2016 Jul 25.
6
Read mapping on de Bruijn graphs.在德布鲁因图上进行读段映射。
BMC Bioinformatics. 2016 Jun 16;17(1):237. doi: 10.1186/s12859-016-1103-9.
7
Haplotyping germline and cancer genomes with high-throughput linked-read sequencing.利用高通量连锁读长测序对种系和癌症基因组进行单倍型分型
Nat Biotechnol. 2016 Mar;34(3):303-11. doi: 10.1038/nbt.3432. Epub 2016 Feb 1.
8
Assessing the performance of the Oxford Nanopore Technologies MinION.评估牛津纳米孔技术公司的MinION测序仪的性能。
Biomol Detect Quantif. 2015 Mar;3:1-8. doi: 10.1016/j.bdq.2015.02.001.
9
A comprehensive assessment of somatic mutation detection in cancer using whole-genome sequencing.利用全基因组测序对癌症中体细胞突变检测进行的全面评估。
Nat Commun. 2015 Dec 9;6:10001. doi: 10.1038/ncomms10001.
10
Ensembl Genomes 2016: more genomes, more complexity.《Ensembl基因组2016:更多基因组,更多复杂性》
Nucleic Acids Res. 2016 Jan 4;44(D1):D574-80. doi: 10.1093/nar/gkv1209. Epub 2015 Nov 17.