• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

超级基因组的坐标系

Coordinate systems for supergenomes.

作者信息

Gärtner Fabian, Höner Zu Siederdissen Christian, Müller Lydia, Stadler Peter F

机构信息

1Competence Center for Scalable Data Services and Solutions Dresden/Leipzig, Universität Leipzig, Augustusplatz 12, 04107 Leipzig, Germany.

2Bioinformatics Group, Department of Computer Science, Universität Leipzig, Härtelstraße 16-18, 04107 Leipzig, Germany.

出版信息

Algorithms Mol Biol. 2018 Sep 24;13:15. doi: 10.1186/s13015-018-0133-4. eCollection 2018.

DOI:10.1186/s13015-018-0133-4
PMID:30258487
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6151955/
Abstract

BACKGROUND

Genome sequences and genome annotation data have become available at ever increasing rates in response to the rapid progress in sequencing technologies. As a consequence the demand for methods supporting comparative, evolutionary analysis is also growing. In particular, efficient tools to visualize-omics data simultaneously for multiple species are sorely lacking. A first and crucial step in this direction is the construction of a common coordinate system. Since genomes not only differ by rearrangements but also by large insertions, deletions, and duplications, the use of a single reference genome is insufficient, in particular when the number of species becomes large.

RESULTS

The computational problem then becomes to determine an order and orientations of optimal local alignments that are as co-linear as possible with all the genome sequences. We first review the most prominent approaches to model the problem formally and then proceed to showing that it can be phrased as a particular variant of the Betweenness Problem. It is NP hard in general. As exact solutions are beyond reach for the problem sizes of practical interest, we introduce a collection of heuristic simplifiers to resolve ordering conflicts.

CONCLUSION

Benchmarks on real-life data ranging from bacterial to fly genomes demonstrate the feasibility of computing good common coordinate systems.

摘要

背景

随着测序技术的飞速发展,基因组序列和基因组注释数据的获取速度不断加快。因此,对支持比较和进化分析方法的需求也在增加。特别是,非常缺乏能同时为多个物种可视化组学数据的高效工具。朝着这个方向迈出的第一步也是关键一步是构建一个通用坐标系。由于基因组不仅因重排不同,还因大量插入、缺失和重复而不同,使用单个参考基因组是不够的,尤其是当物种数量增多时。

结果

计算问题就变成了确定与所有基因组序列尽可能共线的最优局部比对的顺序和方向。我们首先回顾了正式建模该问题的最突出方法,然后证明它可以表述为中间性问题的一个特定变体。一般来说它是NP难问题。由于对于实际感兴趣的问题规模,精确解难以实现,我们引入了一组启发式简化方法来解决排序冲突。

结论

从细菌到果蝇基因组的实际数据基准测试证明了计算良好通用坐标系的可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/a4d37b63c474/13015_2018_133_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/6a45141ee8b9/13015_2018_133_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/48041422b725/13015_2018_133_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/fe8dcacfdbe6/13015_2018_133_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/d1ce350914db/13015_2018_133_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/d5aee2092da3/13015_2018_133_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/a4d37b63c474/13015_2018_133_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/6a45141ee8b9/13015_2018_133_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/48041422b725/13015_2018_133_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/fe8dcacfdbe6/13015_2018_133_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/d1ce350914db/13015_2018_133_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/d5aee2092da3/13015_2018_133_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/6151955/a4d37b63c474/13015_2018_133_Fig7_HTML.jpg

相似文献

1
Coordinate systems for supergenomes.超级基因组的坐标系
Algorithms Mol Biol. 2018 Sep 24;13:15. doi: 10.1186/s13015-018-0133-4. eCollection 2018.
2
Towards realistic benchmarks for multiple alignments of non-coding sequences.针对非编码序列多重比对的现实基准。
BMC Bioinformatics. 2010 Jan 26;11:54. doi: 10.1186/1471-2105-11-54.
3
A universal genomic coordinate translator for comparative genomics.用于比较基因组学的通用基因组坐标转换器。
BMC Bioinformatics. 2014 Jun 30;15:227. doi: 10.1186/1471-2105-15-227.
4
Pangenome graphs in infectious disease: a comprehensive genetic variation analysis of leveraging Oxford Nanopore long reads.传染病中的泛基因组图谱:利用牛津纳米孔长读长进行的全面遗传变异分析
Front Genet. 2023 Aug 10;14:1225248. doi: 10.3389/fgene.2023.1225248. eCollection 2023.
5
Building a pan-genome reference for a population.构建某一群体的泛基因组参考图谱。
J Comput Biol. 2015 May;22(5):387-401. doi: 10.1089/cmb.2014.0146. Epub 2015 Jan 7.
6
Enhancement of Plant Productivity in the Post-Genomics Era.后基因组时代植物生产力的提高
Curr Genomics. 2016 Aug;17(4):295-6. doi: 10.2174/138920291704160607182507.
7
Graph-based pan-genomes: increased opportunities in plant genomics.基于图的泛基因组:植物基因组学的更多机会。
J Exp Bot. 2023 Jan 1;74(1):24-39. doi: 10.1093/jxb/erac412.
8
Comparative genomics tools applied to bioterrorism defence.应用于生物恐怖主义防御的比较基因组学工具
Brief Bioinform. 2003 Jun;4(2):133-49. doi: 10.1093/bib/4.2.133.
9
10
Heuristics for the inversion median problem.启发式算法在中值反问题中的应用。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S30. doi: 10.1186/1471-2105-11-S1-S30.

引用本文的文献

1
A layout framework for genome-wide multiple sequence alignment graphs.一种用于全基因组多序列比对图的布局框架。
Front Bioinform. 2024 Aug 16;4:1358374. doi: 10.3389/fbinf.2024.1358374. eCollection 2024.
2
Genomic variations and epigenomic landscape of the Medaka Inbred Kiyosu-Karlsruhe (MIKK) panel.日本青鳉近交系 Kiyosu-Karlsruhe(MIKK)panel 的基因组变异和表观基因组景观。
Genome Biol. 2022 Feb 21;23(1):58. doi: 10.1186/s13059-022-02602-4.
3
A tri-tuple coordinate system derived for fast and accurate analysis of the colored de Bruijn graph-based pangenomes.

本文引用的文献

1
A Flow Procedure for Linearization of Genome Sequence Graphs.一种基因组序列图线性化的流程
J Comput Biol. 2018 Jul;25(7):664-676. doi: 10.1089/cmb.2017.0248. Epub 2018 May 24.
2
Characterization of multiple sequence alignment errors using complete-likelihood score and position-shift map.使用完全似然得分和位置偏移图对多序列比对错误进行表征。
BMC Bioinformatics. 2016 Mar 18;17:133. doi: 10.1186/s12859-016-0945-5.
3
Fast turnover of genome transcription across evolutionary time exposes entire non-coding DNA to de novo gene emergence.
一种三坐标坐标系统,用于快速准确地分析基于三色 de Bruijn 图的泛基因组。
BMC Bioinformatics. 2021 May 27;22(1):282. doi: 10.1186/s12859-021-04149-w.
4
Efficient merging of genome profile alignments.基因组图谱比对的高效合并。
Bioinformatics. 2019 Jul 15;35(14):i71-i80. doi: 10.1093/bioinformatics/btz377.
5
Superbubbles revisited.再探超级气泡
Algorithms Mol Biol. 2018 Dec 1;13:16. doi: 10.1186/s13015-018-0134-3. eCollection 2018.
在进化时间跨度内,基因组转录的快速周转使整个非编码DNA暴露于从头基因产生的过程中。
Elife. 2016 Feb 2;5:e09977. doi: 10.7554/eLife.09977.
4
Single-cell genome sequencing: current state of the science.单细胞基因组测序:科学现状。
Nat Rev Genet. 2016 Mar;17(3):175-88. doi: 10.1038/nrg.2015.16. Epub 2016 Jan 25.
5
Moss Phylogeny Reconstruction Using Nucleotide Pangenome of Complete Mitogenome Sequences.利用完整线粒体基因组序列的核苷酸泛基因组进行苔藓系统发育重建。
Biochemistry (Mosc). 2015 Nov;80(11):1522-7. doi: 10.1134/S0006297915110152.
6
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
7
Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species.通过对17个物种转录组的直接比较得出的长链非编码RNA进化原理。
Cell Rep. 2015 May 19;11(7):1110-22. doi: 10.1016/j.celrep.2015.04.023. Epub 2015 May 7.
8
Comparison of splice sites reveals that long noncoding RNAs are evolutionarily well conserved.剪接位点的比较表明,长链非编码RNA在进化上具有良好的保守性。
RNA. 2015 May;21(5):801-12. doi: 10.1261/rna.046342.114. Epub 2015 Mar 23.
9
Building a pan-genome reference for a population.构建某一群体的泛基因组参考图谱。
J Comput Biol. 2015 May;22(5):387-401. doi: 10.1089/cmb.2014.0146. Epub 2015 Jan 7.
10
Comparison of the transcriptional landscapes between human and mouse tissues.人类和小鼠组织之间转录图谱的比较。
Proc Natl Acad Sci U S A. 2014 Dec 2;111(48):17224-9. doi: 10.1073/pnas.1413624111. Epub 2014 Nov 20.