• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重新审视基因组序列图的线性化

Linearization of genome sequence graphs revisited.

作者信息

Lisiecka Anna, Dojer Norbert

机构信息

Institute of Informatics, University of Warsaw, Banacha 2, 02-097 Warsaw, Poland.

出版信息

iScience. 2021 Jun 19;24(7):102755. doi: 10.1016/j.isci.2021.102755. eCollection 2021 Jul 23.

DOI:10.1016/j.isci.2021.102755
PMID:34278263
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8264155/
Abstract

The need to include the genetic variation within a population into a reference genome led to the concept of a genome sequence graph. Nodes of such a graph are labeled with DNA sequences occurring in represented genomes. Due to double-stranded nature of DNA, each node may be oriented in one of two possible ways, resulting in marking one end of the labeling sequence as in-side and the other as out-side. Edges join pairs of sides and reflect adjacency between node sequences in genomes constituting the graph. Linearization of a sequence graph aims at orienting and ordering graph nodes in a way that makes it more efficient for visualization and further analysis, e.g. access and traversal. We propose a new linearization algorithm, called ALIBI - Algorithm for Linearization by Incremental graph BuIlding. The evaluation shows that ALIBI is computationally very efficient and generates high-quality results.

摘要

将群体内的遗传变异纳入参考基因组的需求催生了基因组序列图的概念。这种图的节点用所代表基因组中出现的DNA序列进行标记。由于DNA的双链性质,每个节点可能以两种可能的方式之一进行定向,从而将标记序列的一端标记为内侧,另一端标记为外侧。边连接两侧的对,并反映构成该图的基因组中节点序列之间的邻接关系。序列图的线性化旨在以一种使其更便于可视化和进一步分析(例如访问和遍历)的方式对图节点进行定向和排序。我们提出了一种新的线性化算法,称为ALIBI——通过增量图构建进行线性化的算法。评估表明,ALIBI在计算上非常高效,并能产生高质量的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/8f0fd8e9ab60/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/78a1a38b8d61/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/f6aa9b5f0bdd/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/5bdb166ed444/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/295937c481d8/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/bfcfaca8706d/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/93efaa6f621d/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/d8ef0c980158/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/1a04ab50deb0/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/8f0fd8e9ab60/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/78a1a38b8d61/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/f6aa9b5f0bdd/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/5bdb166ed444/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/295937c481d8/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/bfcfaca8706d/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/93efaa6f621d/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/d8ef0c980158/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/1a04ab50deb0/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91ae/8264155/8f0fd8e9ab60/gr8.jpg

相似文献

1
Linearization of genome sequence graphs revisited.重新审视基因组序列图的线性化
iScience. 2021 Jun 19;24(7):102755. doi: 10.1016/j.isci.2021.102755. eCollection 2021 Jul 23.
2
A Flow Procedure for Linearization of Genome Sequence Graphs.一种基因组序列图线性化的流程
J Comput Biol. 2018 Jul;25(7):664-676. doi: 10.1089/cmb.2017.0248. Epub 2018 May 24.
3
Bit-parallel sequence-to-graph alignment.位并行序列到图的对齐。
Bioinformatics. 2019 Oct 1;35(19):3599-3607. doi: 10.1093/bioinformatics/btz162.
4
Multi-way association extraction and visualization from biological text documents using hyper-graphs: applications to genetic association studies for diseases.使用超图从生物文本文档中进行多方面关联提取和可视化:在疾病的遗传关联研究中的应用。
Artif Intell Med. 2010 Jul;49(3):145-54. doi: 10.1016/j.artmed.2010.03.002. Epub 2010 Apr 9.
5
A new algorithm to find fuzzy Hamilton cycle in a fuzzy network using adjacency matrix and minimum vertex degree.一种使用邻接矩阵和最小顶点度在模糊网络中寻找模糊哈密顿圈的新算法。
Springerplus. 2016 Oct 22;5(1):1854. doi: 10.1186/s40064-016-3473-x. eCollection 2016.
6
Key-Node-Separated Graph Clustering and Layouts for Human Relationship Graph Visualization.用于人际关系图可视化的键节点分隔图聚类与布局
IEEE Comput Graph Appl. 2015 Nov-Dec;35(6):30-40. doi: 10.1109/MCG.2015.115. Epub 2015 Sep 23.
7
FastEtch: A Fast Sketch-Based Assembler for Genomes.FastEtch:一种基于草图的快速基因组装配器。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1091-1106. doi: 10.1109/TCBB.2017.2737999. Epub 2017 Sep 11.
8
Sequence tube maps: making graph genomes intuitive to commuters.序列管图:让图基因组更容易被通勤者理解。
Bioinformatics. 2019 Dec 15;35(24):5318-5320. doi: 10.1093/bioinformatics/btz597.
9
Sequence Alignment on Directed Graphs.有向图上的序列比对
J Comput Biol. 2019 Jan;26(1):53-67. doi: 10.1089/cmb.2017.0264. Epub 2018 Sep 8.
10
GenGraph: a python module for the simple generation and manipulation of genome graphs.GenGraph:一个用于简单生成和操作基因组图的 Python 模块。
BMC Bioinformatics. 2019 Oct 25;20(1):519. doi: 10.1186/s12859-019-3115-8.

本文引用的文献

1
Getting insight into the pan-genome structure with PangTree.利用 PangTree 深入了解泛基因组结构。
BMC Genomics. 2020 Apr 16;21(Suppl 2):274. doi: 10.1186/s12864-020-6610-4.
2
Variation graph toolkit improves read mapping by representing genetic variation in the reference.变异图谱工具包通过表示参考中的遗传变异来提高读映射质量。
Nat Biotechnol. 2018 Oct;36(9):875-879. doi: 10.1038/nbt.4227. Epub 2018 Aug 20.
3
A Flow Procedure for Linearization of Genome Sequence Graphs.一种基因组序列图线性化的流程
J Comput Biol. 2018 Jul;25(7):664-676. doi: 10.1089/cmb.2017.0248. Epub 2018 May 24.
4
Genome graphs and the evolution of genome inference.基因组图谱与基因组推断的演变
Genome Res. 2017 May;27(5):665-676. doi: 10.1101/gr.214155.116. Epub 2017 Mar 30.
5
Computational pan-genomics: status, promises and challenges.计算泛基因组学:现状、前景与挑战。
Brief Bioinform. 2018 Jan 1;19(1):118-135. doi: 10.1093/bib/bbw089.
6
Mapping Bias Overestimates Reference Allele Frequencies at the HLA Genes in the 1000 Genomes Project Phase I Data.映射偏差高估了千人基因组计划一期数据中HLA基因的参考等位基因频率。
G3 (Bethesda). 2015 Mar 17;5(5):931-41. doi: 10.1534/g3.114.015784.
7
Genome alignment with graph data structures: a comparison.基于图数据结构的基因组比对:比较。
BMC Bioinformatics. 2014 Apr 9;15:99. doi: 10.1186/1471-2105-15-99.
8
RSVSim: an R/Bioconductor package for the simulation of structural variations.RSVSim:一个用于模拟结构变异的 R/Bioconductor 包。
Bioinformatics. 2013 Jul 1;29(13):1679-81. doi: 10.1093/bioinformatics/btt198. Epub 2013 Apr 25.
9
Variation analysis and gene annotation of eight MHC haplotypes: the MHC Haplotype Project.八种主要组织相容性复合体单倍型的变异分析与基因注释:主要组织相容性复合体单倍型项目
Immunogenetics. 2008 Jan;60(1):1-18. doi: 10.1007/s00251-007-0262-2. Epub 2008 Jan 10.